Gene PHATR_44149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_44149 
Symbol 
ID7203854 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp1122943 
End bp1125265 
Gene Length2323 bp 
Protein Length459 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186474 
Protein GI219113781 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.104122 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCCAT TCCGAATGCT TAAGGATGCT CCCCGGGAAC TAGAAGATGC GTCTTTCGGA 
GTCCAAATTA TTTATTATGA GGACTTTTAC TACGCGCTCG TTTTCCTTAC AGCAGTCTAC
CTGTCCGGAC GTGTCGCCAG CCTTTTACGG ATGCCGGCCT TGACCGGGGA AATTGTTGCT
GGCATTCTCT TGGGACCGCC ACTGGCTAAC TTTGCTCCCA TCCCTCAAGC GTGGGTGCTG
CTCGGAGAAA TCGGGTAAGT GGTGGAAGGG ACAGATAGTT TGATCTTGCA AGTATCCATT
ACCATACAGA CAGGCTAAGG CTCCGGATGG CAATAAGACT TATTTTGACA GTGCTCGAAG
CTGCCATGGA CATTGACGTC AAGAATTTGA AAGTTATTGG AATACGCGGA TTTACAATCG
CTGTAGTTGG ATCGATTTTA CCAATTTCAA TCGGAATTAG CTTGGCCTAC GGCATGGGGT
TTGAAGGTAA GGCAGCAATT GGGGCTGGCG CAACGTTCGG GCCGACATCA ATGGGCATCG
CGCTGAACGT TCTCAAGAAC GGTGGAATTT CATCAACACC CCTTGGTCAG CTGATTGTTT
CGGCCGCAAT TGTAGACGAT ATGCTCGCAT TGGTGATTTT GAGTCAGCTC GAGGCCTTGA
CCGGGGAGAT CACAGTGGTT GGTGTCGTTG TTCCGATTCT TTCGGCCTTC CTTTTTCTCT
TGCTCGGGGG TTATATTGCG CTCTTTGTAT TACCGAATAT CAGAGAGAAA TATTTGATTA
GCAAAATCCC ATCCGAACAT CGCGATGAGG CTCAACTCAT CATACTCTTC GGGATCCTTC
TAGCAATGGT ACCAGCGACG TTTTACGCCC AAGCATCGTA TTTAATGGGA GGATTCGTCA
CTGGTCTTGC CTTTTGTAGC GCAGAGGGAA CCCATCAGCT TTTCATTAGT CAGTTCAAGC
GAGTAACAAC GTGGTGCATG CGAATGTTCT TCGCGGCAAG CATTGGCTTC CAAGTCCCCG
TCAGGGATTT CGGAGACGTC CAGGTACTTT GGCGCGGAAT GGTATTTTTC TCGTCGCTGT
TCGGAAAACT TGCGGTTGGG TTTCTCGTCC CGAATATGCA TGAAAGCCGC AACTTCACCG
GCCCCCATCT TCGCGACTGC TTGATTACGG GTTTCAGCAT GGCGTCGGAA GGAGAGTTCG
CCTTCGTTAT CGCGGTGTTT GCCGTCGATA ACGGTCTAAC CGATCGTGGA TTGTACGCCA
GTGTCGTATT GGCAGTCCTC CTGAGTGAGA TTATTCCTCC GTTCCTTTTG CGATACACGA
TTGCCAAGTA TGAAGTTTCA AAAACAGAAC CAACGACTCG ACATCGTAGC GATGTCGAAT
CAGAGAACCC ATTGGAAACT TTGTCGAATT CTGAGTACAA TCATAGTCAG GGACAATAAG
TCCTACAGCA GCCGTGACTG CTACATCGGT AATGCATACA TCCAGCAAAA CTTCGAGTCG
ACAGCGATTT TGGTTTGGGA ATGAATGATG TTCCAGATGG GGTCCTTCAT CGAACAATTA
TAATCTTGTA CTTTTGTGTG AATAGAACTA TGTGTCCGTG TTACTAGCCT TTTAAGCTGC
AGTTCCAAAC AAAAAGTTCG CTTCAAACCA TGCGGAATCA AAATGGCGAG CGGAACTTCA
AAATAGGAGG AAACTTCGAT AATATGGCGC CTTCCAAAAC TTGTCGTTCT TGTGACAAAC
ATCTTTGCCG TATCGCAGTG TCTTTTTCGT CTCGACAAAA TTAGCCTCGG TGTTAGCTCA
ATCTAGTATC GGATCTTCCT CGTCATCTCC CTTTGACTTT GATGAAAGTC CCAGCCACTG
CGACATTTTC TCAAACTCCG CAAGAGTCGC CACCGTAGTT GTGCCTGCGA TACCACCGAC
GGCAAGAACA AGTCCTCGCA ACCTTGCGGC CATTTTGGAT GCCGCTTCGA ACGGATTTAT
CCCCTTATTG AAAAAACGGG TAGCATTTGC GAGAACACTT TGATCACACA ATTTTTCCTG
CAACTCGGTA ATTCGAAAAT AGGAAGCCTC CACAAAAACC TCACTCAAAA CTTTGTGGTC
CTCCGGAAGC TGCACATAAT TTTGGGTATT CTTGCGCAGA CTCACGGTCG TTCTGCCGTA
GGACAGCAGT TCGACGCGGT TCCTCAGGTA CTGCAGAATA AAGCCGAAAT GTGAAGGATC
GCGGTCAATG AACACTGCTC CGTTGCGTAC AATTTCCTGG TTGGCTTCCG CCCTTGCCAC
GTGATCGGCC AACACCGGAT TTTCGGCCAC GGTCGACCGC AAC
 
Protein sequence
MMPFRMLKDA PRELEDASFG VQIIYYEDFY YALVFLTAVY LSGRVASLLR MPALTGEIVA 
GILLGPPLAN FAPIPQAWVL LGEIGLRLRM AIRLILTVLE AAMDIDVKNL KVIGIRGFTI
AVVGSILPIS IGISLAYGMG FEGKAAIGAG ATFGPTSMGI ALNVLKNGGI SSTPLGQLIV
SAAIVDDMLA LVILSQLEAL TGEITVVGVV VPILSAFLFL LLGGYIALFV LPNIREKYLI
SKIPSEHRDE AQLIILFGIL LAMVPATFYA QASYLMGGFV TGLAFCSAEG THQLFISQFK
RVTTWCMRMF FAASIGFQVP VRDFGDVQVL WRGMVFFSSL FGKLAVGFLV PNMHESRNFT
GPHLRDCLIT GFSMASEGEF AFVIAVFAVD NGLTDRGLYA SVVLAVLLSE IIPPFLLRYT
IAKYEVSKTE PTTRHRSDVE SENPLETLSN SEYNHSQGQ