Gene P9211_14411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_14411 
Symbol 
ID5730798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1298330 
End bp1300114 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content31% 
IMG OID641285818 
ProductFlp pilus assembly protein TadD 
Protein accessionYP_001551326 
Protein GI159903982 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00033533 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAATGGCT TTGGAGAAAA AAAAATTTCA AATAAAAAGA AATTCCACAA TCAGCTATCA 
GAAGTCCAGA TTAATCAATT AATGACTGGA GCAATAAGGC ATCTTTCTTC CGGTAATTTT
GTTGACGCAG AGGCTTGCTA TATGAAATTA ATCAATGCAG GTTTTAAAGA TCCTAGAGTC
TACTCAAATA TCGGTGCTAT ATATAAACAA AGAAATAATC TTGACAAAGC ATGTTTTTAC
TTAAAAAAAG CAGTCACCTT ATTCCCAGAA TACGCTGATG CTTATTCTAA TTTAGCATTA
ATCCTAAAAG AGAAAGGTGA ATTAAAAGAA GCAGAGTTAT ATGCTAGAAA AGCGATCTCA
TTGAAACCTA ATTTATGTGA TGCATATTTA ATCCTAGGTG GAATTTTACA AGATCAAGGA
GATCTAGATC AAGCAATACT TTGTACAAGA AAGGCAATTA TATTGGACCC TAATTTGGCA
AGTTCACATC TCAACCTTGG TGTATTGCTA AAAGAAAAAG GCAATTTAAA AGAAGCTGAA
GATGCCACTA GAAAAGCAAT ATCATTGCAA TCAAACTTGG CAAACGCACA TCTCAACCTT
GGTGTATTGC TAAAAGAAAA AGGCAATTTA AAAGAAGCTG AAGATGCCAC TAGAAAAGCA
ATATCATTGC AATCAAACTT GGCAAACGCA CATCTTAATC TTGGAATTTT ATTAAAAGAC
CTTGGAAAGC TAAAAGAAGC AGAAAAAACT TTAATTAAAG CAATACAGAT AGACGGCTTG
CTCGCACCAG CGTATTACTC ATTATCAACA TTAACCGATT ATTCAAATGA GAGAGAGTTA
GAAGATAAAC TTTTTAAAAT AAATTTAGAT AATTTAGATA ATTTATCTTT TAAGATAGAT
TTATGTTTTG CAAGATCTAA TATTCTTCAT AAGAAAAATC TTTTTTTAGA AAGCTCTGAA
TATCTCAAAG AAGCCAATAA GATAAAGCTC CAATTATATC CATCTAATGC AAATAAGCAA
ATTAAAAAGT CTATTGAAAT TCTCGATAAA TATAAAGACT ATCAAGAAGA CATACCAGTA
ACAAGTCCAG TTTCGAATTG CATTTTCATT GTAGGAATGC CTCGCTCTGG TTCTACATTA
TTAGAGTCTA TTCTAAGTAT GAAAGAAAAA GTTGTTGACT TAGGAGAAAC AGTTATTTTT
GAACAAGCCT TTTCTGAATG GAATATGCAA GACAAGCAAT CCAAGGGGAA GAGTCTTTTT
GAAATCTATT ATGAAAAAAG GTCTTCTTTT GCAGAAAAGG AATGCATAAC AACAGATAAG
AATTTGTATA ACTATATGTA TTCTGGTTTT ATAGCTAGTC AATTCCCAGC AGCTAAGATT
ATTCATTGTT ATAGGAATCC TCTTGATAAT ATTCTTTCTA TGTATAGAGC AAATTTTGCA
AAAGGAAATT ATTACTCTTC ATCAATATTA GATTGCTCAA AAGTTCTTTC AAATCATATT
TATCTAATGA ATTATTATAA GAAATTATAC CCAGAAAAGA TTTATAGCCT AGATTATGAT
CAATTGGTTA TTTCACCAAA AGATCAAATC AGGTCTCTAA TAAATTGGTT GGGATGGGAA
TGGGAAGAAT TATATTTATC ACCTGACAAA AGTAATAGAG CAGTTAGTAC AGCAAGCAAT
GTGCAAGTAA GGTATCCTAT TAATAATAAA TCTCTTTATG GCTGGAAGAA ATATACAGAG
CTTTTAAAAC CAGCAATAGA TTATTTAGAC CAAAAAATCA TTTAG
 
Protein sequence
MNGFGEKKIS NKKKFHNQLS EVQINQLMTG AIRHLSSGNF VDAEACYMKL INAGFKDPRV 
YSNIGAIYKQ RNNLDKACFY LKKAVTLFPE YADAYSNLAL ILKEKGELKE AELYARKAIS
LKPNLCDAYL ILGGILQDQG DLDQAILCTR KAIILDPNLA SSHLNLGVLL KEKGNLKEAE
DATRKAISLQ SNLANAHLNL GVLLKEKGNL KEAEDATRKA ISLQSNLANA HLNLGILLKD
LGKLKEAEKT LIKAIQIDGL LAPAYYSLST LTDYSNEREL EDKLFKINLD NLDNLSFKID
LCFARSNILH KKNLFLESSE YLKEANKIKL QLYPSNANKQ IKKSIEILDK YKDYQEDIPV
TSPVSNCIFI VGMPRSGSTL LESILSMKEK VVDLGETVIF EQAFSEWNMQ DKQSKGKSLF
EIYYEKRSSF AEKECITTDK NLYNYMYSGF IASQFPAAKI IHCYRNPLDN ILSMYRANFA
KGNYYSSSIL DCSKVLSNHI YLMNYYKKLY PEKIYSLDYD QLVISPKDQI RSLINWLGWE
WEELYLSPDK SNRAVSTASN VQVRYPINNK SLYGWKKYTE LLKPAIDYLD QKII