Gene Tery_3266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3266 
Symbol 
ID4243687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5011083 
End bp5015132 
Gene Length4050 bp 
Protein Length1349 aa 
Translation table11 
GC content42% 
IMG OID638108260 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_722851 
Protein GI113476790 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2831] Hemolysin activation/secretion protein
[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTTTT TTTACAAAAA TCTATTCCTC AGAATTGTTC CGCTATCTTT GACTCTGGGA 
TATTTTGGGG TTGATATTTC ATCGGCGATC GCTCAACTTC AACCAGATAA TACATTGGGA
GAAGAAAATT CTGTCGTCAC ACCGAATATC AATATTAAAG GGATAGAAAG CGATCGCATA
GATGGCGGGG CCCAAAGAGG GGCCAACCTA TTCCACAGTT TTCAAGAATT CAACATTCAA
CAGGGACGAG GGGTTTACTT TAGCAACCCT GATGGAGTTC AAAATATCCT GACTCGGGTC
ACAGGAAACA ATGGCTCTAA CATATTAGGA ACCTTGGGTG TAGATGGAAA AGCTGATTTA
TTTTTCATTA ACCCCAATGG CATAATCTTT GGGCCAGAAG CTAAATTAGA TGTCCAAGGT
TCATTCTATG GAGCTACTGC CGATAGTATC TTATTTAAAA ATGGATTTGA ATTTGCTAGT
TCAGACCCTC AAGCGCCCCC ACTACTGACA GTTAATGTTC CCATTGGGTT GAGGTTGCCC
GAAAACCCTG GTAGTATTTT GGTTGAGGGT CAAGGTCATA ACATAAATTT TTATCTAAAC
CCAAAAGACG GAATAGCAAG ATTAGAAAAA GAGGACAGGA ATCTAGGGTT GGAAGTTTCC
CAAGGCCAAA CCTTAGGGTT AGTTGGTGGT GATATATTTT TGAGGGGAGG AAACCTCACA
TCAATTGGAG GACGTGTTGA ATTAGGAAGT GTTCACAATG GGGTAGTAGG AATCACGAAA
GACCAAACCC TTAGCTATGA TCCAAACCTG AAATTTGGCC AAATTCTAAT ATCTGAAGAG
AGTTCTATAG ATGTAAGTGG AAGTGGAAGA GTTGAGTTTC AAATTCAAGG AGAAGAGATA
AAAGTGCGAG ATGCTTCAAT AATAGTAGGA GAGATTAAAG GCGATGAAGG TGGGGGGGCT
TCAATTCTTA AAGCATCTAA GTCTATTATA ATAGAAGGTA GAATACCTTT TGAGAATAAT
ATTACATCAT CATTCAACGT GTCTTTATTT TCTGATAAGC TCGAAGCAGG AGAAGCTTTA
AACATTAATA AGACTAACAG TGCTATTATT CTTATCACTT ACGGTAAGGG AAATGTCGGG
GAGTTGACGG TAGAAACAGA AAGTTTAACG CTGGATCCTG GAGCGTTTAT TTCTACGACT
ACTTTGGGTC AGGGAAATGG AGGGAATTTA ACAGTGTTTG CTAAAGACAT AGAAATGAAT
GGTGGGGACT TTTTCGGTGG TTTGGTTAGT GTTAGTGTTG ATCAAGGTAA CGGAGGGAAT
TTAACGGTGA AAGCTGCAGA AAGTTTAACG CTGCAAGATG GAGCCATTAT TTCTACGACT
ACTTTGAGTC AGGGAAATGC AGGGAATTTA ACAGTGTCTG CTAGAGATAT ACTCATTGGT
TATTCTACTA TGTTGTCTGC AGAAACTTTG GGTCAAGGTA ACGGAGGGAA TTTAACGGTG
AAAGCTGCAG AAAGTTTAAC GCTGCAAGAT GGAGCTTTTA TTTCTACCAA GACTTCTGGT
CAGGGAAATG CAGGGAATTT AATAGTCTCT GCTAGAGATA TAGAAATAAT AGGCAAATCT
CCTGATGGTA AATCCTTCAG TAGCTTAACC GCAGAAGCAA AAAAAGAATC AACAGGAGCA
GCAGGAACCA TAACCATCAA CACCGAAACC CTCAACCTCC GGAACGAAGC CGAAATTATA
GTCACCTCTG CTACCCAACA AGCTGCAGGT GACTTAATAA TAAACAGCAA CAACATCCGC
CTCGAAAACC AAGCTAGCCT CAAAGCTGAA ACCAAAGCAG GTGAAGGCAG CATTACCATC
AATAATAATA AAGACCTCAT CCTGCGTAAC AACAGCAACA TCACCACTAA TTCTGAAGGC
ACGGCTACTG GAGGTAATAT TACCATTAAT ACGGAAAACC TAGTTGCCCC AGACAACAGC
GACATCTCTG CTAATGCTGT CGAGGCATTT GGAGGCCGAG TCAACATCAC CGCAGCCGGC
ATATTCGGCA CAGAATTCCG TACTGAAGCA ACAGACAAAA GCGACATCAC CGCCACCTCA
GAATTAGGAG CTTCATTCAG TGGAGAAGTC AATATCAACA CCCCCGATGT AGATCCTAGT
GACGGACTGA TAGAATTTGA GGATACCGTC CCAGATATCT CCAACCTCCT CGACCAAAAC
CCTTGCCGAC AAGGACAAAA AAGCAAATTC ATCATTACAG GAAGAGGCGG CTTACCACCA
ACTCTCGAAG AACCCTTGGC TCCCACCTAC GTCTGGCAAG ACTCAGAAAC CTCAGTTACT
CCTCCACCAG CTATCACTCC ATCAGAGCAA GAGTTCATTG AAGCACAAGG TTGGAGCTCT
AATTCTCAAG GTATTGAACT GATCACTGAC CCCGAAACTG CTACTCCCAC CTCCCCTTGG
TTAATACCTC CCAACTGCGA ACAATTAGAC ACCTTTGAAC CCCCACCCTA TCTACTCGCA
AGCAGTAAGG GAAATATTCC CACCTCAACT TCTGAACCTC TAAAAGTTAC TATCAAACAA
TTCAACTTTG AGGGTAACAC AAAATTTAGC AACCAAGAGC TACAACAACA ACTCACACCT
TATCTAAATA AACCCATTAC CTTTGCTCAA CTCATAGCAG CACGCACTGC CATTACAAAA
TATTATACCA AAAACAACTA CATCACATCG GGAGCATTTA TTCCTCCCCA AACTATGGCT
GAAAATGGCA CAGTCACCCT CCGAGTAGTG GAAGGAAAAG TTGGTGAAAT CAATGTCAAT
ATCCAAGGTA GATTAAATGA AAATTATATC AAAAGTCGGT TAGAAAAAGC CACCACGGCT
CCTCTGAATC AAGAAAAGTT GTTGTCTGCC CTACAAATAT TACAAATAGA CCCTTTAGTC
AAAACTCTAT CTGCGGAACT TTCATCCGGA GTCCGCCCAG ATACAAGTAG GTTAGACATA
AGAATAGAAA CAGCTAACCC CTGGCAAATA GAAACTATCA GTAATAATGG CAGGGCACCA
AGTGTGGGAA CATTCAGGCG AGGGGTAGAG GTAGACCATA GAAATGTTAC AGGTATTGGA
GACAGTTTAA GTGCTCTCTA TACTAATACT GATGGCAGCG ATATGGTAGA AGTATCTTAT
GCTATTCCTG TGAACTCCAG TAATGGTCGT ATCGAACTAT TTTATCGCCA CAGAGATAAT
AAGGTGGTAG AATCACCTTT TGAAAGACTT GATATTGAGT CAAATTCAAA TACTTACAAG
TTATCATTTA GTCAACCAAT AATGCAAACA CCCTGGCAAA CTTTAAGTTT GGGTTTATCT
GCAATAAAGC GAGATAGTCA AACTTCTATA TTAGGAGAGA ATTATCCATT ATCTGAGGGT
GCTGATAAAA ATGGGGAAAC GAAGTTATCA ATTTTGCAGC TTTTCCAAGA TTATGAACTA
CGGGGAAAAA ATCAAGTTTT GGCGTTTCAT TCTCAGTTGA ATGTAGGGTT GGGAATATTG
GATGCTACAA AGAATAGTAA TGAACCAGAT GGTCGATTTT TTTATTGGCG GGGTCAAGGA
CAATGGGTGC GTGAGTTAGG CAAAAATACT TTGTTGTTGA TGGGAGCAGA TCTACAGTTA
TCTCCATCTG ATTTGGTGCC TCAGGAAAGG TTTGGTTTAG GGGGTTATCG AAACGTGAGA
GCTTATCGTC AGGATACTCG CTTGACAGAT AATGGAGCGT TAGGAACAGT GGAGTTGCGG
TTGCCCGTGC CCTGGATATC TGGAAAAAAT AGATTATTTC AGGTAGTGCC GTTTATTGAT
GGGGGGGTAG CATGGAATAG TGATAGTAAA GAGGAGGAAG GAAGTAAGGC TTTGGCGGCG
GCCGGGGTGG GGTTGCAGGT AAATTTATGG GAGAAGATAA ATATGCGCTT AGATTATGGA
ATTCCTTTAG TGGATGTGGA CTCGCGAGAT AAAACGGCTC AGGAGGAAGG GTTTTATTTT
TCTTTTTCTA CTACTCCTTT TTCTTTTTGA
 
Protein sequence
MLFFYKNLFL RIVPLSLTLG YFGVDISSAI AQLQPDNTLG EENSVVTPNI NIKGIESDRI 
DGGAQRGANL FHSFQEFNIQ QGRGVYFSNP DGVQNILTRV TGNNGSNILG TLGVDGKADL
FFINPNGIIF GPEAKLDVQG SFYGATADSI LFKNGFEFAS SDPQAPPLLT VNVPIGLRLP
ENPGSILVEG QGHNINFYLN PKDGIARLEK EDRNLGLEVS QGQTLGLVGG DIFLRGGNLT
SIGGRVELGS VHNGVVGITK DQTLSYDPNL KFGQILISEE SSIDVSGSGR VEFQIQGEEI
KVRDASIIVG EIKGDEGGGA SILKASKSII IEGRIPFENN ITSSFNVSLF SDKLEAGEAL
NINKTNSAII LITYGKGNVG ELTVETESLT LDPGAFISTT TLGQGNGGNL TVFAKDIEMN
GGDFFGGLVS VSVDQGNGGN LTVKAAESLT LQDGAIISTT TLSQGNAGNL TVSARDILIG
YSTMLSAETL GQGNGGNLTV KAAESLTLQD GAFISTKTSG QGNAGNLIVS ARDIEIIGKS
PDGKSFSSLT AEAKKESTGA AGTITINTET LNLRNEAEII VTSATQQAAG DLIINSNNIR
LENQASLKAE TKAGEGSITI NNNKDLILRN NSNITTNSEG TATGGNITIN TENLVAPDNS
DISANAVEAF GGRVNITAAG IFGTEFRTEA TDKSDITATS ELGASFSGEV NINTPDVDPS
DGLIEFEDTV PDISNLLDQN PCRQGQKSKF IITGRGGLPP TLEEPLAPTY VWQDSETSVT
PPPAITPSEQ EFIEAQGWSS NSQGIELITD PETATPTSPW LIPPNCEQLD TFEPPPYLLA
SSKGNIPTST SEPLKVTIKQ FNFEGNTKFS NQELQQQLTP YLNKPITFAQ LIAARTAITK
YYTKNNYITS GAFIPPQTMA ENGTVTLRVV EGKVGEINVN IQGRLNENYI KSRLEKATTA
PLNQEKLLSA LQILQIDPLV KTLSAELSSG VRPDTSRLDI RIETANPWQI ETISNNGRAP
SVGTFRRGVE VDHRNVTGIG DSLSALYTNT DGSDMVEVSY AIPVNSSNGR IELFYRHRDN
KVVESPFERL DIESNSNTYK LSFSQPIMQT PWQTLSLGLS AIKRDSQTSI LGENYPLSEG
ADKNGETKLS ILQLFQDYEL RGKNQVLAFH SQLNVGLGIL DATKNSNEPD GRFFYWRGQG
QWVRELGKNT LLLMGADLQL SPSDLVPQER FGLGGYRNVR AYRQDTRLTD NGALGTVELR
LPVPWISGKN RLFQVVPFID GGVAWNSDSK EEEGSKALAA AGVGLQVNLW EKINMRLDYG
IPLVDVDSRD KTAQEEGFYF SFSTTPFSF