Gene Tery_0379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0379 
Symbol 
ID4241613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp585492 
End bp588713 
Gene Length3222 bp 
Protein Length1073 aa 
Translation table11 
GC content45% 
IMG OID638105706 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_720320 
Protein GI113474259 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0161593 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAAAT TTCGATGTCA GCCTAAATTA AGTGCTTTTG TTGCTTTGGC CTTTTCTTTG 
TCGCCAATTG TTGCGACTGC CCAAATTGTT CCTGATGACA CTTTGGGTAA GGAGAGTTCG
GTGGTGGTTC CTGATAATAT TAAAGGTATT CCCAGTGAAC GTCTGGAAGG AGGGGCAATT
CGTGATGGTA ATCTGTTCCA CAGTTTCGGA GAATTTAATG TTGGGGAGGG ACAAGGAGCT
TATTTTGCTA ATCCTGCTTT GATTGAGAAT ATATTCACCA GGGTGACTGG GACTAATGTG
TCTAACCTTC TGGGAACTTT GGGAGTGTTG GGTAATGCTA ACCTATTTCT GATTAATCCC
AACGGCGTTA TTTTTGGTCC TAATGCTAGC TTAGATTTGC AGGGGTCTTT TGTAGTTAGT
AGCGCTGAGA GTTTTGTTTT CAACAATTTT GAGTTTAGTG CCTCTAATCC CCAAGCACCA
CCTTTATTGA CGATAAATAT TCCCCTGGGT TTGCGGTTTC AAGAGAATCC GGGACCGATA
AGTGCTAGGG CTACTAGTGA GCTTAAGGTA GGACCAAGGC AAACTTTGGG GCTGTTGGGG
GGTGAAGTTA GGTTGGATGG TACTGTCATT GAATCTCCGG GAACTCAAGT AGAGTTGGGG
AGTCTCGCTG TTCCTGGGGT AGTGGAACTA AATGAGGATT TAAGTTTCAT TTTTCCTGAA
GGGGCGGCGC GAGGGGATAT ATTTTTAGGG AATGAGGCGG TAATTAATGT CCGTAGTGGA
GGGGGTGGAA GTATAACTAT CAAGGCCCAG AATGTAGATG TTGTGGGCGG AAGTGAACTT
CTGGCAGGAA TAAGTGAAGG GTTAGGCTCC CCTGAAGCTC AGGGAGGTGA TATTGAGATT
AATGCTACTG GTAGAATTAA TTTAGCTGAT AACAGCTTGA TTGATAATGA GGTCGAAGAA
AATGCTGTGG GCAACAGTGG TAATATTAAA ATTAAGACTG GTGCACTTTC AGTTAATAAT
GGCTCAATTA TAGGTACTAC CATCTCCGGA GAAGGAACTG CTGGCAGTGT GACTATTGAA
GCGAGTGACG CTGTAGTCTT GGATGGAGTA AGAGACTTAG ACATTTTAAC TGGAGTAAGT
AGCAGTGTGG AAGAAGGAGC AGAAGGGAAT GGAGGGGTTA TCACTATTAC CACCGGCTCT
CTGGAAGTTA AAAATGGTGC TGAGATACTT TCCATTAACT GGGGTAAAGG AAATGCAGGG
ACTGTGAATA TTAATGCTAC TGAGACTGTC GTCTTGGATG GAGGAAACTC AGAATTTTCC
ACTGGAATAG CTACCAGTCT GGAAGAAGGA GCAGAAGGGA AAGCAGGGGT TGTCACTATT
ACCACAGGCT CTCTGGAAGT TAAAAATGGT GCTATAATAG ATTCCCTTAC TTTGGGTAAA
GGAGACGCAG GGGCTGTGAA TATTAATGCT ACTGAGACTG TCGTCTTGGA TGGAGAAAAC
TCAGAATTTG CTACTGGAGT AAATAGCGAT GCGGATCCAG GATCAGAAGG GAATGCAGGG
GGTGTTACTA TTACCACAGG CTCTCTGGAA GTTAAAAATG GTGCTGTGAT AGGTGCCGTT
ACCTTTAGTA AAGGAGATGC AGGGACTGTG AATATTAATG CTACTGAGAC TGTCGTCTTG
GATGGAGGAG ACTCAGAAAT TTTAACTGGA GTAAATAGCA GTGTAGCACC AGGAGCAGAA
GGGAATGCAG GGGGTGTTAC TATTACCACC GGCTCTCTGG AAGTTAAAAA TGGTGCTGTG
ATAGGTGCCG TTACCTTTAG TAAAGGAGAT GCAGGGACTG TGAATATTAA TGCTACTGAG
ACTGTCGTCT TGGATAGAGG AGACTCAGAA TTTTTAACTG TAGTAGCTAG CAGTGTAGCA
CCAGGAGCAG AAGGGAAGGC AGGGGGTGTT ACTATTACCA CCGGCTCTCT GGAAGTTAAA
AATGGTGCAG AGATAGAAGC CAGTACCTTT AGTAAAGGAG ATGCAGGGAC TGTGAATATT
AACGCTACTG AGACTGTAGT CTTGGATGGA GAAAACTCAG AAATTTTGAC TGGAGTAACT
AGCAGTGTGG AAGAAGGAGC AGAAGGGAAG GCAGGGGGTG TTACGGTTAC CACCGGCTCT
CTGGAAGTTA AAAATGGTGC TCAGCTACTT GCCGTTACCT TTGGTAAAGG AGATGCAGGG
ACTGTGAATA TTAATGCTAC TGAGACTGTA ATCTTGGATG GAGGAGACTC AGAAATTTTA
ACTGTAGTGG CTAGCGGTGT AGCACAAGGA GCAGAAGGGA ATGCAGGGGG TGTTACTATT
ACTACCGGCT CTCTGGAAGT TAAAAATGGT GCTGGGATAG CTGCCGATAC TCTTCCTGAG
TCCACAGGGA ATGGTAGCGA CATCTCCATT GACGCCACCA AAGTTACTAT CACCAATAGC
TCTGAAATAG TTGTAAGCAG CGCAGGACAG GGCAACAGTG GTAATATATT CCTCAGTGCT
GGGGACCTCT TCCTTGATCG TGGTTCTAAA GTCTCTGCCA TTACTGCTAG TGGCCAAGGA
GGTAACATGA CTTTCAATAT TGCCAATCTC TTCAGCTTGC GAAACAACAG TCCCATCTCA
ACTACGGCAG GAGGGACAGG TAATGGTGGA AACATCAACC TATCGACTGA ATTTTTCCTC
GCCAGAGACG ACAGCGACAT CACTGCCAAT GCTTTTGAAG GCAACGGTGG CAATATATCG
ATTGCAACTC AAGGTATTTT TAGCTTCCCT AATAGCACCA TTGATGCTAG CTCACAACTA
GGAATTGATG GAGTTATCGA AATCAATACC CCTGATATTG ATCCTAGTCA GGGATTAATT
AAGTTGTCAG AAAACGTGGT AGATCCTGAC CAATTGATTG CCCAAAATCC ATGCCTACAG
GGAGAAGAAA GCGAATATAT CATCACGGGG CGAGGGGGTT TGTCTCCCAG TCCTGCTCAA
ACCCTAAACA GCGACCCCTT AGAAGTGGGA TTAGTGGAAG CAGCCACAGG GACGGCAGCC
ACTGTTACCC CACCGCCACC ACCGACTTCT ACTGCCCAAA TTGTTCCCGC CAAAGGGTGG
AAACTTAATG AAAAAGGTGA GGTGATCTTA GTGAGTTACG ACCCCACTAA TACCCAGGTT
CAAAAGCACA GAGCTAATCC TGCTACCTGT CAGCCTCGCT GA
 
Protein sequence
MDKFRCQPKL SAFVALAFSL SPIVATAQIV PDDTLGKESS VVVPDNIKGI PSERLEGGAI 
RDGNLFHSFG EFNVGEGQGA YFANPALIEN IFTRVTGTNV SNLLGTLGVL GNANLFLINP
NGVIFGPNAS LDLQGSFVVS SAESFVFNNF EFSASNPQAP PLLTINIPLG LRFQENPGPI
SARATSELKV GPRQTLGLLG GEVRLDGTVI ESPGTQVELG SLAVPGVVEL NEDLSFIFPE
GAARGDIFLG NEAVINVRSG GGGSITIKAQ NVDVVGGSEL LAGISEGLGS PEAQGGDIEI
NATGRINLAD NSLIDNEVEE NAVGNSGNIK IKTGALSVNN GSIIGTTISG EGTAGSVTIE
ASDAVVLDGV RDLDILTGVS SSVEEGAEGN GGVITITTGS LEVKNGAEIL SINWGKGNAG
TVNINATETV VLDGGNSEFS TGIATSLEEG AEGKAGVVTI TTGSLEVKNG AIIDSLTLGK
GDAGAVNINA TETVVLDGEN SEFATGVNSD ADPGSEGNAG GVTITTGSLE VKNGAVIGAV
TFSKGDAGTV NINATETVVL DGGDSEILTG VNSSVAPGAE GNAGGVTITT GSLEVKNGAV
IGAVTFSKGD AGTVNINATE TVVLDRGDSE FLTVVASSVA PGAEGKAGGV TITTGSLEVK
NGAEIEASTF SKGDAGTVNI NATETVVLDG ENSEILTGVT SSVEEGAEGK AGGVTVTTGS
LEVKNGAQLL AVTFGKGDAG TVNINATETV ILDGGDSEIL TVVASGVAQG AEGNAGGVTI
TTGSLEVKNG AGIAADTLPE STGNGSDISI DATKVTITNS SEIVVSSAGQ GNSGNIFLSA
GDLFLDRGSK VSAITASGQG GNMTFNIANL FSLRNNSPIS TTAGGTGNGG NINLSTEFFL
ARDDSDITAN AFEGNGGNIS IATQGIFSFP NSTIDASSQL GIDGVIEINT PDIDPSQGLI
KLSENVVDPD QLIAQNPCLQ GEESEYIITG RGGLSPSPAQ TLNSDPLEVG LVEAATGTAA
TVTPPPPPTS TAQIVPAKGW KLNEKGEVIL VSYDPTNTQV QKHRANPATC QPR