Gene Tery_2355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2355 
Symbol 
ID4245003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3640631 
End bp3642940 
Gene Length2310 bp 
Protein Length769 aa 
Translation table11 
GC content43% 
IMG OID638107448 
Producthypothetical protein 
Protein accessionYP_722048 
Protein GI113475987 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00702] uncharacterized domain
[TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.247728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAATC CACAAATCAA ACCCCACTTT CACATCGAAA TTATCGAACC CAAGCACGTT 
TACCTCCTAG GAGAAAATTC CACCGACGCC CTCACCGGAG AATTCTACTG TCAACTCATC
CCCCTATTAG ATGGTCAACA TAGCTTTGAG GAAATCTGTG AAACTTTAGC AGAAATAGCA
TACCCAGAAG ACGTAGAATA TGTTCTTAAC CGCCTCAAAA CTAAAGGTTA TTTAACAGAA
GCAACTCCCG AACTACCCCC ACCTGCCGCT GCATTTTGGT CATTACTGAA TGTCGAACCC
CAATTCGCCT CGGACTGTCT CCAAAAAAAT ACGGTTTATA TCGCCTCAGT GGGAGAAACA
ACCACAGATT TATTAATCGA AAGTTTAACA GCAGTGGGTA TCAAAACTAA ACCTTGGGAA
AACTACCCAC TCACATCAGA CCGCAAGGCA CTATTAGTTG TTCTCACCGA CGACTACCTC
CAACCACAAT TAAAAGAGAT TAATAAAACC GCCCTCTCAG CTAGCCAACC TTGGTTGTTA
GCAAAACCAG TAGGGGGAGT CCTGTGGTTG GGTCCCATAT TTGAACCAGA AAATACAGGA
TGTTGGGAAT GCTTAAGTCA ACGTTTACGG GGACATCGGG AAGTGGAAGC CACAGTTTTG
CGACAAAAAG AAAAATCTGT TGTCTCCAGT GCTAGCAAAA GAGATGTTAC TGGGTGCTTG
CCTCCTCCCC CTGTTTTCCT TCCTTCAACT TTGGCAACGG GAATTAATCT TATAACTACG
GAAGTTGCGA AATGGATAGT CAAAGAAAGT GGGGTAGAAA TGCCAAAGTT TACTACTCTC
GCTGGAAAAG TGGTTACTTT CAACCAAACC GACTTCTCCA CAACAACTCA CATTTTGAGT
AAACGCCCCC AATGTGAAGC TTGTGGAGAT TCCAAGTTAC TCCAAGAAAT GTCATCTTAT
CCTCTCAGTC TCAGCAGTCG CAAAAAGCAT TTTACCACAG ATGGCGGTCA CCGGGCATTC
ACTCCCGACC AAACAACCAA ACGCTACAAA AAATTGATTA GTCCCATTAC AGGAGTTGTC
AGTGCTTTGG TGCGAGTTTC TGACCCCGAA AATCCTTTAA TTCATACTTA CAGTGCTATC
CATAGTTTTG GTGCTGCGAA AAGTCTCGGT GCTTTGCGTC GTTCCCTGCG TCATAAAAGT
GGAGGTAAAG GTAAGAGCGA TCGCCAATCT AAAGCCAGTG GTTTTTGTGA AGCGATCGAA
CGTTATTCGG GAATATTCCA GGGAGACGAA CCTCGGAAAA AGTCAACTTT AGCCAAATTG
GGAAAGCAAG GTATTCATCC AGAAAGATGT CTACATTTTA GTCCGACTCA GTATGCTAAC
CGGGAGGAAT TGAATGCTAA TTCTAAAGTG GCTCATGATT GGATCCCCCA ACCTTTTGAT
GCCAGTCAAG AAATTGAATG GACTCCAGTG TGGTCACTGA CGGAGGAGAC CCACAAATAT
TTGCCTACTG CTTGGTGTTA TTATGCTTAT AAATTGCCCA AAAAGCATAA TTTTTGTGTA
GCTGATTCCA ATGGCAATGC TGCGGGCAAT ACCATAGAAG AAGCAATTTT GCAAGGATTT
ATGGAATTAG TTGAGCGCGA CAGTGTGGCA ATTTGGTGGT ATAATTCTTT GCAGCGTCCG
GGAGTAGATT TAGCAAGTTT TGACGACCCC TATTTATTGG AAGTGCAAGA ATTTTATGAA
CAAAATCAGC GGGAGTTATG GGTATTAGAT TTAACTACAG ATTTAGGGAT ACCTGCTTTT
ACTGCCGTAT CTCGTCGAGT TAATGAAGAA TATGAGCGAG TTATTACTGG TTTTGGAGCG
CATTTTGACC CGAAAATTGC CATTCTGCGA GCAGTAACGG AAGTAAATCA AATTGGGTTG
GGAATGGATC AGCAAGATAT TAGTCAGATG GAGGAAGGGC TCCAGAGATG GATGACCACA
GCAACTTTAG AAAGTCATCC TTATTTAGCT CCCCATCCAG AAATACCTGC TAAGGTTTAT
GGAGATTATC CTAAGCTTTG GAGCGATGAT ATTTATGATG ATGTTTTGAC TTGTGTGAAA
ATTGCTCAAG AGGCAGGAAT GGAAACTTTG GTATTAGATC AGACTCGTCC GGATATTGAG
TTAAAAGTAG TTAAGGTTAT TGTGCCTGGG TTGCGTCATT TTTGGTCGCG CTTTGGCAAG
GGGAGGTTAT ATGATGTTCC GGTGAAAATG GGTTTGTTGT CTGCAGCGTT GCAAGAAGAG
GAGATGAATT CAATGCCGAT GGTGTTTTAA
 
Protein sequence
MSNPQIKPHF HIEIIEPKHV YLLGENSTDA LTGEFYCQLI PLLDGQHSFE EICETLAEIA 
YPEDVEYVLN RLKTKGYLTE ATPELPPPAA AFWSLLNVEP QFASDCLQKN TVYIASVGET
TTDLLIESLT AVGIKTKPWE NYPLTSDRKA LLVVLTDDYL QPQLKEINKT ALSASQPWLL
AKPVGGVLWL GPIFEPENTG CWECLSQRLR GHREVEATVL RQKEKSVVSS ASKRDVTGCL
PPPPVFLPST LATGINLITT EVAKWIVKES GVEMPKFTTL AGKVVTFNQT DFSTTTHILS
KRPQCEACGD SKLLQEMSSY PLSLSSRKKH FTTDGGHRAF TPDQTTKRYK KLISPITGVV
SALVRVSDPE NPLIHTYSAI HSFGAAKSLG ALRRSLRHKS GGKGKSDRQS KASGFCEAIE
RYSGIFQGDE PRKKSTLAKL GKQGIHPERC LHFSPTQYAN REELNANSKV AHDWIPQPFD
ASQEIEWTPV WSLTEETHKY LPTAWCYYAY KLPKKHNFCV ADSNGNAAGN TIEEAILQGF
MELVERDSVA IWWYNSLQRP GVDLASFDDP YLLEVQEFYE QNQRELWVLD LTTDLGIPAF
TAVSRRVNEE YERVITGFGA HFDPKIAILR AVTEVNQIGL GMDQQDISQM EEGLQRWMTT
ATLESHPYLA PHPEIPAKVY GDYPKLWSDD IYDDVLTCVK IAQEAGMETL VLDQTRPDIE
LKVVKVIVPG LRHFWSRFGK GRLYDVPVKM GLLSAALQEE EMNSMPMVF