Gene Tery_2109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2109 
Symbol 
ID4243945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3291907 
End bp3294882 
Gene Length2976 bp 
Protein Length991 aa 
Translation table11 
GC content39% 
IMG OID638107217 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_721818 
Protein GI113475757 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
[COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.911479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.633615 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATTTG ATACTTTAAT AGACCTCCTA CACCATAGAA CCCTAGATCA ACCTAAGCAA 
AAAACCTATA CTTTTCTCAA AGATGGTGAA ACAGAAGCTG ATAGTCTTAC CTATCAAATA
TTAGAACAAC ACGCTAAGGC CATAGCAGCA AATCTTCAGT CCCTAAATGC TAAAGGTGAA
AGAGTCTTGC TGTTGTACCC CCCTGGACTA AAATTGATGG CTGGATTTTT TGGCTGTTTG
TATGGGGGGG CGATCGCTAT ACCAACCTAT CCACCACGCC CTGATCAGTC CCTCTCAAAA
TTAGAAGCAA TTGCAGCAGA TGCTCAAGCA AAGTTGATTC TTACCACTAC ACCTCTATTA
CCTTACTTAA AAGGTCGCTT TGCGGAAAAT CCCATGTTAG CAACAATACA GCTGTTAGAT
ACCGATAATA TTATTGCTCA AAATCTGGAG TCCCATTGGC AAGAACCTAA TATCAACGGT
GATACATTGG CTTTTCTTCA ATACACTTCC GGTTCAACTG GAAAACCTAA AGGTGTAATG
ATCACCCACA AAAATATTCT GCACAATTTA GCTATGGGCT ATGAACAATC TGATATTACA
CCTGAAAGTA TTACAGTGAC TTGGTTACCC TTTAGTCACA ATACAGGGTT ATTAGTGGGA
GTTCTACAAC CTCTCTACGG TAATTTCCCT GTTAAAATTA TGTCGCCATT GGATTTTCTG
CAAAAACCTT TCCGTTGGCT AATGGCTATG TCTCATTACA AAGCTACTCA AAGCCTAGCT
CCTAACTTTG CTTATGACCT AGTTTGTTTT CAAACAACCC CTGAAGAACG GGCAATGCTT
GACCTAAGTA ATTGGGAGTT AGCTCTTAGC GGTGCTGAAC CAATTCGTGC GGAGACATTT
GAGCGGTTTA TCAAGACTTT TAAACCCTAT GGCTTTCGCC CAGAGGCTTT AACTGCAGGC
TATGGGATGG CAGAGAGTGT TGTCGGTATT ACTTTAGGCT TAATAACAGA ACCCCCAGTT
ATTCTGAATG TTGACAAGGC TGAATTTACT AAAAATCGGG TTTTGGTAAC AGTTGACGAG
AATGATAGTA CCCAAAAAAT TGTTAGTTGT GGTCGCGCTA GTTCAGGTGA AAAAATTCTC
ATTGTTAACC CGGAAACTTT AACTGAATGT GCAGATGACC AAGTGGGAGA GATTTGGGTT
TCTAGTCCTA GTGTTGCTCA AGGTTATTGG AGTAGACCCC AAGCAACAGC AGAAACTTTT
CAAAATTACT TAAAAGATAC ACAGGAGGGT CCGTTTCTCA GGACTGGTGA CTTAGGATTT
TTGCTCAATG ATGAATTATT TGTCACTGGT CGTCTCAAAG ATTTAATTAT TATCCGAGGT
AGTAACCATT ATCCCCAGGA TATTGAGTTA ACTGTAGACA GAAGTCATCA AGCTTTACGA
CCTAGTTGTG GTGCAGCATT TTCCGTAGAG TTGGAAAGTG AGGAAAGGTT GGTCATTGTT
CAAGAAGTTC AGGAAAGTTA CTTAGATAAA CTGGATGTAG ATGAGGTTGT TAACGCTATC
CGTCAAGCTG TATCTCAACA GCATCAGTTA CAAGTTTATG GGATATTATT ATTGAAAACA
GGAACTATTC CTAAAACTTC TAGCAACAAG ATTCAGCGTC ATGCTTGCAA AGTAGGTTTT
TTAGAGCAAA GTCTTGATGT TGTTGGCAGT AGTATTTACC AAGAATTTGA TCTGTTGGAA
AAGGAAAAAG AAGCTTTCTT AGATAGAAAA ACACTCTTAG CTACCACACT TGAAAAACGT
CAAGAATTAC TAATATCTCA TCTTCAGATT TTAATCTCAA ATATTCTCCA GGTAAATAAA
TCTCAATTAG ACTGGCAACA ACCTTTAACT AGTATGGGAT TAGATTCTCT GACAGTTGTT
GAGTTGAGCG ATCTTCTCCA AGATAGCTTA GGAACTTCTT TTCCGGCAAC ACTAATTTTT
GAATATCCCA CAGTTGAAGC TCTAGCTAAT TATTTGGCAA AAGAAGTGCT TTCCCCACAA
CCTTCTGCAA ATTATGATAT AGGGTTAGTT GCAGATGGAA ACTTAGGTTC GGGTTTTGCT
ACTCCTGTAA TAGCAATTCA AGCAAAAGGT TCTAAGCCTC CTTTTTTCTG TATCCCTGGG
GGTGTGGGAA CTGCGTTTTA TCTCCATTCT CTAGCCTTTC ATCTCGGTCA AGACCAACCA
TTTTACGGAC TACAAGCACG GGGAATGGAT GATCAAGCAG AACCTCAAAC AAATGTTGAG
GCGATCGCTG CCGACTATAT TGACGCATTA AAAAAGATTC AGCCCACAGG TCCTTATTTT
TTGGGTGGTC ATTCTTTTGG TGGGCAAGTA GCTTTTGAAA TATCCCAACA GTTACAAAAA
CAGGGAGATG AAATTGGTTT ACTTGCTGTT TTTGATATTC CTGCCCCTTT ATTTAGTAGT
TTTATAATTG CGGGTTGGGA TGATACTAAA TATATGAGTG AGGTCGTGAG GTTATTTGAA
TACTTTTTAA AGGAAAATTT AGAGATATCC TATAATGATC TGAAAGTTTT CTCTCCTGAT
GAGCAATTAA ATTATGTAAC TGAGCAACTA GTGAAAAAGC TTCATGTACG TTCTTCTCAG
GCAGCTAAAA GACAAGTGCA TAGTTTTGTC AAAGTTTTAA AAGCTAGTGT TTATGCGATG
GGTCATTATC ACCCAGAGAA ACTTTATCCG ACTCCTATTG CTCTTTTCCG TAGTAGTGAT
GTTCGTACTT GGAATAGCGC CATTGGTTTG ACTGATACTA TTCGCAACCA ACTGCAAAAG
AATTCTTTTT TGGGTTGGGA GCAACTTTCT CAACGTTCTG TAGAACTTGA GACTGTTCCT
GGGGATCATA TTACCATGAT GCTTGAACCC CATGTTCAGG TTTTAGCTTC CAGACTCAAA
ACTTATATTG ATATGCCATC AAAATTAACT ATTTAG
 
Protein sequence
MQFDTLIDLL HHRTLDQPKQ KTYTFLKDGE TEADSLTYQI LEQHAKAIAA NLQSLNAKGE 
RVLLLYPPGL KLMAGFFGCL YGGAIAIPTY PPRPDQSLSK LEAIAADAQA KLILTTTPLL
PYLKGRFAEN PMLATIQLLD TDNIIAQNLE SHWQEPNING DTLAFLQYTS GSTGKPKGVM
ITHKNILHNL AMGYEQSDIT PESITVTWLP FSHNTGLLVG VLQPLYGNFP VKIMSPLDFL
QKPFRWLMAM SHYKATQSLA PNFAYDLVCF QTTPEERAML DLSNWELALS GAEPIRAETF
ERFIKTFKPY GFRPEALTAG YGMAESVVGI TLGLITEPPV ILNVDKAEFT KNRVLVTVDE
NDSTQKIVSC GRASSGEKIL IVNPETLTEC ADDQVGEIWV SSPSVAQGYW SRPQATAETF
QNYLKDTQEG PFLRTGDLGF LLNDELFVTG RLKDLIIIRG SNHYPQDIEL TVDRSHQALR
PSCGAAFSVE LESEERLVIV QEVQESYLDK LDVDEVVNAI RQAVSQQHQL QVYGILLLKT
GTIPKTSSNK IQRHACKVGF LEQSLDVVGS SIYQEFDLLE KEKEAFLDRK TLLATTLEKR
QELLISHLQI LISNILQVNK SQLDWQQPLT SMGLDSLTVV ELSDLLQDSL GTSFPATLIF
EYPTVEALAN YLAKEVLSPQ PSANYDIGLV ADGNLGSGFA TPVIAIQAKG SKPPFFCIPG
GVGTAFYLHS LAFHLGQDQP FYGLQARGMD DQAEPQTNVE AIAADYIDAL KKIQPTGPYF
LGGHSFGGQV AFEISQQLQK QGDEIGLLAV FDIPAPLFSS FIIAGWDDTK YMSEVVRLFE
YFLKENLEIS YNDLKVFSPD EQLNYVTEQL VKKLHVRSSQ AAKRQVHSFV KVLKASVYAM
GHYHPEKLYP TPIALFRSSD VRTWNSAIGL TDTIRNQLQK NSFLGWEQLS QRSVELETVP
GDHITMMLEP HVQVLASRLK TYIDMPSKLT I