Gene Information |
Locus tag | Tery_2109 |
Symbol | |
ID | 4243945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3291907 |
End bp | 3294882 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638107217 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_721818 |
Protein GI | 113475757 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II; [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases |
TIGRFAM ID | |
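The coordinate and length fields in this table can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the protein length excludes the stop codon; neither convention is stated in the record, and the helper names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3291907
end_bp = 3294882
gene_length_bp = 2976
protein_length_aa = 991


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions both comparisons hold for Tery_2109, which is consistent with the gene being unspliced and not a pseudogene.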
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.911479 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.633615 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATTTG ATACTTTAAT AGACCTCCTA CACCATAGAA CCCTAGATCA ACCTAAGCAA AAAACCTATA CTTTTCTCAA AGATGGTGAA ACAGAAGCTG ATAGTCTTAC CTATCAAATA TTAGAACAAC ACGCTAAGGC CATAGCAGCA AATCTTCAGT CCCTAAATGC TAAAGGTGAA AGAGTCTTGC TGTTGTACCC CCCTGGACTA AAATTGATGG CTGGATTTTT TGGCTGTTTG TATGGGGGGG CGATCGCTAT ACCAACCTAT CCACCACGCC CTGATCAGTC CCTCTCAAAA TTAGAAGCAA TTGCAGCAGA TGCTCAAGCA AAGTTGATTC TTACCACTAC ACCTCTATTA CCTTACTTAA AAGGTCGCTT TGCGGAAAAT CCCATGTTAG CAACAATACA GCTGTTAGAT ACCGATAATA TTATTGCTCA AAATCTGGAG TCCCATTGGC AAGAACCTAA TATCAACGGT GATACATTGG CTTTTCTTCA ATACACTTCC GGTTCAACTG GAAAACCTAA AGGTGTAATG ATCACCCACA AAAATATTCT GCACAATTTA GCTATGGGCT ATGAACAATC TGATATTACA CCTGAAAGTA TTACAGTGAC TTGGTTACCC TTTAGTCACA ATACAGGGTT ATTAGTGGGA GTTCTACAAC CTCTCTACGG TAATTTCCCT GTTAAAATTA TGTCGCCATT GGATTTTCTG CAAAAACCTT TCCGTTGGCT AATGGCTATG TCTCATTACA AAGCTACTCA AAGCCTAGCT CCTAACTTTG CTTATGACCT AGTTTGTTTT CAAACAACCC CTGAAGAACG GGCAATGCTT GACCTAAGTA ATTGGGAGTT AGCTCTTAGC GGTGCTGAAC CAATTCGTGC GGAGACATTT GAGCGGTTTA TCAAGACTTT TAAACCCTAT GGCTTTCGCC CAGAGGCTTT AACTGCAGGC TATGGGATGG CAGAGAGTGT TGTCGGTATT ACTTTAGGCT TAATAACAGA ACCCCCAGTT ATTCTGAATG TTGACAAGGC TGAATTTACT AAAAATCGGG TTTTGGTAAC AGTTGACGAG AATGATAGTA CCCAAAAAAT TGTTAGTTGT GGTCGCGCTA GTTCAGGTGA AAAAATTCTC ATTGTTAACC CGGAAACTTT AACTGAATGT GCAGATGACC AAGTGGGAGA GATTTGGGTT TCTAGTCCTA GTGTTGCTCA AGGTTATTGG AGTAGACCCC AAGCAACAGC AGAAACTTTT CAAAATTACT TAAAAGATAC ACAGGAGGGT CCGTTTCTCA GGACTGGTGA CTTAGGATTT TTGCTCAATG ATGAATTATT TGTCACTGGT CGTCTCAAAG ATTTAATTAT TATCCGAGGT AGTAACCATT ATCCCCAGGA TATTGAGTTA ACTGTAGACA GAAGTCATCA AGCTTTACGA CCTAGTTGTG GTGCAGCATT TTCCGTAGAG TTGGAAAGTG AGGAAAGGTT GGTCATTGTT CAAGAAGTTC AGGAAAGTTA CTTAGATAAA CTGGATGTAG ATGAGGTTGT TAACGCTATC CGTCAAGCTG TATCTCAACA GCATCAGTTA CAAGTTTATG GGATATTATT ATTGAAAACA GGAACTATTC CTAAAACTTC TAGCAACAAG ATTCAGCGTC ATGCTTGCAA AGTAGGTTTT TTAGAGCAAA GTCTTGATGT TGTTGGCAGT AGTATTTACC AAGAATTTGA TCTGTTGGAA AAGGAAAAAG AAGCTTTCTT AGATAGAAAA ACACTCTTAG CTACCACACT TGAAAAACGT 
CAAGAATTAC TAATATCTCA TCTTCAGATT TTAATCTCAA ATATTCTCCA GGTAAATAAA TCTCAATTAG ACTGGCAACA ACCTTTAACT AGTATGGGAT TAGATTCTCT GACAGTTGTT GAGTTGAGCG ATCTTCTCCA AGATAGCTTA GGAACTTCTT TTCCGGCAAC ACTAATTTTT GAATATCCCA CAGTTGAAGC TCTAGCTAAT TATTTGGCAA AAGAAGTGCT TTCCCCACAA CCTTCTGCAA ATTATGATAT AGGGTTAGTT GCAGATGGAA ACTTAGGTTC GGGTTTTGCT ACTCCTGTAA TAGCAATTCA AGCAAAAGGT TCTAAGCCTC CTTTTTTCTG TATCCCTGGG GGTGTGGGAA CTGCGTTTTA TCTCCATTCT CTAGCCTTTC ATCTCGGTCA AGACCAACCA TTTTACGGAC TACAAGCACG GGGAATGGAT GATCAAGCAG AACCTCAAAC AAATGTTGAG GCGATCGCTG CCGACTATAT TGACGCATTA AAAAAGATTC AGCCCACAGG TCCTTATTTT TTGGGTGGTC ATTCTTTTGG TGGGCAAGTA GCTTTTGAAA TATCCCAACA GTTACAAAAA CAGGGAGATG AAATTGGTTT ACTTGCTGTT TTTGATATTC CTGCCCCTTT ATTTAGTAGT TTTATAATTG CGGGTTGGGA TGATACTAAA TATATGAGTG AGGTCGTGAG GTTATTTGAA TACTTTTTAA AGGAAAATTT AGAGATATCC TATAATGATC TGAAAGTTTT CTCTCCTGAT GAGCAATTAA ATTATGTAAC TGAGCAACTA GTGAAAAAGC TTCATGTACG TTCTTCTCAG GCAGCTAAAA GACAAGTGCA TAGTTTTGTC AAAGTTTTAA AAGCTAGTGT TTATGCGATG GGTCATTATC ACCCAGAGAA ACTTTATCCG ACTCCTATTG CTCTTTTCCG TAGTAGTGAT GTTCGTACTT GGAATAGCGC CATTGGTTTG ACTGATACTA TTCGCAACCA ACTGCAAAAG AATTCTTTTT TGGGTTGGGA GCAACTTTCT CAACGTTCTG TAGAACTTGA GACTGTTCCT GGGGATCATA TTACCATGAT GCTTGAACCC CATGTTCAGG TTTTAGCTTC CAGACTCAAA ACTTATATTG ATATGCCATC AAAATTAACT ATTTAG
|
Protein sequence | MQFDTLIDLL HHRTLDQPKQ KTYTFLKDGE TEADSLTYQI LEQHAKAIAA NLQSLNAKGE RVLLLYPPGL KLMAGFFGCL YGGAIAIPTY PPRPDQSLSK LEAIAADAQA KLILTTTPLL PYLKGRFAEN PMLATIQLLD TDNIIAQNLE SHWQEPNING DTLAFLQYTS GSTGKPKGVM ITHKNILHNL AMGYEQSDIT PESITVTWLP FSHNTGLLVG VLQPLYGNFP VKIMSPLDFL QKPFRWLMAM SHYKATQSLA PNFAYDLVCF QTTPEERAML DLSNWELALS GAEPIRAETF ERFIKTFKPY GFRPEALTAG YGMAESVVGI TLGLITEPPV ILNVDKAEFT KNRVLVTVDE NDSTQKIVSC GRASSGEKIL IVNPETLTEC ADDQVGEIWV SSPSVAQGYW SRPQATAETF QNYLKDTQEG PFLRTGDLGF LLNDELFVTG RLKDLIIIRG SNHYPQDIEL TVDRSHQALR PSCGAAFSVE LESEERLVIV QEVQESYLDK LDVDEVVNAI RQAVSQQHQL QVYGILLLKT GTIPKTSSNK IQRHACKVGF LEQSLDVVGS SIYQEFDLLE KEKEAFLDRK TLLATTLEKR QELLISHLQI LISNILQVNK SQLDWQQPLT SMGLDSLTVV ELSDLLQDSL GTSFPATLIF EYPTVEALAN YLAKEVLSPQ PSANYDIGLV ADGNLGSGFA TPVIAIQAKG SKPPFFCIPG GVGTAFYLHS LAFHLGQDQP FYGLQARGMD DQAEPQTNVE AIAADYIDAL KKIQPTGPYF LGGHSFGGQV AFEISQQLQK QGDEIGLLAV FDIPAPLFSS FIIAGWDDTK YMSEVVRLFE YFLKENLEIS YNDLKVFSPD EQLNYVTEQL VKKLHVRSSQ AAKRQVHSFV KVLKASVYAM GHYHPEKLYP TPIALFRSSD VRTWNSAIGL TDTIRNQLQK NSFLGWEQLS QRSVELETVP GDHITMMLEP HVQVLASRLK TYIDMPSKLT I
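The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing could be cleaned, its GC fraction computed, and the DNA translated is shown here. It uses the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which start codons are permitted; that is not handled here, and it does not matter for a gene that starts with ATG). The function names are illustrative and not part of any database tool.

```python
# Codon assignments of the standard genetic code, in TCAG order.
# NCBI translation table 11 uses the same codon-to-residue assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGCAATTTG ATACTTTAAT AGACCTCCTA"
prefix_protein = translate(prefix)
prefix_gc = gc_fraction(prefix)
print(prefix_protein, prefix_gc)
```

Applied to the first three blocks of the gene sequence, the translation gives MQFDTLIDLL, matching the first block of the protein sequence. Pasting the full gene sequence into `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked the same way.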