Gene Moth_1261 details


Gene Information

Locus tag: Moth_1261
Symbol:
ID: 3833056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1304850
End bp: 1306490
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 44%
IMG OID: 637829197
Product: AMP-dependent synthetase and ligase
Protein accession: YP_430118
Protein GI: 83590109
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
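
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1304850, 1306490)
print(length)                     # 1641
print(protein_length_aa(length))  # 546
```

Both results match the Gene Length and Protein Length fields of the record.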


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0532703
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000308481
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGGTTT CGGGTTATGG TAAATGGTTC GAGTTGGCTG AAAAGGATTT AAAAACACGC 
AAGTTCAATG GAGTTGAGTA CCGGTATTAC GACCACGGTA CGACTAATTT GTGGGAAGAT
TTTTCCCGTT CTGTCAGCAG GCAACCTGAT AAAACGGCGC TACGTGCAGG AAATAGTTCT
TTAAGTTATA GGGAAATGCA GGAAGCTTCA CGGCGACTGG CATCCGGCTT GTGGAATAAG
TATCAGGTTA AAAAAGGTGA TGTGGTTGCC CTCTTGCTGG TGAATAGTAT CGACTTCTGC
CTTAGCTTTT ATGCAGCAAT GTATCTGGGA GCCATAGCTT TACCCCTGAG TACCAAACTT
AAAGCTACTG AACTTAATTT TATGCTCAAG GATTCGGGGG CTAGGATCTT AATAACTAAC
CCGGAGTGGC TACCCAACGT CTTGCCTTTT ATCAAAGAAA CAAGTATTGA ACAAATAATT
GTTACCGAAC CGATTACCGA TAAAATTAAT ATCAATTTTG GTAACGCTTC CATAATAACC
TTAAAGAATG TTTTTCGCGA AACGGAAATT CCACCGGCGC CTGTCGACGA ACAAGATGGC
GCGGTAATCA TGTATACCTC GGGAACTACC GGTAAACCCA AAGGCGCTTA TCTTACCCAT
TTTAATCTCC TCCAAAGTGT TATCAGCTAT GAGCGCACCC TGCAGTTAAC GGCAGCAGAT
AGTACCCTTA TTGCAGTTCC AATTTTCCAT ATAACAGGTT TAGCTGCTCT CTTTTTGCTT
TTCATGCATA TTGGCGGTAC AGTATATCTG TTACCCTTTT TCAACACCCA AGAAGTCCTC
AATATTTTAA CATGTTATTC TATTACTTTC TTCCATGCCG CTCCCACAGT CTATATCATG
CTCCTCGAAC AAGGTTACAG GCATTATCAA TTACCTGATT TACGTAAGGC AGCCTGTGGC
GGGGGGGCAA TCCCGATAGA AACGATAAAA AAAATTAAGA CATGGATACC CCAACTGGAG
TTTCATACTG TTTACGGCCT AACGGAAACC AGTTCCCCGG CAACCTTATT CCCGGGTGAC
GTAGCCACAA GTCCAAGGAT AGGCACTTCC GGGATACCAA TTCCAGTAGT CGATTGTAAA
GTAATTGACG CTGAAGGGCG GGATATTACC GGTAAAGGGG TTGGCGAGCT TTGTATCCGG
GGACCCGTTG TGACCCAACA ATACTGGAAT AATGATGAAG CTACCACCAG GGCTTTTCAA
GGAGGGTGGT TCAGGACAGG GGATGTAGCC CGGATAGATG GGGATGGTTA TGTTTATATC
ATGGATAGGT TAAAGGACAT GATTAATCGC GGCGGTGAAA AAATTTATTC CCTGGAAGTT
GAAAATGTCA TCTATTCCCA CCCGGGTGTA AAAGAAGTTG CGGTAATTGG TTCCGTGGAT
CCTATTTACG GGGAAGTAGC CAGGGCGGTA GTTGTTCCCA ATAATCATGG TAGTAGCATT
ACAGGGAGAG AGATTCAAGA CTGGGTAAGG GCGAGACTAG CTAAATATAA AGTACCGCAA
TATGTCAATT TTGTTAACGA GTTGCCGAAG AATGCCAATG GCAAAATTGA TAAAAAGCTT
CTCCGGCAAC AGTTTCAATA A
 
Protein sequence
MAVSGYGKWF ELAEKDLKTR KFNGVEYRYY DHGTTNLWED FSRSVSRQPD KTALRAGNSS 
LSYREMQEAS RRLASGLWNK YQVKKGDVVA LLLVNSIDFC LSFYAAMYLG AIALPLSTKL
KATELNFMLK DSGARILITN PEWLPNVLPF IKETSIEQII VTEPITDKIN INFGNASIIT
LKNVFRETEI PPAPVDEQDG AVIMYTSGTT GKPKGAYLTH FNLLQSVISY ERTLQLTAAD
STLIAVPIFH ITGLAALFLL FMHIGGTVYL LPFFNTQEVL NILTCYSITF FHAAPTVYIM
LLEQGYRHYQ LPDLRKAACG GGAIPIETIK KIKTWIPQLE FHTVYGLTET SSPATLFPGD
VATSPRIGTS GIPIPVVDCK VIDAEGRDIT GKGVGELCIR GPVVTQQYWN NDEATTRAFQ
GGWFRTGDVA RIDGDGYVYI MDRLKDMINR GGEKIYSLEV ENVIYSHPGV KEVAVIGSVD
PIYGEVARAV VVPNNHGSSI TGREIQDWVR ARLAKYKVPQ YVNFVNELPK NANGKIDKKL
LRQQFQ
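
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction. It is an illustration only: the helper names are hypothetical, and the example input is just the first line of the gene sequence, so its GC fraction is not expected to equal the whole-gene value reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGCGGTTT CGGGTTATGG TAAATGGTTC GAGTTGGCTG AAAAGGATTT AAAAACACGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applying the same two functions to the full gene sequence gives a value that can be compared against the GC content field.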