Gene Moth_1980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1980 
Symbol 
ID3831162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2063304 
End bp2064902 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content57% 
IMG OID637829911 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_430821 
Protein GI83590812 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0106234 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTTCG CCTCGGTAAA AGAGCTGGTA ATAACCAGGG GCACCCAGTA CCGCGGGAGG 
ATATTTCTTT CCTCACCGGA AGATGGCGTG GATTTAACTT ATGACGCCTA CCTGCTGGCC
GTCAGGAGAC TGGAAAAAGC ACTGCTGGCG TTAGGTATGC GCAAGGGAGA AAGGGTGGCC
CTTCTCATGG CGAACGGCCT GAATTACGCC GTCACCTTTA CCGGGGTGAT GGCCTCCGGA
GGCGTAGTCG TACCCATCAA CCCGCATTTA AAACCGGCAG AGGTGACCCG GCTCCTGGGA
GATGCCGGGA CCAGCCTGGT TTTAACTGAC GACGGATGGT ACAGAGTATT TTACCCCCTC
CTGAAGGGGT TACCTGTTCG CCGCTTGGAC CTGGGGGTGC AGGGCGGCAG GTTGCTGGCC
CTGGAGCTGG CATCCGGGAG TAAGGGGGAT GACAGAGCAG TTGAGGCGTC CCCTCTTGGC
AGGAACGACT TAGCCCTCCT CCTGTACACC TCCGGTACTA CCGGAAAGCC TAAAGGGGTG
ATGCTAACCC ACGGTAATTT GCTGGCCGAG GCGAGGTATA TCCAGAAAGG ACACCGGTTA
ACGCCGGAAG ATACTGCCCT GTGTATCCTG CCCCTGTATC ATATAAATGG GGAAGTTGTG
ACCCTGATCA CCCCCATCTT TTCCGGCGGG CGGGTAGTGA TGCCCCATAA ATTCAGGGCC
AGCAGGTTCT GGGACTGGGT CCGGAACTAC CGGGTTACAT GGTTCAGCGC CGTTCCCACC
ATCCTGTCCA TCCTTCTTTC CCATCCTCTG CCGGATAGAT CGGCCCTCTC TTCTTTGCGT
TTTGCGCGCT CCGCCTCGGC ACCTTTACCG GTAGCCGTCC TGCGGGAATT TGAAGCCCGG
TTCGCCGTCC CTGTTATCGA GGCTTACGGC CTGTCGGAAA CCGCCAGCCA GGTAACCACC
AATCCCCTGC CCCCGGCGGT GAGAAAGCCG GGTTCCGTGG GGCTGCCTGT TGGCAATCAG
GTACGGGTGG TGAACGAAAA TGGAGAGACC GTACCTGCCG GTGTCACCGG CGAAGTCGTA
GTTCGCGGGG AAAATGTCTG CCGGGGTTAC TTTCATAATG AAGAGGCTAC TGCCGCTTCT
TTCAAAGGAG GCTGGTTTTA TACCGGCGAC CTTGGCTACC TTGATGCCGA TGGGTACCTG
TTCCTTACCG GACGGCGCAA AGAACTTATC AACCGGGGTG GGGAGAAGTT TTCTCCCCGG
GAGATCGACG AGATCTTATA CCGTTTACCC GAAGTAGAAT TAGCGGCAGC AGTAGGTGTC
CCCGATCCCC TCTACGGTGA AGAGGTGGTA GCCTTCATCC AACTGCGCCC GGGAAAAAGC
CTGGCGGAAG ATCGGGTAAT ATCCTTCTTA AGAGATTACC TGGCGGATTT TAAGGTCCCC
CGGGAGGTCA TCTTTATCCG GGATTTTCCC CGGGGGCCGA GCGGAAAGAT TCAGCGCCTG
AAGCTGGTGG ACCTGTATCT TAAAAAATTC CAGGGAGCCG CCCATGGGGC TGGGGCTGGC
ACCCGCCCCA TAAATGGTGA GGAGGTTGCT AAAAGATGA
 
Protein sequence
MEFASVKELV ITRGTQYRGR IFLSSPEDGV DLTYDAYLLA VRRLEKALLA LGMRKGERVA 
LLMANGLNYA VTFTGVMASG GVVVPINPHL KPAEVTRLLG DAGTSLVLTD DGWYRVFYPL
LKGLPVRRLD LGVQGGRLLA LELASGSKGD DRAVEASPLG RNDLALLLYT SGTTGKPKGV
MLTHGNLLAE ARYIQKGHRL TPEDTALCIL PLYHINGEVV TLITPIFSGG RVVMPHKFRA
SRFWDWVRNY RVTWFSAVPT ILSILLSHPL PDRSALSSLR FARSASAPLP VAVLREFEAR
FAVPVIEAYG LSETASQVTT NPLPPAVRKP GSVGLPVGNQ VRVVNENGET VPAGVTGEVV
VRGENVCRGY FHNEEATAAS FKGGWFYTGD LGYLDADGYL FLTGRRKELI NRGGEKFSPR
EIDEILYRLP EVELAAAVGV PDPLYGEEVV AFIQLRPGKS LAEDRVISFL RDYLADFKVP
REVIFIRDFP RGPSGKIQRL KLVDLYLKKF QGAAHGAGAG TRPINGEEVA KR