Gene Moth_0503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0503 
Symbol 
ID3831805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp521514 
End bp522992 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content58% 
IMG OID637828437 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_429376 
Protein GI83589367 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACTCT GGCAATTGCC ACAACAGCAC CGGGAGTTCG CTACTCACCC GGCCCTGATC 
TTCCAGGACC GGAGGGTTAC ATACGGCGAG CTGGTCGAAT GGATCGGAGC CTATGCCGGC
ATGTTCCAGG CCATGGGGGT TCAACCCGGG GAAAGGGTAA CCATCTGCGC TCCCAACTGC
CCGGAGTTTA TCTACAGTTA CCTGGGAGCT ATTCAGGCCG GAGCTATTGT TGTGCCTCTG
AACCTGATGC TCACCAGGGA TGAAATTGCC TATATTGTCA AAGATGCCGG CTGCAGCACC
CTGGTAATTC ACCGGGCAAT TGTAGAACGC CTGAACCTTG TGCCCCAGAT GGCCACCGCC
CTGGGGCTTA AGCACCTGGT TGTCCTGGAC GAGACTACGG CAGCGAGGGC CAAGGCAGCA
CCTCCAGCCA CCATGGTGGC GGCGAAGGAA GAAGACATCT GCGTTTTCCT CTACACCTCC
GGCACTACCG GCCGGCCCAA AGGGGCCATG CTCAGCCACC GGAATTTCTT GGCGGATATT
AAGGCCATGG ACGCCGTTTC CAACCTGGGC CCGGAGGATA ACTTCCTCTG CGTTCTGCCT
ATGTTCCACA GCTTTGCCTG GACAACCAGC GTCCTCCTGC CTTTATACCT GGGCAGCACC
ATCACTATCA AAGAGAGCTT CCAGCCTAAG GATACTTTAA AAACCTTATC CGAGGGGGAT
ATCACCGTCT TTTGCGGCGT GCCTTCCATA TATGCCGTCC TCTGGCGCCT GGCGGAGGAG
GGGCAATTTA AATCCCTTAA GTTTGCCATT TCCGGTGGGG CCCCCCTGGC GGCGGAGATC
CAGCGCGGCT TTGAAACTAA ATTTGCCTTC CCCCTGGTGG AGGGTTACGG CCTGTCGGAA
GCCGCACCTG TGGTCTGCCT GAATCCCCTG GACGGTGTCC GCAAACCGGG ATCCATCGGC
ATCCCCCTCC CAGGTATGGA GGTCAGGCTG GTAGACGACG ATGACCGGGA GGTACCCCGC
GGCGAAGTGG GCGAGCTGGT GGTTCGCGGG CCCAATGTCA TGGCCGGCTA TTACAACCAT
CCGGAGGAAA CGGCAGCTGC CCTGCGGGGT GGCTGGCTTC ATACCGGCGA CCTGGCCCGC
CAGGATGAGG ACGGCTATTT CTACATCGTC GACCGCAAGA AGGATCTGAT CATCCTGGGC
GGTTTCAATG TCTACCCCCG GGAAGTGGAG GAAGTCCTCC TGGCCCATCC GGCCGTTCTG
GAGGCGGCGG TAGTCGGCGT AGGCGACCCG GTTAAGGGCG AGACGGTTAA AGCCTATGTA
GTGCTGAAGG AAGGAGAGTC TGCCGACCGG CGCCAGCTGC AAGATTTTTT AAAGGAACAC
CTGGCCCTTT ACAAGATCCC GCGCCTTTTT GAGTTTGTAC CGGAACTCCC CAAGAGCCCA
ACTGGCAAGG TGATGAAGAA ACTGTTGAAA ACCCGTTAA
 
Protein sequence
MLLWQLPQQH REFATHPALI FQDRRVTYGE LVEWIGAYAG MFQAMGVQPG ERVTICAPNC 
PEFIYSYLGA IQAGAIVVPL NLMLTRDEIA YIVKDAGCST LVIHRAIVER LNLVPQMATA
LGLKHLVVLD ETTAARAKAA PPATMVAAKE EDICVFLYTS GTTGRPKGAM LSHRNFLADI
KAMDAVSNLG PEDNFLCVLP MFHSFAWTTS VLLPLYLGST ITIKESFQPK DTLKTLSEGD
ITVFCGVPSI YAVLWRLAEE GQFKSLKFAI SGGAPLAAEI QRGFETKFAF PLVEGYGLSE
AAPVVCLNPL DGVRKPGSIG IPLPGMEVRL VDDDDREVPR GEVGELVVRG PNVMAGYYNH
PEETAAALRG GWLHTGDLAR QDEDGYFYIV DRKKDLIILG GFNVYPREVE EVLLAHPAVL
EAAVVGVGDP VKGETVKAYV VLKEGESADR RQLQDFLKEH LALYKIPRLF EFVPELPKSP
TGKVMKKLLK TR