Gene Moth_0710 details


Gene Information

Locus tag: Moth_0710
Symbol:
ID: 3832711
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 742119
End bp: 743303
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 64%
IMG OID: 637828641
Product: peptidase
Protein accession: YP_429571
Protein GI: 83589562
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID: [TIGR03526] putative selenium metabolism hydrolase
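
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; the record itself does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 742119
END_BP = 743303


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1185, matching "Gene Length"
print(protein_length(length_bp))  # 394, matching "Protein Length"
```

Under these assumptions both computed values agree with the reported gene and protein lengths.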


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00268178
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0281992
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGACAG AAGCCAGGCG CCGTGAGGTG CTGGAATTGG CGCGGCAGTT GCTGAGGATA 
CCCAGCCTTT CCGGCAAAGA AAAATGGGTG GCCAGGGCCA TCCGGGAGCG GATGCAAGAG
TTTGGGTACG ACGAGGCGCG GATTGACAGC CTGGGGAATG TTATTGGTAT TATTCGCGGC
CGGCGGCCAG GTCCCTGCCT CCTCTTCGAC GGTCACATGG ATACTGTTGC TGCCACCGGG
GAAGGCTGGC GCCATGATCC CATCGGCGGC GAGGTGCAGG ACGGCAGGCT CTATGGCCGC
GGCGCTTCCG ACATGAAAGG AGCCCTGGCG GCTATGGTAG CCGCCGCCGG TTACTTTGCC
CATGACCGGG AGCGGGATTT CGCCGGTACC CTGGCGGTGG CCGGCACCGT CCATGAGGAA
TGCTTTGAAG GGGTGGCGGC CAGAGAGGTT TTTAAGGCCG TTAGGCCTGA TTATGTGGTT
CTCGGCGAGG CTTCGGAATT AAACCTGAAG CGCGGCCAGC GGGGCCGGGC GGAGATCGTC
ATCACTACCC GGGGCCGTGC CGCCCATTCA TCCAACCCCG GGGCGGGGAA TAATGCCGTC
TACCAGATGG TCGAGGTGGT CCGCCGCCTG CGGGAGCTGG AGCCGCCCCT GCATCCGGTC
CTGGGGCCTG GGATTCTAGA GCTGACGGAT ATTATTTCCG CGCCTTATCC CGGAGCTTCG
GTGGTTCCTG ATACCTGCCG GGTTACCTAC GACCGCCGCC TGCTGGTAGG AGAGACCAGG
GAGGGGGTAC TGGCTCCCAT TCGTAAGGTT CTCGATGAGC TGGCGGCATC TTGCCCCGGT
TTCCGGGCCG AGGTCGCCTT CGCCCGGGGT GAGGGTAAGT GCTATACCGG CGCCCACCTC
GCCAGCGAGC GCTTTTATCC CGGATGGCTG CTGCCGGATG ATCATGAACT GGTGCGCCGG
GCCCTGGCCG GCCTGCGGGC TGCCGGCCTG CAGCCGGCTT TAAGCCATTA CTCCTTCTGC
ACCAACGGTA GCTTTTACGC CGGCGAAGCC GGGGTGCCCA CCATCGGCTT TGGTCCTTCC
CGGGAAGAAC TGGCCCATGT GGTTGATGAA TACATTGAGC TGGAACAGCT CTGGGCCGCG
GCAACGGGTT ATTACGCCCT GGCCGGCGCC CTCCTGGCTC CTTAA
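
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of that cleanup and of a GC-fraction calculation; the helper names are hypothetical, and only the first row of the sequence is used here, so its GC fraction will generally differ from the whole-gene figure reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGCTGACAG AAGCCAGGCG CCGTGAGGTG CTGGAATTGG CGCGGCAGTT GCTGAGGATA"

seq = clean_sequence(first_row)
print(len(seq))                    # 60 bases per full row
print(round(gc_fraction(seq), 2))  # GC fraction of this row only
```

To reproduce the reported GC content, the same two functions would be applied to the full sequence with all rows pasted together.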
 
Protein sequence
MLTEARRREV LELARQLLRI PSLSGKEKWV ARAIRERMQE FGYDEARIDS LGNVIGIIRG 
RRPGPCLLFD GHMDTVAATG EGWRHDPIGG EVQDGRLYGR GASDMKGALA AMVAAAGYFA
HDRERDFAGT LAVAGTVHEE CFEGVAAREV FKAVRPDYVV LGEASELNLK RGQRGRAEIV
ITTRGRAAHS SNPGAGNNAV YQMVEVVRRL RELEPPLHPV LGPGILELTD IISAPYPGAS
VVPDTCRVTY DRRLLVGETR EGVLAPIRKV LDELAASCPG FRAEVAFARG EGKCYTGAHL
ASERFYPGWL LPDDHELVRR ALAGLRAAGL QPALSHYSFC TNGSFYAGEA GVPTIGFGPS
REELAHVVDE YIELEQLWAA ATGYYALAGA LLAP
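
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative, not the database's own method: it builds the standard codon assignments, which translation table 11 shares for elongation (table 11 differs mainly in which start codons it permits), and translates the first and last rows of the gene sequence. The function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence, copied from the listing above.
# The rows before the last one total 1140 bases, so the last row is in frame.
first_row = "ATGCTGACAG AAGCCAGGCG CCGTGAGGTG CTGGAATTGG CGCGGCAGTT GCTGAGGATA"
last_row = "GCAACGGGTT ATTACGCCCT GGCCGGCGCC CTCCTGGCTC CTTAA"

print(translate(clean_sequence(first_row)))  # MLTEARRREVLELARQLLRI
print(translate(clean_sequence(last_row)))   # ATGYYALAGALLAP
```

Both outputs match the start and end of the protein sequence listed above.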