Gene Moth_2167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2167 
Symbol 
ID3833016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2267151 
End bp2268272 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content61% 
IMG OID637830089 
Productalanine racemase 
Protein accessionYP_430999 
Protein GI83590990 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0787] Alanine racemase 
TIGRFAM ID[TIGR00492] alanine racemase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000196511 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000176162 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
TTGTCGCGGC CGGTTTGGGC CGAGATTGAC CTGGAAGCAG TGGCCCGCAA TGTCCGGGCC 
ATTAAAAAGA TACTGGCGCC GCAAACAGAA ATTATGGCCA TCGTTAAAGC CAATGCCTAC
GGCCACGGGG CCGGGCCGGT GGCCAGGACA GCCCTGGCTA ACGGCGTCAG CTGGCTGGGT
GTAGCTACCC TGGGTGAAGC CCTGGACCTG CGGCGGGAAG GAATTACTGC CCCGCTTCTT
ATTTTGGGCT ATACCCCGCC GGAGGATGCC GGGCGGGTGG TAGAGGCCGA TATTTCCCAG
ACGGTCTTCA GCGTTGACCA GGCCCGGGCC CTGAACGCCG CCGCCGCCGC TGTAGGTACC
AGGGCCCGCC TGCATTTAAA GATCGATACC GGTATGGGCC GGCTGGGTTT TCTGCCCCGG
GAGGCGGTGA CTGCGGCCCG GGCTATAGCG GATCTGCCCC ATGTCACCCT GGAAGGTATT
TTCACCCACT TTGCCGCGTC CGATGCCGCT GATAAGACCT ACACCCAGCG GCAGCTGGGC
TTATTTCAAC AAGTGATTGC CGAACTGGAG AAACAGGGCA TAACCTTTCC CTGGCGCCAT
GCAGCCAACA GCGGGGCCAT CATCGATCTC CCCGGGACCC ACTTTAACCT GGTCCGCGCC
GGGATTATCC TTTATGGCCA CTATCCCTCG CCGGAGGTCC AGCGGAAGAG GCTGGCGTTA
ACGCCGGTGA TGACGCTAAA AACCAGGGTA GTCCTGGTAA AGGAGGTGCC GGCCGGGTCG
TATATCAGCT ATGGCTGCAC CTACCGCACC CCCGGTCCGG CCAGGATTGC CACCCTCCCG
GTGGGTTATG CCGACGGCTA TTCCCGTCTC CTTTCCAACC GGGCCGAGGT CCTGGTCCGC
GGTCGCCGGG CACCGATCGT CGGCCGTATC TGTATGGACC AGTGCATGAT AGATGTTACA
GCTATACCGG AAGTCCGGGT TGGTGATGAA GTAGTCCTCT TCGGTCGCCA GGGAGGACAG
ACTCTGACGG TAGAAGAGGT GGCGGCCTGG ATGGGGACCA TTAACTACGA AATCCTGTGC
CTGATATCCA AGCGCGTGCC ACGTGTATAT CTTCATAGTT AA
 
Protein sequence
MSRPVWAEID LEAVARNVRA IKKILAPQTE IMAIVKANAY GHGAGPVART ALANGVSWLG 
VATLGEALDL RREGITAPLL ILGYTPPEDA GRVVEADISQ TVFSVDQARA LNAAAAAVGT
RARLHLKIDT GMGRLGFLPR EAVTAARAIA DLPHVTLEGI FTHFAASDAA DKTYTQRQLG
LFQQVIAELE KQGITFPWRH AANSGAIIDL PGTHFNLVRA GIILYGHYPS PEVQRKRLAL
TPVMTLKTRV VLVKEVPAGS YISYGCTYRT PGPARIATLP VGYADGYSRL LSNRAEVLVR
GRRAPIVGRI CMDQCMIDVT AIPEVRVGDE VVLFGRQGGQ TLTVEEVAAW MGTINYEILC
LISKRVPRVY LHS