Gene Moth_0990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0990 
Symbol 
ID3830866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1017408 
End bp1019081 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content60% 
IMG OID637828919 
Producthydantoinase/oxoprolinase 
Protein accessionYP_429848 
Protein GI83589839 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.454811 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGTAG GCCTGGATAT GGGTGGTACC CATACCGATG TCGTCCTCAT AGCCGACGGC 
CGGGTGCAAC GCTACTGCAA GACACCGACC GACCCTGAAG ATTACCTCCA TACCGTTACC
AGGGCCCTGG ACGAGGTCCT GGAGGATGTT GCAACCGGCG ATGTACAGCG CCTGAACCTC
AGCACCACTG TCTGCACCAA TGCCATCGTG ACCGGCAGGA CCAGCCCGGT AGGCTTGCTC
CTGGAACCGG GGCCGGGATT AAATCCCGCC GGGCTGACCT GCGGCGCTAA AAATTTTATC
CTCTCGGGCG CCATCGATCA TCGCGGTCGC CCTACCTCTT CCCTGGTAGC AACCGAAATC
GAAGCTGCCG ATAAAGAGCT AAGGAAAGCA GCCATCCGTC ACCTGGCCAT AGTAGGTAAG
TTTTCCACCC GCAACCCGGA ACATGAATTG CAGATCAAGG AAGCCCTTTC CCCGACCTAT
GACTTTATCA CCCTGGGCCA TCGCCTGTCG GGACGTTTGA ACTACCCCAG GCGGGTTTTT
ACCGCCTATC TGAACAGTGC CGTAGCATCT ATCTATGATG CCTTTGCCAC CGCCATAAAT
GCCTATGCCG ATGGCAGGCA ACTGCCAGTA GTACCGGATA TACTAAAAGC CGATGGTGGC
ACCCTCTCCC TGGCCGCATC CCGGTCCCTC CCGGTGGAGA CCATCCTCTC CGGTCCCTCG
GCCAGTATTA TGGGAGCCCT GGCCCTGGCG CCCTCTTCCC GGGATACTAT TATTCTCGAT
ATCGGCGGTA CGACCACTGA TATCGCCTTC CTGGCCGACG GTGTCCCCCT CTTTGAACCT
TTAGGCATAA CCCTGGCCGG TTACCCAACC CTGGTACGCG CCCTTTACAG TTATTCCTTG
GGTCTGGGCG GCGACAGCTG CCTGCGGGTC AGGGACGGGC GCCTGAGCAT CGGTCCGGAG
CGCCTGGGGC CGGCCCTGGC CCTGGGCGGC CCCGCCCCTA CTCCCACCGA CGCCCTGATT
ACCATGGGCC GCCTGGACCT GGGGGATAAA GGGGCTGCCC GCCGGGGCCT GGATCAGTTA
GGGGATCAAC TGGGCCTCAA TACCGGGGAG GTGGCTAATG CCATTATCAA ACAAATGGCC
GGCGAGATTG CCCGCCAGAC CCGGGCCCTC CTTGAGAGAA TTAATAGCCG CCCGGTATAT
ACCGTGCGGG AAGTCCTGGA GGACAAAAAA CTACGCCCGG AACAGGTAAT CGTCATCGGT
GCCCCGGCGC CCCTCCTGGC GGCCGAGCTG GAAGCCGCTT TCGGCCTCCC GGTGGTAGCA
CCCTCCCTGG CCGGGGTTGC CAATGCCATT GGCGCCGCCC TGAGCCAGCC CACTACGGAA
ATCACCCTCC AGGCCGATAC AGAGCAGGGC TTCCTGACGA TTCCCGAAGA GGGCATCAGG
GAGAAAGTCC AACGCGGTTT CAACCTGGAG GCAGCCCGCC AGCGGGCCCT GGCAGCCCTG
CAGGATCGCC TGCGGCGCCT GGCACCGGCG GCAGCCGGTT CCGAATTGGA AGTAGTTGAG
GAACAATCCT TTAACATGGT CTCCGGCTTT TATACCACGG GCAAAAACAT CAGAGTCAAA
GTCCAGGTTA AACCCCAGGT CAGCCCCCTG GAGGGAGGTG AGCATAATGT ATAA
 
Protein sequence
MYVGLDMGGT HTDVVLIADG RVQRYCKTPT DPEDYLHTVT RALDEVLEDV ATGDVQRLNL 
STTVCTNAIV TGRTSPVGLL LEPGPGLNPA GLTCGAKNFI LSGAIDHRGR PTSSLVATEI
EAADKELRKA AIRHLAIVGK FSTRNPEHEL QIKEALSPTY DFITLGHRLS GRLNYPRRVF
TAYLNSAVAS IYDAFATAIN AYADGRQLPV VPDILKADGG TLSLAASRSL PVETILSGPS
ASIMGALALA PSSRDTIILD IGGTTTDIAF LADGVPLFEP LGITLAGYPT LVRALYSYSL
GLGGDSCLRV RDGRLSIGPE RLGPALALGG PAPTPTDALI TMGRLDLGDK GAARRGLDQL
GDQLGLNTGE VANAIIKQMA GEIARQTRAL LERINSRPVY TVREVLEDKK LRPEQVIVIG
APAPLLAAEL EAAFGLPVVA PSLAGVANAI GAALSQPTTE ITLQADTEQG FLTIPEEGIR
EKVQRGFNLE AARQRALAAL QDRLRRLAPA AAGSELEVVE EQSFNMVSGF YTTGKNIRVK
VQVKPQVSPL EGGEHNV