Gene Athe_1032 details


Gene Information

Locus tag: Athe_1032
Symbol:
ID: 7409589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1126531
End bp: 1128171
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 34%
IMG OID: 643715398
Product: Hydantoinase/oxoprolinase
Protein accession: YP_002572906
Protein GI: 222529024
COG category: [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit
TIGRFAM ID:
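
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1126531
END_BP = 1128171


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1641 546
```

Both results agree with the listed Gene Length (1641 bp) and Protein Length (546 aa).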


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAATAG GCCTTGATGT TGGTGGTACA ACCATCGACA CAGTGGCAAT AGAAAACGGA 
AAAGTTGTTG CATTTAAAAA ATTCGAACGT GGTAGAAGTT TAATAGACTC TGTATTAGAA
AGTTTAAACC AGTTTATATC TAAAGAGATA ATTTCGAACC TTGAAAGAAT AACACTCAGT
ACAACCGTTA CTACAAACGC AATAGTTCAA AATAATTTAG ATAAGGTTGG AATGATAATA
GAAAACGGTA TTGGATCAAA TCCTGAATTT TTGATGTGTG GTGATATGAA TTTTTTGGCA
GATGGATATA TAAATAACAG AGGCATAGAG GTAAAACCAG TAAATATCGA AAGCGTAAAG
AACGCTTTGC ATAGCTTTAA ACAAGAAGGT ATAGAAAACT TGGGCATTGT TGGCAAGTTC
TCTGTTCGAA ATCCACAGCA TGAACTTGCT GTGTATGAGG CAGCTAAAGA ATACAACTTT
AAATTTATGT CAGTTGGCTA CAAACTTTCT GGCAAACTCA ACTTTCCCCG AAGAGTTTTT
TCCACATACT TGAACTGTGC AGTCCACAGT ACATTTAACA TGTTTTATAA AAGTATTTTA
ACATTTTCAC AAGAAAGAAA AATTCCTCTT GAGAAGATAT TTATTCAAAA GCCAGATGGG
GGAATTGTAA ATTTGCAAAA TATTGAAAAT TTTCCGATAT TCTCTATTCT CTCAGGTCCA
GCTGCATCTG CTCAAGGCGG ATTTGTCTTA TCTAATTTTG CAAAAAATGC AGTCATAATT
GACATTGGTG GAACAACAAC TGACATTGCG TTCTTGTCAA ACGGTAATCT TGTTCTTGAA
CCATATGGAG CAAAAATTGG AAAGTATCCT ACTTTGGTAA GATCAATATA TTCACGTTCT
GTGGGGCTGG GGGCTGAAAG TATTGTTAAA ATAGTAGATA ATACAATTAA GATAGGACCA
GAAACAAAAA GATGGTATGA ACAGGACAAA AATGAGTTTG CTACTTTGCA TGATGTTCTT
CAATTTTCAA GTTCACACTC TGAAAGTCCT GTAAATGTTC GTTTAAAGTC TTTATCTTCT
CAGCTGAATA TGAGCCAAGA AGACTTTTGT AAATTAGTGA TAAGCAGGGC AGTAGCAATG
ATAGAAGAAA AATTGAAAGA GGGAATTGAG TACATCAATA ATCTTCCAGT TTATACCATA
AATAAGCTAT TGTATGGAGA AAAATTCGCA CCTCAGAAGA TAATTCTAAT TGGTGGACCA
GCTCAGCTTT TAAAACCTTA TTTAGAGAAC AAGTTTAAAA TAAAAGTTGA AGTTCCGAAA
TATTACATGG TTGCAAACGC AATTGGTTGT GCGATCGGCT CAATCTCAAA AGAGTACAAC
TTAGTTGCAG ACACTATCCA AGGCAAAATG ATAATTCCAG AGCTAAATAT ATATCATAAT
ATTCCTGCAG ATTTTACTAT TGACCAGGCA AAGAAGGTTT TAATTGAAAA GGTCTTAGAG
TGCAGAGAAG CTAAAAATCA AGACGATATA GAGATTGTAG ATGAAAGTTC ATTTAATATG
ATAAGAAATT TTAGATTCTG TGGAAGAATG ATTAGAATTA AAGCTCAGTT AAAACCCAAG
CTTTTAAATT TGAGAGATTG A
 
Protein sequence
MIIGLDVGGT TIDTVAIENG KVVAFKKFER GRSLIDSVLE SLNQFISKEI ISNLERITLS 
TTVTTNAIVQ NNLDKVGMII ENGIGSNPEF LMCGDMNFLA DGYINNRGIE VKPVNIESVK
NALHSFKQEG IENLGIVGKF SVRNPQHELA VYEAAKEYNF KFMSVGYKLS GKLNFPRRVF
STYLNCAVHS TFNMFYKSIL TFSQERKIPL EKIFIQKPDG GIVNLQNIEN FPIFSILSGP
AASAQGGFVL SNFAKNAVII DIGGTTTDIA FLSNGNLVLE PYGAKIGKYP TLVRSIYSRS
VGLGAESIVK IVDNTIKIGP ETKRWYEQDK NEFATLHDVL QFSSSHSESP VNVRLKSLSS
QLNMSQEDFC KLVISRAVAM IEEKLKEGIE YINNLPVYTI NKLLYGEKFA PQKIILIGGP
AQLLKPYLEN KFKIKVEVPK YYMVANAIGC AIGSISKEYN LVADTIQGKM IIPELNIYHN
IPADFTIDQA KKVLIEKVLE CREAKNQDDI EIVDESSFNM IRNFRFCGRM IRIKAQLKPK
LLNLRD
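
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal, dependency-free illustration: it uses the standard codon assignments, which translation table 11 shares (table 11 differs from the standard code in the start codons it permits), and it ignores the spaces that group the sequence into 10-base blocks. The function name `translate` and the two test fragments (the first 30 bases and the last 21 bases of the gene sequence above) are chosen here for illustration only.

```python
# Codon table in NCBI "TCAG" order; assignments are those of the standard
# code, which translation table 11 also uses for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGATAATAG GCCTTGATGT TGGTGGTACA"))  # MIIGLDVGGT
    # Last 21 bases, ending in the TGA stop codon.
    print(translate("CTTTTAAATT TGAGAGATTG A"))  # LLNLRD
```

The two fragments translate to the first ten and last six residues of the protein sequence listed above.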