Gene Mthe_1404 details


Gene Information

Locus tag: Mthe_1404
Symbol:
ID: 4463027
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 1504912
End bp: 1506594
Gene Length: 1683 bp
Protein Length: 560 aa
Translation table: 11
GC content: 58%
IMG OID: 639700422
Product: hydantoinase/oxoprolinase
Protein accession: YP_843819
Protein GI: 116754701
COG category: [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit
TIGRFAM ID:
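
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon (the listed gene sequence ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length_bp // 3 - 1


length = gene_length(1504912, 1506594)
print(length)                  # 1683
print(protein_length(length))  # 560
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields of the record.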


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGTAG GAATAGACGT CGGTGGAACA ACAACCAACG CGGCGCTGGT GGACGGCAAT 
AAAGTTGTTA AGACCGCCAT CGGCCCGACA GACCATCAGG AGATCCTCGG CAGCCTGCTC
AGAACAATGG ACAGGCTCAT CGAGGGTGTT GACGTTGAGA GGATCGAGAG GGTGGTGCTC
AGCACCACGC TGATCACAAA CCTCATCGCG GAAGGAAAGG CGGATAAGGT CGGCCTGGTT
CTGATACCCG GTCCCGGAGT CAATCCACGG GACTACAGGT TCAGGACAGA GCCTGTGATA
CTGGATGGTG CGATCGACTA CAGGGGAAGA GAGATCGCGC CACTCAGGGA CGATCAGATA
AGAGCGGCTG CGCAGAGCCT CGCCGACCAG GGGTACAGAA AAGTCGCAGT CGTGGGAAAG
TTCTGCCAGA GGAATCATGA GCATGAGACG CACGTCAGGG AGATATTCTC GAAGGTCGCT
CCCGCGATCG AGGTCGAGAT GGGACACAGG GTCTCAGGCC AGCTGAACTT CCCGCGGAGG
GCTGCCACGA CAATGCTGAC CCTCGCGACC CGCGACCACT ACAGGCGGTT CGCGGAGCAG
GCAGAGCGCG CCATGCGGGA TCGCGGAATA AGAGCTCCGA TATACATTCT TAAAGCGGAT
GGCGGAACGC TCCCGCTTGA CAAATCTCTG GATAAGCCTG TTGAGACGAT ATTCTCGGGA
CCGGCTGCAA GCGTCATGGG AGTGATGGCC CTGACCCCCA AGGGGCAGAC ATCAGTTGTA
GTGGATATAG GAGGAACAAC AACAGATCTC GCTCTGATTC TCTCCGGAAA ACCCCTACTC
TCATCGAAGG GCGCGAAGAT AGAGGACATG CTGACGCATG TAAGAGCTTT CGCTGTGCGC
TCCATAGGGA TCGGCGGGGA CAGCGTCGTC AGAGTGTCTG ATGGAAAGAT CACTGTGGGT
CCGGATCGAG CAGGGCCTGC ATTCGCGCTC GGCGGGCCGG AGCCGACGCC AACCGATGCT
CTCATGGTTC TCGGTCACAC GAACCTTGGG GACGTGGCCC TTGCTAGGAA GGGCATTGGC
ATAATAGCGA AGATCCTCAG ATGCAGCACC GAGGATGCGG CCAGAATGAT AGTTGATACT
GTTGTGGAGA GGATAGTTGA TACCGTGAAC ATGATGTTTC TCGAGTGGGA GCAGGAGCCA
GCGTACAGGA TCTGGGAGCT GCTTCAGAGG ACGAAGGCCA GACCGCAGAA TGTCGTCGGG
GTTGGTGGAG CTTCGCCGCC GCTGGTGCCG CTGGTCGCGA AGAGGCTCAA TGCGAATGCC
ATCATCCCGG AGCACGCACC CGTGGCAAAC GCTATAGGCG CCGCGGTCGC CAGGCCCACG
ATGACTTTGA GCCTCAGGAT AGATACCGAG AGGGGCATGT ACACGGTCGA GGAGGATGGC
ACGCTCGGCG AGGCGAAGGG GAGGAACCTC AGCCTCGAGG GAGCGCAGGA GATGGCGAGA
CGGCTCCTGA GGGAGAGGGC CGAGCGCTTC GGAATCCACG AGTATGCTGA CGAGGCCGAG
GTGGTGGACA GTGAGATCTT CAATATGGTC AGGGGTTGGT CGACTGTTGG GAAGCTTATC
GATGTCAGGA TGGAGATCCC AGCAGGAATC ATCACATCAT GGAGGAGAGA TCATGGCAGC
TGA
 
Protein sequence
MFVGIDVGGT TTNAALVDGN KVVKTAIGPT DHQEILGSLL RTMDRLIEGV DVERIERVVL 
STTLITNLIA EGKADKVGLV LIPGPGVNPR DYRFRTEPVI LDGAIDYRGR EIAPLRDDQI
RAAAQSLADQ GYRKVAVVGK FCQRNHEHET HVREIFSKVA PAIEVEMGHR VSGQLNFPRR
AATTMLTLAT RDHYRRFAEQ AERAMRDRGI RAPIYILKAD GGTLPLDKSL DKPVETIFSG
PAASVMGVMA LTPKGQTSVV VDIGGTTTDL ALILSGKPLL SSKGAKIEDM LTHVRAFAVR
SIGIGGDSVV RVSDGKITVG PDRAGPAFAL GGPEPTPTDA LMVLGHTNLG DVALARKGIG
IIAKILRCST EDAARMIVDT VVERIVDTVN MMFLEWEQEP AYRIWELLQR TKARPQNVVG
VGGASPPLVP LVAKRLNANA IIPEHAPVAN AIGAAVARPT MTLSLRIDTE RGMYTVEEDG
TLGEAKGRNL SLEGAQEMAR RLLRERAERF GIHEYADEAE VVDSEIFNMV RGWSTVGKLI
DVRMEIPAGI ITSWRRDHGS
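
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout and compute a GC percentage comparable to the GC content field. It is an illustration rather than the method used by the source database, and the helper names `clean` and `gc_percent` are hypothetical. Only the first line of the gene sequence is used as sample input.

```python
def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTTTGTAG GAATAGACGT CGGTGGAACA ACAACCAACG CGGCGCTGGT GGACGGCAAT"
seq = clean(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean` and `gc_percent` would give a whole-gene value to compare with the reported GC content.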