Gene Msed_1219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1219 
Symbol 
ID5103833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1192442 
End bp1193614 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content52% 
IMG OID640507111 
Productamidohydrolase 
Protein accessionYP_001191304 
Protein GI146303988 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.82253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAAAT TTAGCGGTAA AATTTTCGAC GGGACTAAAC TCATTGAGGG GACAGTCTTG 
GTTGAGGGTA ATCAAATCGT TGAGGTCGAG GAGGGAAAGA TGGAGAGTGA CTCCACGAAG
GGTTTCATCA TCCCTGGAAT GATTGATGCC CACCTCCACT TCTTCGGAGT TCATGAGGAC
AACGTAATGT CTTGGAACCT CGTGAACGAG ATCGACGTGG CCATTAGGAG CACCAGGGAT
ATGGAGAGAC TTCTTAGGTC AGGGTTTACA ACGGTCAGGG ATCTGGGAAG TAAGGTGGCA
ACTAGGCTCT CCAACCTGGA GAGATCTGGG GAGATCATAG GCCCTAGAGT AATAGCCTCA
GGTTACTCCT TGGCCATCAC GGGAGGAGAC GACGATCCGC GGGATTTGCC CCTTGAGATG
GCACAAAAGC TGTCGTACTC CTTCTATTGT GATTCTCCCT ATGAGTGCAG GAAGGCCGTG
AGGTTAGCCG TGAGGCAGGG AGCTGGAGTC ATAAAGGTCT ACGCCTCGGG AGCGTTTTCC
CAGGGCGGAA AGATCCTTCC AGGGCTGGGA CCATACGAGC TGAAATCCAT CGTGGAGGAG
TCTCATAGGT TTGGGCTTAA GGTAGCGTCC CACGCCTATG GAAGGGAGGC AATCCTCAAC
TCAGTTGAGG CAGGAGTGGA CACCATTGAA CACGGGCTTG GCCTCGATAA GGACACGGCA
TCCATGATGT TAGACAGGGG GACATGTTAT ATCCCTACCC TAGCTACGTA TGAGATCCCA
TTTCACGTGG CAAACCCAGA GGTAAGGAGA TACAGGGAAG AGGCGGTCTC AAGGCATATG
AAGGAAGACG TTAAGTTAGC CAAGTCCGTG GGACTTAAGA TCGCCACCGG GACGGATTAC
GTGGGTTCAG ATGCTAGACC ACATGGCAAA AATTACAGGG AAGCGGTCCT CCTCTCGCAG
TTCATGGGAA ACGACGAAGT TCTTGCATCA ACAACTTCCG TGGCGGCGGA GTGTCTGGGA
ATAAGGGCTG GTAGGATAGA GAAGGGATAT CTGGCAGACC TCGTGGTTCT GAGGAATGAT
CCTCTCCAGA ACGTGGAGAA CCTTTCGCCC GAGAACGTGC TTTACGTCGT TAAGGACGGG
AAAATGTATC GAGGAGTAGG AAGGGAGGAC TAA
 
Protein sequence
MLKFSGKIFD GTKLIEGTVL VEGNQIVEVE EGKMESDSTK GFIIPGMIDA HLHFFGVHED 
NVMSWNLVNE IDVAIRSTRD MERLLRSGFT TVRDLGSKVA TRLSNLERSG EIIGPRVIAS
GYSLAITGGD DDPRDLPLEM AQKLSYSFYC DSPYECRKAV RLAVRQGAGV IKVYASGAFS
QGGKILPGLG PYELKSIVEE SHRFGLKVAS HAYGREAILN SVEAGVDTIE HGLGLDKDTA
SMMLDRGTCY IPTLATYEIP FHVANPEVRR YREEAVSRHM KEDVKLAKSV GLKIATGTDY
VGSDARPHGK NYREAVLLSQ FMGNDEVLAS TTSVAAECLG IRAGRIEKGY LADLVVLRND
PLQNVENLSP ENVLYVVKDG KMYRGVGRED