Gene Mbar_A3252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A3252 
Symbol 
ID3627729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp4175140 
End bp4176438 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content47% 
IMG OID637702088 
ProductN-ethylammeline chlorohydrolase 
Protein accessionYP_306713 
Protein GI73670698 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.147814 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACA TAATTGTAAA AAATGCTTAC GTTATGACAA TGGACCCCGA TGAGGGAGAT 
CTCAAAAATG GGACTGTTGT CATCGAAGAC GGAAAGATTA CGGAAATCGG AGAAAAGACC
AGTGAAAGTG CCGATACCGT AATTGATGCA AAACATTCGG TAGTAATGCC AGGGCTTGTA
AATACGCATA CTCATGCTGC AATGACTCTT TTTCGGGGTT ATGCCGACGA TTTGCAGCTT
GCAGACTGGC TTGAAGGGCA TATATGGCCT GCCGAGGCAA AGCTGACTGC AGAAGATGTT
TATAAAGGCA GTCTGCTTGC CTGCCTGGAG ATGATCAGGT CAGGCACTAC TTCTTTTGCG
GACATGTACT TTTATATGGA CGAGACTGCA AAAGCTGTTG AGGCATCAGG GCTTCGGGCT
TCACTTTGCC ATGGGCTTAT CGAACTCTGG AACGAAGAAA AGGGCGCAAC AGACCTAAAA
GAAGGGAAGC GCTTTGTCCG GGCCTGGCAG GGAGCGGCTG ACGGCAGGAT AAAAACAATG
TATGGGCCTC ATGCCCCGAA TACCTGCTCT GAAGAATTTC TTGCAAAGGT AAGGGAAGAA
GCCAACAGGG ATGGTGCAGG AATCCATATC CATCTCCTTG AAACGGAAGC CGAACTCCTG
GCTATGAAAG AAAGGTACGG GAAATGTTCG GTGCACCTTC TGGAAGACAT AGGATTTTTA
GGGCCTGATG TGCTTGCTGC TCACTGTGTC TGGCTTTCGG ACGGCGACAT AGAAATCCTG
GGAAAAAGAG GAGTAAATGT TTCTCATAAT GTCATAAGTA ATATGAAACT AGCTTCAGGG
ATTGCACCTG TATACAAGAT GCTCGAAAAA GGAGTCAATG TGAGCCTTGG TACGGATGGT
TGTGCCTCAA ACAATAACCT TGACCTTTTT GAGGAGATGA AAACGGCTGC TCTTCTGCAT
AAAGTCAATA CTTTCAGCCC TACTGCCCTT CCTGCGCGAC AGGTGCTTCA AATGGGTACT
GTGAACGGTG CAAAAGCCCT TGGCACGGAA ACCGGCATGT TGAAAGTAGG AATGAAAGCG
GACCTTATCG TGGTAGATAT GAAAAAAGCG CATCTTACCC CCTGTTTTGA TGTTCCCTCC
CACTTGGTGT ACTCTGCAAA AGGAAGCGAC GTCAGGACAA CAATTGTAAA TGGGAAAGTC
CTTATGGATG ATTACAAAGT GCTGGCCCTG GACGAGCAGA AAGTTATGGA AGATGCTCAA
AAAGCCGCAG AAGAGCTTGT TACAAGGGTA AACGCCTGA
 
Protein sequence
MADIIVKNAY VMTMDPDEGD LKNGTVVIED GKITEIGEKT SESADTVIDA KHSVVMPGLV 
NTHTHAAMTL FRGYADDLQL ADWLEGHIWP AEAKLTAEDV YKGSLLACLE MIRSGTTSFA
DMYFYMDETA KAVEASGLRA SLCHGLIELW NEEKGATDLK EGKRFVRAWQ GAADGRIKTM
YGPHAPNTCS EEFLAKVREE ANRDGAGIHI HLLETEAELL AMKERYGKCS VHLLEDIGFL
GPDVLAAHCV WLSDGDIEIL GKRGVNVSHN VISNMKLASG IAPVYKMLEK GVNVSLGTDG
CASNNNLDLF EEMKTAALLH KVNTFSPTAL PARQVLQMGT VNGAKALGTE TGMLKVGMKA
DLIVVDMKKA HLTPCFDVPS HLVYSAKGSD VRTTIVNGKV LMDDYKVLAL DEQKVMEDAQ
KAAEELVTRV NA