Gene Mbar_A0486 details


Gene Information

Locus tag: Mbar_A0486
Symbol:
ID: 3626600
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 583700
End bp: 584809
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 43%
IMG OID: 637699379
Product: chlorohydrolase family protein
Protein accession: YP_304048
Protein GI: 73668033
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACGGCA CTGAACAGAT AATTCCCGGG ACAATTATTG CAGGCCCTGA ACTGGAACCT 
ATTGAAGGGT ATATCTGCGT AAAGAGAGGA ATAATTACGG AGATTGGGGA AGAGCATACA
CTCTCTAAGA ACATAATTGC TCCCTGTTTT GTCAATGCAC ATACTCACCT TGGAGACTCG
GTATTTAAAG ACCCCCCTCT GGGAAAAGTT TCCGGATTTC GGACTCAAAG AGACCTGGAT
GCTCTTGTAA AGCCTCCTGA TGGGTTAAAA CACAGGATAT TAAGAGACAC GCCATACAAA
GCCTTGATCG AGGGCATGAG AAAGTCCTTG CTAGACATGA TAGAAACCGG AACCTGTGCT
TTTGCGGATT TCAGAGAAGG AGGCGTTGTA GGGATTGCAG CCCTGAATAA AGCACTCGAA
GGACTTAAAC TTCATTCCAT GATACTGGGT AGACCTACAG AGCCTGAACT TCCACTTCAG
GTTGTACTGG CAGAAGTAAG AAGAATTTTT CTACATTCTA CCGGGCTTGG AATGAGTGGG
GCAAATGACC TTGATCTTGA ACTATTGGAG AATATTGCTA CCTACACACG ACAACATAAA
AAATTCTTTG CAATTCACGC CGGAGAAAAA GATACGAGCG ACATAGAAAA AGCATTGTCA
CTTGAGCCAG ATCTTCTGAT TCATTTGACA AATGCCACAA AAAAAGATCT TGAAGACGTG
GCTGATGCGA AGATTCCGGT TGTCGTCTGT CCCAGGTCTA ACCTAATTAC AGGAGCAGGA
ATTGCGCCTG TTGCCGAGAT GCTGGAAGCT GGGATAAAAG TAGCTGCAGG AACGGATAAT
GTAATGTTAA ATTCTGTAAA TATGTTTGCT GAGATGGAAT TTATGTCTAA GATTTTTTCC
ATTGATGACA GGCAAGTATT TAAAATTTGC ACACTTAATG GTTCCTTTGT AATGGGATCT
GATTCTACGG GTTCGATACA AAAGGGGAAT AAAGCTAATC TCATGATCCT GAATGGAAAT
TCCAATAATC TTGCAGGGAT AAAGAACCCC ATAAGTGGAA TCACAAGGCG GGCAAGACCC
GATGACATAC TATCAGTGCT TCATTCGTAA
 
Protein sequence
MYGTEQIIPG TIIAGPELEP IEGYICVKRG IITEIGEEHT LSKNIIAPCF VNAHTHLGDS 
VFKDPPLGKV SGFRTQRDLD ALVKPPDGLK HRILRDTPYK ALIEGMRKSL LDMIETGTCA
FADFREGGVV GIAALNKALE GLKLHSMILG RPTEPELPLQ VVLAEVRRIF LHSTGLGMSG
ANDLDLELLE NIATYTRQHK KFFAIHAGEK DTSDIEKALS LEPDLLIHLT NATKKDLEDV
ADAKIPVVVC PRSNLITGAG IAPVAEMLEA GIKVAAGTDN VMLNSVNMFA EMEFMSKIFS
IDDRQVFKIC TLNGSFVMGS DSTGSIQKGN KANLMILNGN SNNLAGIKNP ISGITRRARP
DDILSVLHS
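
Consistency check (illustrative)

The coordinates, lengths, and sequences in this record can be cross-checked against one another. The sketch below is a minimal example, not part of the original record: the helper names gene_length and translate are hypothetical, and the codon table is the standard amino-acid mapping, which NCBI translation table 11 shares (table 11 differs only in the start codons it permits). It assumes inclusive, 1-based coordinates.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Translation table 11 uses this same mapping; only its start codons differ.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def gene_length(start_bp, end_bp):
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def translate(dna):
    """Translate a DNA string (spaces and newlines ignored) up to the first stop."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# Coordinates from the record: 583700..584809
length = gene_length(583700, 584809)
print(length)           # 1110
print(length // 3 - 1)  # 369 (codons minus the stop codon)

# First and last 30 bases of the gene sequence as listed in the record
print(translate("ATGTACGGCA CTGAACAGAT AATTCCCGGG"))  # MYGTEQIIPG
print(translate("GATGACATAC TATCAGTGCT TCATTCGTAA"))  # DDILSVLHS
```

Passing the full gene sequence to translate should give the 369-residue protein sequence listed in the record, since the reading frame begins at the first base and ends at the TAA stop codon.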