Gene Mpal_1029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1029 
Symbol 
ID7271763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1054947 
End bp1056263 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content61% 
IMG OID643569666 
Productamidohydrolase 
Protein accessionYP_002466100 
Protein GI219851668 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.951911 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGATA GAATGGGAAG AGAGAAAGCG TGCCTGATCA CAGGCACCTG TATTGAAGGA 
CGGCCTGTTG AGATTCTGAT CGATGAGACC GGGACGATTG CTGCGATCGA TGAGAAGATC
AGCGGGTCGG AGCGCTCGAA TGCCGAGGTG ATCATCGATG GGTCTGGGAC CCTCGCGATG
CATACGCTCG CGAACACGCA CACCCATGCC GCGATGTCCC TGCTTCGAGG GTACGCCGAC
GATATGATCC TGCAGGACTG GCTCGGGCAG AAGATCTGGC CCCTCGAAGC CTGCCTGACC
GGGGAGGACG TGTATTGGGG AACCCGACTC GCCTGCATCG AGATGATCCG GAGCGGGACC
ACGGCCTTCA ATGATATGTA CTTCTTCATG GAGGAGGCGG CCCGTGCCGT CGATCATGCC
GGGATCAAGG CCCAGCTCTG CTATGGGTTC ATCGACCTGA ACGATCACGA CAAGCGGGAG
CATGAGATCA GAGCGACCGA GGCGCTGGTC ACCTCGATCA AAGGAATGCA GAACCCGCGG
ATCAAACCGG CCGTCGGGCC GCATGCTGTC TACACGGTCT CACCGGAAGG ACTTGCATGG
CTGGCCGCCT ACAGTGCGTC GGAACAGATC GGGATCCATG TCCACCTGTC AGAGACCGAG
CAGGAGGTTC TCGATGCGCA GAAGAACAGT GGAAAGCGGC CGCCGGCGAT TCTCGACCAG
GCCGGGTGCC TGACCGACCG GACGATCGCA GCCCACTGCT GCTGGCTCGA CCAGGCGGAC
TGCCGGCTGC TTGCAGAGCG GGGGACAACG GTCTCGCATA ACCCGGCCAG TAATATGAAA
CTGTCTGTGA ACAGGGCAAT GCCCTATCCC TGGCTCGTTG AGGCCGGGGT TCCGGTCACC
CTCGGCACCG ACGGTTGTGC CTCCAACAAC AACCTGGACC TCTTTGAGGA GATGAAGATC
GCGGCCCTGC TTCAGAAGTT CGCCCAGAAC AACCCGACCT GCCTGCCGGC AGAAGAGGCG
CTCTCCATGG CCACTGTAAC TGGAACCCGG GCTCTCGGGT TCGGGAGCGG CCTTCTTGTA
GTCGGGGAGC CGGCGGATAT CATGCTCATC GACAGAATGG TCCCCTGCAA CACGCCCCTC
CACCACCAGA CCTCGAACAT GGTCTATGCC TGCAATGGCG GGGCCGTGAA GACGGTCCTC
TGCAATGGGC GGGTGGTGAT GCAGGATGGG GTGATCCCTG GCGAAGAAGA GGTGCTCAAC
AACGCCTCAC GAGCAGCAGC AGACCTGGTC AGGCGGGCTG CCGATCAGGC TGAGTGA
 
Protein sequence
MVDRMGREKA CLITGTCIEG RPVEILIDET GTIAAIDEKI SGSERSNAEV IIDGSGTLAM 
HTLANTHTHA AMSLLRGYAD DMILQDWLGQ KIWPLEACLT GEDVYWGTRL ACIEMIRSGT
TAFNDMYFFM EEAARAVDHA GIKAQLCYGF IDLNDHDKRE HEIRATEALV TSIKGMQNPR
IKPAVGPHAV YTVSPEGLAW LAAYSASEQI GIHVHLSETE QEVLDAQKNS GKRPPAILDQ
AGCLTDRTIA AHCCWLDQAD CRLLAERGTT VSHNPASNMK LSVNRAMPYP WLVEAGVPVT
LGTDGCASNN NLDLFEEMKI AALLQKFAQN NPTCLPAEEA LSMATVTGTR ALGFGSGLLV
VGEPADIMLI DRMVPCNTPL HHQTSNMVYA CNGGAVKTVL CNGRVVMQDG VIPGEEEVLN
NASRAAADLV RRAADQAE