Gene Mpal_2010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2010 
Symbol 
ID7271991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2134242 
End bp2135324 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content64% 
IMG OID643570624 
Productprotein of unknown function DUF201 
Protein accessionYP_002467034 
Protein GI219852602 
COG category[R] General function prediction only 
COG ID[COG2232] Predicted ATP-dependent carboligase related to biotin carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.493163 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.206685 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGGAA ACGTCCTTGT CGCAGGGTTT ACGACACGCC ATGTCGCACG CTCTGCGGCC 
GCTGCCGGGT ACCGGGTCAC CGCCGTCGAC CACTTCTGCG ATCAGGACCT GAGCTGGTAT
ACGGACGAAC AGATCCGGTT CGATGATCTC GACGATCTCC CTGATGCGAT CGAGGGGATC
TGCCAGCGGC ACCAGTTCGA CTTCATGGTC GTCACCTCGG GGGCGGAGAC CGTCGACTGC
CCGGTCCCGC TGATGGGAAC GCCGGCAGTG CAGGTCGAAC CGTTCCTCGA CAAGGGGATG
ATGCAGGAGT TCTTTGAAGG ACTGGAGATG CCGATCCCAC CAAGGGCAGC ACCGGGCACC
TACCCGGTCT TTCTCAAGCC GCTGACCGGG GCTGGCGGCT GGCGGAATGC GATCGTACAC
AGCATTGACG AGGAACGGGC CTGGGAGGCG CTCTTCCCCG GAGCCCCATA CCTGGCCCAG
GAGATCGTCG ACGGGGTGCC GGCGAGTGTC TCCTGTATTG GCGACGGGTC GAGGGCGGTG
GCTGTGGCGG TGAACCGGCA GGTGATGCGC GGCGGGGACG AGGCGGCGTT CGGGTTCTCG
GGATCGATGA CCCCGTTCGA CACGCCGATG GCTGCAGAGA TGGTCAGGGT TGCCGAACAG
GTGGTCGCAG CCAGCGGGTG CGTCGGGTCG GTCGGGGTCG ACTTCATTGT TGGGGATGAT
CTTCACCTGA TCGAGATCAA TCCGCGGTTC CAGGGGACCG TCGACACCGT CGAGATGGCC
ACCGGATGCA ACCTCTTCGA TCTCCATGTC GCTGGATGCG AAGGGCGTCT GCCCGTTCTC
CCGCCACGGG TGGCGGGGCG GTACGCGGTT CGGTCGATCC TCTTTGCAGA AGAGGAACTG
GTCGTCACGG GAGACCTCAC CGGTCTGGCC CCGATCGTCG CGGATATCCC CTGGCCGGGG
ACGGTGATCG AGGAGGGCGG AGCGATCGTC AGCGTGTACG GGCAGGGGCC GACTGAGGCC
CAGGCACGCG CCTCGCTGGA TAACAATATT ATCACTGTGC GTACATATAT GAGCCAATGG
TAG
 
Protein sequence
MKGNVLVAGF TTRHVARSAA AAGYRVTAVD HFCDQDLSWY TDEQIRFDDL DDLPDAIEGI 
CQRHQFDFMV VTSGAETVDC PVPLMGTPAV QVEPFLDKGM MQEFFEGLEM PIPPRAAPGT
YPVFLKPLTG AGGWRNAIVH SIDEERAWEA LFPGAPYLAQ EIVDGVPASV SCIGDGSRAV
AVAVNRQVMR GGDEAAFGFS GSMTPFDTPM AAEMVRVAEQ VVAASGCVGS VGVDFIVGDD
LHLIEINPRF QGTVDTVEMA TGCNLFDLHV AGCEGRLPVL PPRVAGRYAV RSILFAEEEL
VVTGDLTGLA PIVADIPWPG TVIEEGGAIV SVYGQGPTEA QARASLDNNI ITVRTYMSQW