Gene Mboo_2035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2035 
Symbol 
ID5411162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2110660 
End bp2111634 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content62% 
IMG OID640869277 
Productputative agmatinase 
Protein accessionYP_001405192 
Protein GI154151574 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01230] agmatinase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.108559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.06349 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAACG ACCTGGAGAA GATGGCGGCG CAGTGCAGGA CTTTTACCAA AGAGATGGTG 
GACAACCCGT ACCGGGGGCT TGCCACGTTC TTTGGCCTGC CGTACACCGA ATCGCTCGAC
AACCTCGACA TTGCGCTCAT CGGGGTTCCC ATAGATCTGG GAGTCACCGA CCGGAGCGGA
ACCCGGATGG GCCCGAGGGC ATTGCGCAAC GAGTCCCGGG GCGTCGGAGC CTACAACCAC
TGCACCCGTT CGACCCCCTG CACGGCACAC CGGATCGCTG ATGTCGGAGA CGTGCCCTTC
CGTTCGGTGT ACCGGATCGA AGAAGCGCTG GACGATATCT CCGCGTACTA CCGCGGGATT
GCGGCAGCCG GAGTCACCCC CGTGACCGCG GGAGGGGATC ACTCGATCAC CTTCCCGATC
TTACAGGGCC TTGCCCCAAA AGAGAAGGTC TGCCTGGTCC ACTTCGATTC CCACTGCGAC
ACCGCCCCAC CGATCCATGG CTGCGGGTAC ACCCACGGTT CCCCGATGAA AAACACGGTG
GAGGCAGGGC TTGTGGACGC TGAACACTCC CTCCAGATCG GGATACGGGG CTCAAGCGAA
CCACTCTGGG AATTCTCCTC TGCAAGCGGT ATGCGGGTGA TCCACATCGA GGAGTTCTAC
GAGATGGGCT GGAAAGGCGC AGTAAAAGAG ATCCACGACC TTGTCGGTGA CAGCCCGGTG
TACCTCTCTT TTGATATCGA CTGCCTTGAC CCGGCCTTTG CCCCGGGCAC TGGGACACCG
GTCGCCGGCG GCATGTCCAC GTTTGAAGCG CTCCAGATGG TGAGGGGAAT GCAGGGCCTG
GATGTCATCG GCGGCGACCT CGTGGAGGTC TCCCCACCCT ACGATCATGC GGGTATCACC
GCCCTTGCCG GGGCGACCCT CCTCTTTGAG ATTCTCTGCC GTGCGGCCGA GGCACGGGAA
CGCCGGGGGG CCTGA
 
Protein sequence
MDNDLEKMAA QCRTFTKEMV DNPYRGLATF FGLPYTESLD NLDIALIGVP IDLGVTDRSG 
TRMGPRALRN ESRGVGAYNH CTRSTPCTAH RIADVGDVPF RSVYRIEEAL DDISAYYRGI
AAAGVTPVTA GGDHSITFPI LQGLAPKEKV CLVHFDSHCD TAPPIHGCGY THGSPMKNTV
EAGLVDAEHS LQIGIRGSSE PLWEFSSASG MRVIHIEEFY EMGWKGAVKE IHDLVGDSPV
YLSFDIDCLD PAFAPGTGTP VAGGMSTFEA LQMVRGMQGL DVIGGDLVEV SPPYDHAGIT
ALAGATLLFE ILCRAAEARE RRGA