Gene Mboo_2140 details


Gene Information

Locus tag: Mboo_2140
Symbol: pyrG
ID: 5409958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2212273
End bp: 2213853
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 58%
IMG OID: 640869385
Product: CTP synthetase
Protein accession: YP_001405297
Protein GI: 154151679
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0504] CTP synthase (UTP-ammonia lyase)
TIGRFAM ID: [TIGR00337] CTP synthase
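
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the gene record above.
START_BP = 2212273
END_BP = 2213853


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1581 526
```

Both results agree with the Gene Length and Protein Length fields of the record.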


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGTATA TTTTTGTCAC CGGCGGAGTC ATGAGCGGCC TGGGGAAAGG GATCACGGCC 
GCTTCTGTGG GGCGGATCCT GAAAAACCGG GGGTACCGGG TCACGGCGGT CAAGATCGAT
CCGTACCTTA ATATCGATGC CGGCACGATG AACCCGGCCC AGCATGGGGA AGTCTTTGTC
CTGAAGGACG GGGGGGAGGT CGACCTTGAC CTGGGTAACT ATGAACGGTT CCTAGACATC
GAGCTCACCT CCTCGCACAA TATCACGACG GGGAAGGTCT ACCGCACGGT GATCGAAAAA
GAGCGCCGTG GCGATTTCCT GGGTGAGACC GTCCAGATCA TCCCCCACAT CACCGACCAG
ATCAAGACCT GTATCCGTCA GGCCGCAGAA GAGACATTTC CCGACGGCAC GAAAGCCGAT
GTCTGCCTTG TCGAGGTTGG CGGGACGGTC GGAGATATCG AGAGCATGCC GTTTCTCGAG
GCCGTTCGGC AGATGCGGGG AGAACTCGAT GAGCATGACT ACGTCCTTGT TCACGTGACC
CTTGTTCCCG AGGATGCGAT GGGGGACTTA AAAACCAAGC CCACCCAGCA CTCGGTAAAG
GCGCTCCGGG AGCTCGGCCT TCACGCCGAT ATTATCGTTT GCCGGAGCGA GCGGGTGGTA
GGGGCAAACA CCAAGCGCAA GATCTCGGCG TTCTGCGACC TCCCCCTCAG CGCTGTCATT
TCGGCTGCAA CCGCCCGGGA TACCTACGAG GTTCCCATGG AGATGGAAAA GGAGGGGATC
GCCGATGTCC TCTCGACCCA TCTCGGCCTT GAGAAGAAAG AGACCGACCC CTCGTGGTAC
CGGCTCGTCA CCAAAGAATA CACCAACCGC GTCACCGTTG CCATCGTGAG CAAATACGGG
ATCGAGGATG TGTACATCAG CATCAAGGAA GCGCTTAAGC ACGCGGGCCG CGCCCTTTCG
ACCGAGGTGA AGATCGTCTG GCTCGATGCC GAGCGGTACG AACCCTGCTC GCTCAAGGAT
TATGATGGCA TCCTTATCCC GGGAGGTTTC GGGAAGCGGG GGATCGAGGG CAAGATCGGG
GCAATCCGGT TTGCACGTGA GAACAAGGTC CCTTTCCTTG GTCTCTGCCT CGGGTTCCAG
CTTGCGACAA TCGAGTTTGC CCGGCACAAG TGCGGGATTG CCGATGCAAC AAGCGAGGAG
TTTGGCGAAG GCTCGCACGT GATCGCGCTC CTTCCCGAAC AGGAGAGCGT GACTGAACTG
GGCGGCACTA TGCGGCTTGG TGACTATACC TCAGATATCA GGGACAAAAC GCTTGCGATG
AAACTCTACG GGAAATCCCA GATCATCGAG CGCCACCGGC ACCGGTACGA GGTAAACCCT
CACTATATCG AAAAACTCGA AAAAGAGGGA CTGGTCTTCT CTGCAACGAA CAAAAACCGG
ATGGAGTGCC TGGAACTCCC CGGCCACCCG TTCTTCTTTG CAACCCAGTT CCACCCCGAG
TTCAAGTCCC GGCCGACCCG CCCGTCGCCG CCGTACCTCG GCTTTGTCGA AGCGTGCCGG
GCAAACAAGC GGACCACATA A
 
Protein sequence
MKYIFVTGGV MSGLGKGITA ASVGRILKNR GYRVTAVKID PYLNIDAGTM NPAQHGEVFV 
LKDGGEVDLD LGNYERFLDI ELTSSHNITT GKVYRTVIEK ERRGDFLGET VQIIPHITDQ
IKTCIRQAAE ETFPDGTKAD VCLVEVGGTV GDIESMPFLE AVRQMRGELD EHDYVLVHVT
LVPEDAMGDL KTKPTQHSVK ALRELGLHAD IIVCRSERVV GANTKRKISA FCDLPLSAVI
SAATARDTYE VPMEMEKEGI ADVLSTHLGL EKKETDPSWY RLVTKEYTNR VTVAIVSKYG
IEDVYISIKE ALKHAGRALS TEVKIVWLDA ERYEPCSLKD YDGILIPGGF GKRGIEGKIG
AIRFARENKV PFLGLCLGFQ LATIEFARHK CGIADATSEE FGEGSHVIAL LPEQESVTEL
GGTMRLGDYT SDIRDKTLAM KLYGKSQIIE RHRHRYEVNP HYIEKLEKEG LVFSATNKNR
MECLELPGHP FFFATQFHPE FKSRPTRPSP PYLGFVEACR ANKRTT
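
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the database's own method: it uses the codon assignments and initiator codons of NCBI translation table 11, assumes the input is a complete coding strand read 5' to 3', and the helper name translate_cds is hypothetical. An initiator codon in first position (here GTG) is read as methionine.

```python
from itertools import product

# Codon assignments of NCBI translation table 11, in TCAG order
# (third base varies fastest). The amino acid assignments match the
# standard code; table 11 differs in its set of initiator codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons listed for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    # An initiator codon in first position is read as methionine.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    # Drop a trailing stop codon, if present.
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAAGTATA TTTTTGTCAC CGGCGGAGTC"))  # prints: MKYIFVTGGV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence in the same way should yield the full 526-residue protein.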