Gene Mext_1963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1963 
Symbol 
ID5833667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2194677 
End bp2195789 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content72% 
IMG OID641367764 
Productphosphoribosylaminoimidazole carboxylase, ATPase subunit 
Protein accessionYP_001639433 
Protein GI163851390 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.199721 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTCTC CCACCTCCTC GCCGCAGATC CGGCCCGGCG GCACCCTCGG CATCGTCGGC 
GGCGGCCAGC TCGGCCGCAT GATCGCGCTC GCGGCAGCCA ATTACGGCCT CAAGGTGCAC
ATCTACGCCC CCGATGCCGA CAGCCCGGCC TTCGACGTGG CCCATGCCCA CACGCTGGCG
CCCTACGACG ACGCGGCGGC GCTGGCCGCC TTCGCCGATG CCTGCGACGT GGTCACCTAC
GAGTTCGAGA ACATCCCCCA CGCCACCGCC GCCGTGCTCG CCGAGCACGC GACCCTGCGC
CCGAGCGCGA CGGCGCTGCT CACGACGCAG GATCGCCTGT CCGAGAAGGA CTTCGTGACC
TCGCTCGGCA TTCCGACCGC GCCCTACCGG GCGGTCGATA CGGTCGAGGA TCTCGTGCGG
TCCCTGGAGG CGCTCGGCCG CCCCGCCGTG TTGAAGACCC GGCGCTTCGG CTACGACGGG
AAGGGCCAGC GGATGATCCG CGAGGGCGAC GACCCGGCTG CCCTCCTCGC CGAGTTCAAG
GGCGCGCCCT GCATCCTCGA AGGGTTCGTG CCGTTCGAGC GCGAAATCTC GGTGGTCGCC
GCCCGCGGGC CGGACGGGAC CTTCGCGGCC TACGACCCCT GCGCCAACGA GCACCGCGAC
CATATCCTTG CGCTCACCCG CGTGCCCGCT CCCGGCCTGA CCCGGACGAC GGGTGACGCG
GCGGTCGCCA TCGCCCGCGC CATCGCCGAG GCGCTGGACT ATGTCGGGGT GCTTGCGGTC
GAGATGTTCG AGATCGCCGG GCCCGAGGGG GCCGCCCGCC TCGTCGTCAA CGAGATCGCG
CCCCGCGTCC ACAATTCCGG GCACTGGACC ATCGAGGGCG CGCTGACCTC GCAATTCGCG
CAAACCGTGC GCGCGGTCTG CGGTTGGCCG CTCGGCGACA CCGCCCGCAC CGGCGGCATG
GCGGTGGAGA TGGAGAACCT CATCGGCGCC GAGGCCGATG CCTGGGCGGA CCTGCTAGCG
GAGCCGGGCG CCCATCTCCA CCTCTACGGC AAGGCCGAGG CCCGTCCCGG CCGCAAGATG
GGGCACGTCA CCCGGCTCAA GCCGCTCGAC TAA
 
Protein sequence
MASPTSSPQI RPGGTLGIVG GGQLGRMIAL AAANYGLKVH IYAPDADSPA FDVAHAHTLA 
PYDDAAALAA FADACDVVTY EFENIPHATA AVLAEHATLR PSATALLTTQ DRLSEKDFVT
SLGIPTAPYR AVDTVEDLVR SLEALGRPAV LKTRRFGYDG KGQRMIREGD DPAALLAEFK
GAPCILEGFV PFEREISVVA ARGPDGTFAA YDPCANEHRD HILALTRVPA PGLTRTTGDA
AVAIARAIAE ALDYVGVLAV EMFEIAGPEG AARLVVNEIA PRVHNSGHWT IEGALTSQFA
QTVRAVCGWP LGDTARTGGM AVEMENLIGA EADAWADLLA EPGAHLHLYG KAEARPGRKM
GHVTRLKPLD