Gene Mlg_2236 details

Gene Information

Locus tag: Mlg_2236
Symbol:
ID: 4270268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2536972
End bp: 2538120
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 69%
IMG OID: 638126992
Product: phosphoribosylaminoimidazole carboxylase
Protein accession: YP_743068
Protein GI: 114321385
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
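
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2536972
end_bp = 2538120

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1149
print(protein_length)  # 382
```

Both results agree with the Gene Length and Protein Length fields of the record.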


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.434308
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGA TACTGTTGCC GGGGGCCACC CTGGGGGTGC TGGGCAACGG CCAGTTGGGC 
CGGATGTTCG CCCTGGCCGC GCGTCGCATG GGCTATCGGG TGGCGTGCTT CGGCCCGGGC
CGGGACAGTC CCGCCGGACA GGTCTGCGAC ATCGAGGTGA CCGCCGACTA CCGCGATGAG
CAGGCCCTGC GTGACTTTGC CCGCCGGGTG GATGGGGTCA CCTTTGAGTT CGAGAATGTG
CCGGCCGAGG CCGGTGATCT GCTGGCCGGG TACGTCCCGG TCCGCCCCCA CCATACGGTG
CTGCACGTGG CCCAGAACCG TTGGCGGGAA AAGACTTGGC TGAGCGAGCA GGGCTTTCCG
GTGGGGGCCT TTGCCACCGT GGAGCGGGAG GAGGAACTGG CTGCCGCCCT CGAACGGGTG
GGCACGCCCG CGGTGCTGAA GACCGCCGGC TTTGGCTATG ACGGCAAAGG CCAAGCGTTG
ATTCGGGAGC CTTCCGAGGC CGCCTGGGCG TGGGCGGCGA TCGGGGGGCA GGCGGCGGTG
CTGGAGGCCT TCGTGGATTT CCACATGGAG GTCTCGATGG TGGCCGCCCG CGGCTTGGAC
GGCAGCTTCA CCCACTACGG CGTGCTGGAG AACCGCCACC GTGATCACAT CCTCGATCTC
ACCCTGCCGG AGGCCCCGCT GGGGCCGCAG CTTCGTGAGC AGGCGGAGGA TGTCACCCGC
GGCATCCTCG AGGCGCTGGA TGTGGTCGGG GTGCTCTGCG TGGAGTTTTT TGTCGCCAGT
GACGGACGGC TGCTGGTCAA TGAGCTGGCC CCGCGCCCCC ATAACTCCGG CCACCTGACC
TTCGATGCGG CGATGACCAG CCAGTTTGAG CAGCAGGTGC GCGCCCTTTG CGGGCTGCCC
TTGGGGGACA GCCGATTGCT GCGGCCGGCG GCCATGGTGA ACTTGCTGGG GGATCTCTGG
GGCGATGGCG AGCCGGATTG GGCCGCGGCG CTCAAGGACC CGGAGGTCAA GCTCCACCTC
TACGGCAAGG CCGAGGCCAA GCCGGGGCGG AAGATGGGCC ACGTCACTGC CTTCGGTGAG
GACCGCGATG ACGCGGCGCG GCGGGCCCTG AGCGCCCGTG AGCGGCTGCA AGCGGGCGCC
GGAGGCTGA
 
Protein sequence
MSKILLPGAT LGVLGNGQLG RMFALAARRM GYRVACFGPG RDSPAGQVCD IEVTADYRDE 
QALRDFARRV DGVTFEFENV PAEAGDLLAG YVPVRPHHTV LHVAQNRWRE KTWLSEQGFP
VGAFATVERE EELAAALERV GTPAVLKTAG FGYDGKGQAL IREPSEAAWA WAAIGGQAAV
LEAFVDFHME VSMVAARGLD GSFTHYGVLE NRHRDHILDL TLPEAPLGPQ LREQAEDVTR
GILEALDVVG VLCVEFFVAS DGRLLVNELA PRPHNSGHLT FDAAMTSQFE QQVRALCGLP
LGDSRLLRPA AMVNLLGDLW GDGEPDWAAA LKDPEVKLHL YGKAEAKPGR KMGHVTAFGE
DRDDAARRAL SARERLQAGA GG
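
The gene and protein sequences can be cross-checked by translating the nucleotide sequence. The sketch below is a minimal, dependency-free illustration rather than the database's own method: it builds a codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons) and translates the first 60 bases and the last 9 bases of the gene. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are identical for
# the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line and last line of the gene sequence, copied from the record.
first_60 = "ATGAGCAAGA TACTGTTGCC GGGGGCCACC CTGGGGGTGC TGGGCAACGG CCAGTTGGGC"
last_9 = "GGAGGCTGA"

print(translate(first_60))  # MSKILLPGATLGVLGNGQLG
print(translate(last_9))    # GG*
```

The first output matches the first 20 residues of the protein sequence, and the second matches its final two residues followed by the stop codon.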