Gene Information |
Locus tag | Mlg_2236 |
Symbol | |
ID | 4270268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2536972 |
End bp | 2538120 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638126992 |
Product | phosphoribosylaminoimidazole carboxylase |
Protein accession | YP_743068 |
Protein GI | 114321385 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
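The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2536972, 2538120)
print(length)                     # 1149
print(protein_length_aa(length))  # 382
```

Both results agree with the Gene Length (1149 bp) and Protein Length (382 aa) fields.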
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.434308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAGA TACTGTTGCC GGGGGCCACC CTGGGGGTGC TGGGCAACGG CCAGTTGGGC CGGATGTTCG CCCTGGCCGC GCGTCGCATG GGCTATCGGG TGGCGTGCTT CGGCCCGGGC CGGGACAGTC CCGCCGGACA GGTCTGCGAC ATCGAGGTGA CCGCCGACTA CCGCGATGAG CAGGCCCTGC GTGACTTTGC CCGCCGGGTG GATGGGGTCA CCTTTGAGTT CGAGAATGTG CCGGCCGAGG CCGGTGATCT GCTGGCCGGG TACGTCCCGG TCCGCCCCCA CCATACGGTG CTGCACGTGG CCCAGAACCG TTGGCGGGAA AAGACTTGGC TGAGCGAGCA GGGCTTTCCG GTGGGGGCCT TTGCCACCGT GGAGCGGGAG GAGGAACTGG CTGCCGCCCT CGAACGGGTG GGCACGCCCG CGGTGCTGAA GACCGCCGGC TTTGGCTATG ACGGCAAAGG CCAAGCGTTG ATTCGGGAGC CTTCCGAGGC CGCCTGGGCG TGGGCGGCGA TCGGGGGGCA GGCGGCGGTG CTGGAGGCCT TCGTGGATTT CCACATGGAG GTCTCGATGG TGGCCGCCCG CGGCTTGGAC GGCAGCTTCA CCCACTACGG CGTGCTGGAG AACCGCCACC GTGATCACAT CCTCGATCTC ACCCTGCCGG AGGCCCCGCT GGGGCCGCAG CTTCGTGAGC AGGCGGAGGA TGTCACCCGC GGCATCCTCG AGGCGCTGGA TGTGGTCGGG GTGCTCTGCG TGGAGTTTTT TGTCGCCAGT GACGGACGGC TGCTGGTCAA TGAGCTGGCC CCGCGCCCCC ATAACTCCGG CCACCTGACC TTCGATGCGG CGATGACCAG CCAGTTTGAG CAGCAGGTGC GCGCCCTTTG CGGGCTGCCC TTGGGGGACA GCCGATTGCT GCGGCCGGCG GCCATGGTGA ACTTGCTGGG GGATCTCTGG GGCGATGGCG AGCCGGATTG GGCCGCGGCG CTCAAGGACC CGGAGGTCAA GCTCCACCTC TACGGCAAGG CCGAGGCCAA GCCGGGGCGG AAGATGGGCC ACGTCACTGC CTTCGGTGAG GACCGCGATG ACGCGGCGCG GCGGGCCCTG AGCGCCCGTG AGCGGCTGCA AGCGGGCGCC GGAGGCTGA
|
Protein sequence | MSKILLPGAT LGVLGNGQLG RMFALAARRM GYRVACFGPG RDSPAGQVCD IEVTADYRDE QALRDFARRV DGVTFEFENV PAEAGDLLAG YVPVRPHHTV LHVAQNRWRE KTWLSEQGFP VGAFATVERE EELAAALERV GTPAVLKTAG FGYDGKGQAL IREPSEAAWA WAAIGGQAAV LEAFVDFHME VSMVAARGLD GSFTHYGVLE NRHRDHILDL TLPEAPLGPQ LREQAEDVTR GILEALDVVG VLCVEFFVAS DGRLLVNELA PRPHNSGHLT FDAAMTSQFE QQVRALCGLP LGDSRLLRPA AMVNLLGDLW GDGEPDWAAA LKDPEVKLHL YGKAEAKPGR KMGHVTAFGE DRDDAARRAL SARERLQAGA GG
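The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. It assumes that translation table 11 shares the amino acid assignments of the standard genetic code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
# Standard genetic code in NCBI order (bases T, C, A, G); assumed to apply to table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
prefix = "ATGAGCAAGA TACTGTTGCC GGGGGCCACC"
print(translate(prefix))   # MSKILLPGAT
print(gc_percent(prefix))  # 60.0
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to the same functions would allow a comparison with the Protein Length and GC content fields.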