Gene Information |
Locus tag | Msil_2288 |
Symbol | |
ID | 7091415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 2478377 |
End bp | 2479477 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643465612 |
Product | phosphoribosylaminoimidazole carboxylase, ATPase subunit |
Protein accession | YP_002362582 |
Protein GI | 217978435 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
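The length fields in this record are consistent with one another if the coordinates are read as 1-based and inclusive (an assumption about this record's convention, not something the page states). Under that assumption the gene length is End bp minus Start bp plus one, and the protein length is the number of codons minus the terminating stop codon. A minimal Python sketch of that arithmetic follows; the function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(2478377, 2479477)
print(length_bp, protein_length(length_bp))  # 1101 366
```

The result matches the Gene Length (1101 bp) and Protein Length (366 aa) rows.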
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.299764 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTTTA CGCTGTCCCC CGGCGCGCTC GCGCCCGGTT CAAGGATCGG CATTCTCGGC GGCGGCCAGC TGGGCCGCAT GCTGGCGATG GCGGCCGCGC GGCTCGGCCT TCACGCGCAT ATTTATGCGC CGGAGCCGGA CAGCACGGCT TTTGAGGTCT GCCGCGAGCG CACCCTTGCC GCCTATGAGG ATGAGGAAGC GCTCGCCCGC TTTGCGGAAA GCGTCGACGC CGTCACTTAC GAATTCGAAA ATGTTCCGGC CCGGACGGCG GCGCTGCTCG CGGGCCTGCG CCCGGTCCGG CCAAGCGCCG CGGCGCTTGC GGTCTGTCAG GATCGCCTGA TCGAGAAAGA ATTTCTCGCC GACATTGGCG TCGCCACCGT CAATTTCATG CAGGTCGATC ATGCCGGCGC GATGGCGCGG GCGGTGGCGC AGCTCGGCCG GCCCTCGATC CTGAAGACGC GGCGGTTCGG CTACGACGGC AAGGGCCAGG TGCTGGTCCG CGAAGGCGCG GACCTCGCCG TGACCTTCCG CTCGCTTGGC GGCGGTCCGG CCATTCTCGA AGCTGTGGCG CCCTTCACAA AAGAGATTTC CGTCGTCGCG GCGCGCGGCG CGAGCGGCGA ATTCGCCGCC TTCGACGTCT GCGAGAACAC GCATGAGAAC CACATTCTGA AATTCACCAC GGCGCCCGCA AGGATCGCCG TGCAGACGGC GGCCGAGGCC GTCCTGCTGA CGCGGGCGAT CGCCGAGGCG CTCGATTATG TCGGCGTCCT CGCCGTCGAG ATGTTCGTTA TCGAGCCCGC GGGCGGCGGC GAGCAACTGC TCGTCAATGA AATCGCGCCG CGCGTGCATA ATTCGGGTCA TTGGACCCTC GATGGCGCGG CGACCTCGCA ATTTGAGCAG CATATCCGCG CCATCGCCGG CTGGCCGCTC GGCGCGACGT TCCTCAATGG GAGCGAAGTC GAGATGGAAA ATCTGATCGG CGAGGATATT TATGCGTTCG AGGCGATTTT GCGCGAGCCC GGCGCCTGTC TCCACCTCTA TGGCAAGGCC GAAGCTCGGG CGGGCCGCAA GATGGGGCAT GTGACCCGCA TCCGGCGCTA G
|
Protein sequence | MAFTLSPGAL APGSRIGILG GGQLGRMLAM AAARLGLHAH IYAPEPDSTA FEVCRERTLA AYEDEEALAR FAESVDAVTY EFENVPARTA ALLAGLRPVR PSAAALAVCQ DRLIEKEFLA DIGVATVNFM QVDHAGAMAR AVAQLGRPSI LKTRRFGYDG KGQVLVREGA DLAVTFRSLG GGPAILEAVA PFTKEISVVA ARGASGEFAA FDVCENTHEN HILKFTTAPA RIAVQTAAEA VLLTRAIAEA LDYVGVLAVE MFVIEPAGGG EQLLVNEIAP RVHNSGHWTL DGAATSQFEQ HIRAIAGWPL GATFLNGSEV EMENLIGEDI YAFEAILREP GACLHLYGKA EARAGRKMGH VTRIRR
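The relationship between the two sequences can be checked with a short script. The sketch below is a hedged illustration, not the method the database used. It assumes that the spaces in the sequences are display grouping only, that the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and that translation table 11 assigns amino acids to codons exactly as the standard code does, differing only in permitted start codons. The names `clean`, `gc_percent`, and `translate` are hypothetical helpers, and only the first 30 bases of the gene are used to keep the example short.

```python
# Codon table built in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence above.
prefix = "ATGGCGTTTA CGCTGTCCCC CGGCGCGCTC"
print(translate(prefix))   # MAFTLSPGAL
print(gc_percent(prefix))  # 70.0
```

The translated prefix agrees with the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` would be the way to compare against the GC content row.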