Gene Msil_2288 details


Gene Information

Locus tag: Msil_2288
Symbol:
ID: 7091415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2478377
End bp: 2479477
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 67%
IMG OID: 643465612
Product: phosphoribosylaminoimidazole carboxylase, ATPase subunit
Protein accession: YP_002362582
Protein GI: 217978435
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
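
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record above.
start_bp = 2478377
end_bp = 2479477
gene_length_bp = 1101
protein_length_aa = 366


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Codon count minus one, assuming an unspliced CDS with one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches record:", computed_gene_length == gene_length_bp)
print("protein length matches record:", computed_protein_length == protein_length_aa)
```

Under these assumptions both comparisons agree with the record: the span covers 1101 bp, which is 367 codons, or 366 amino acids once the stop codon is excluded.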


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.299764
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTTTA CGCTGTCCCC CGGCGCGCTC GCGCCCGGTT CAAGGATCGG CATTCTCGGC 
GGCGGCCAGC TGGGCCGCAT GCTGGCGATG GCGGCCGCGC GGCTCGGCCT TCACGCGCAT
ATTTATGCGC CGGAGCCGGA CAGCACGGCT TTTGAGGTCT GCCGCGAGCG CACCCTTGCC
GCCTATGAGG ATGAGGAAGC GCTCGCCCGC TTTGCGGAAA GCGTCGACGC CGTCACTTAC
GAATTCGAAA ATGTTCCGGC CCGGACGGCG GCGCTGCTCG CGGGCCTGCG CCCGGTCCGG
CCAAGCGCCG CGGCGCTTGC GGTCTGTCAG GATCGCCTGA TCGAGAAAGA ATTTCTCGCC
GACATTGGCG TCGCCACCGT CAATTTCATG CAGGTCGATC ATGCCGGCGC GATGGCGCGG
GCGGTGGCGC AGCTCGGCCG GCCCTCGATC CTGAAGACGC GGCGGTTCGG CTACGACGGC
AAGGGCCAGG TGCTGGTCCG CGAAGGCGCG GACCTCGCCG TGACCTTCCG CTCGCTTGGC
GGCGGTCCGG CCATTCTCGA AGCTGTGGCG CCCTTCACAA AAGAGATTTC CGTCGTCGCG
GCGCGCGGCG CGAGCGGCGA ATTCGCCGCC TTCGACGTCT GCGAGAACAC GCATGAGAAC
CACATTCTGA AATTCACCAC GGCGCCCGCA AGGATCGCCG TGCAGACGGC GGCCGAGGCC
GTCCTGCTGA CGCGGGCGAT CGCCGAGGCG CTCGATTATG TCGGCGTCCT CGCCGTCGAG
ATGTTCGTTA TCGAGCCCGC GGGCGGCGGC GAGCAACTGC TCGTCAATGA AATCGCGCCG
CGCGTGCATA ATTCGGGTCA TTGGACCCTC GATGGCGCGG CGACCTCGCA ATTTGAGCAG
CATATCCGCG CCATCGCCGG CTGGCCGCTC GGCGCGACGT TCCTCAATGG GAGCGAAGTC
GAGATGGAAA ATCTGATCGG CGAGGATATT TATGCGTTCG AGGCGATTTT GCGCGAGCCC
GGCGCCTGTC TCCACCTCTA TGGCAAGGCC GAAGCTCGGG CGGGCCGCAA GATGGGGCAT
GTGACCCGCA TCCGGCGCTA G
 
Protein sequence
MAFTLSPGAL APGSRIGILG GGQLGRMLAM AAARLGLHAH IYAPEPDSTA FEVCRERTLA 
AYEDEEALAR FAESVDAVTY EFENVPARTA ALLAGLRPVR PSAAALAVCQ DRLIEKEFLA
DIGVATVNFM QVDHAGAMAR AVAQLGRPSI LKTRRFGYDG KGQVLVREGA DLAVTFRSLG
GGPAILEAVA PFTKEISVVA ARGASGEFAA FDVCENTHEN HILKFTTAPA RIAVQTAAEA
VLLTRAIAEA LDYVGVLAVE MFVIEPAGGG EQLLVNEIAP RVHNSGHWTL DGAATSQFEQ
HIRAIAGWPL GATFLNGSEV EMENLIGEDI YAFEAILREP GACLHLYGKA EARAGRKMGH
VTRIRR
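
The relationship between the gene sequence and the protein sequence above can be illustrated by translating the first and last codons. This is a minimal sketch, not the pipeline used to produce the record: it builds the codon table from the standard amino-acid assignments, which translation table 11 is assumed here to share (the two tables differ in their permitted start codons). The helper name `translate` is hypothetical, and only the first 30 and last 21 nucleotides of the gene sequence are included.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 21 nt of the gene sequence, copied from the record.
gene_start = "ATGGCGTTTA CGCTGTCCCC CGGCGCGCTC"
gene_end = "GTGACCCGCA TCCGGCGCTA G"

print(translate(gene_start))  # MAFTLSPGAL
print(translate(gene_end))    # VTRIRR
```

The two outputs correspond to the first ten and last six residues of the protein sequence; the final TAG codon is the stop and contributes no residue.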