Gene GM21_1817 details

Gene Information

Locus tag: GM21_1817
Symbol:
ID: 8137148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2117103
End bp: 2118149
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 61%
IMG OID: 644869428
Product: phosphoribosylaminoimidazole synthetase
Protein accession: YP_003021628
Protein GI: 253700439
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase
TIGRFAM ID: [TIGR00878] phosphoribosylaminoimidazole synthetase
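
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Number of residues, assuming one trailing stop codon is excluded."""
    return length_bp // 3 - 1


def gc_percent(dna):
    """Percentage of G and C bases; whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


length = gene_length_bp(2117103, 2118149)
print(length)                     # 1047
print(protein_length_aa(length))  # 348
```

Passing the gene sequence from the Sequence section to `gc_percent` gives a figure to compare against the reported GC content.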


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value: 0.00980836
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGAAAGAGA CTAAGATTAC CTATAAAGAC GCAGGTGTAG ACATAGATGC CGGCAACACT 
TTCGTGCAGA TGATCAAGCC GCTGGTCAAG GCGACTTCGC GTCCGGAAGT GCTGGCGGAC
ATCGGCGGTT TCGGCGGGTT GTTCTCCCTC AACATGGGCA AGTACAAGCA CCCGGTGCTT
GTCTCCGGCA CAGACGGGGT CGGGACCAAA CTGAAGCTCG CCTTCCTCGC CGACCGCCAT
GACACCATCG GCATCGACCT CGTCGCCATG TGCGTGAACG ACATCATCGT GCAGGGAGCC
GAGCCGCTCT TCTTCCTCGA TTATCTTGCC ACCGCGAAGC TCGACCCGGT TAAGGCCGCC
TCCATCATCA AAGGGGTGTC CGAGGGGTGC GTGCAGGCTG GGTGCGCCCT GATCGGCGGC
GAAACCGCCG AGATGCCCGG CTTCTACACC GGCGACGAGT ACGACATGGC CGGTTTTGCC
GTGGGGGTCG TCGAGCGCGA GAAAATCATC GACGGCTCCT CCATCACCGT CGGCAACCGC
CTGATCGGGT TGGCCTCCTC CGGGCTGCAC AGCAACGGCT ACTCCCTGGC CAGGAAGGTC
ATCCTCGAGC ACATGGGGCT CGGCATCGAC GACGAACTCC CCGGCCTCGG TAAAACCGTC
GCCGAAGAGC TCCTCACCCC GACCCGCATC TACGTGCGCA GCGTGATGAA CCTTTTGCGC
GACTTCAACA TCTCGGGCCT GGCCCACATC ACCGGCGGGG GTCTGCTGGA GAACATCCCC
CGCGTGCTTC CCAACGGCTG CAAGGCCGTC ATCAAGAAGG AGAGCTGGGA GGTCCCCGAG
ATATTCCGGA TCATGCAGAA GGCCGGCAAC ATCGAGGAAA ACGAGATGTT CAGGACCTTC
AACTGCGGCA TCGGCATGGT GCTGGTCGTT CCCGAGAAAG AGGCCGAGGA GATCATGATC
AGGCTCTCCG GGCTCAACGA GACCGCTTTC GTGATCGGCG AAGTGGCCAA GTGCGACGCC
GGCAAGGAGT GCGTGGAACT CGTTTAG
 
Protein sequence
MKETKITYKD AGVDIDAGNT FVQMIKPLVK ATSRPEVLAD IGGFGGLFSL NMGKYKHPVL 
VSGTDGVGTK LKLAFLADRH DTIGIDLVAM CVNDIIVQGA EPLFFLDYLA TAKLDPVKAA
SIIKGVSEGC VQAGCALIGG ETAEMPGFYT GDEYDMAGFA VGVVEREKII DGSSITVGNR
LIGLASSGLH SNGYSLARKV ILEHMGLGID DELPGLGKTV AEELLTPTRI YVRSVMNLLR
DFNISGLAHI TGGGLLENIP RVLPNGCKAV IKKESWEVPE IFRIMQKAGN IEENEMFRTF
NCGIGMVLVV PEKEAEEIMI RLSGLNETAF VIGEVAKCDA GKECVELV
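
The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of several alternative initiation codons, and an initiation codon is read as methionine. The following is a minimal sketch of that translation rule. The `translate` function and the constant names are hypothetical, written for illustration only, and the sketch assumes the input is in frame and contains only A, C, G and T.

```python
from itertools import product

# Codon table laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene sequence, followed by its TAG stop codon.
print(translate("TTGAAAGAGA CTAAGATTAC CTATAAAGAC TAG"))  # MKETKITYKD
```

Applied to the full gene sequence, the same function gives a translation to compare against the protein sequence listed in this record.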