Gene Mmcs_3059 details


Gene Information

Locus tag: Mmcs_3059
Symbol:
ID: 4111891
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 3235346
End bp: 3236116
Gene Length: 771 bp
Protein Length: 256 aa
Translation table: 11
GC content: 70%
IMG OID: 638032189
Product: phosphoribosyl isomerase A
Protein accession: YP_640222
Protein GI: 108800025
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase
TIGRFAM ID: [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase
            [TIGR01919] 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase/N-(5'phosphoribosyl)anthranilate isomerase
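
The coordinates and lengths listed above agree with each other if Start bp and End bp are read as 1-based inclusive positions and the gene length is taken to include the terminal stop codon. Both readings are assumptions about this record's conventions, not something the page states. A minimal sketch of the check, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(3235346, 3236116)
    print(length, "bp")                   # gene length from the coordinates
    print(protein_length(length), "aa")   # protein length implied by the CDS
```

Under these assumptions the values reproduce the 771 bp and 256 aa given in the record.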


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTCCGTCG TGCCAGAGAA GTCGGTGTCC GAGAAAAGAC CGTTGATCCT CCTCCCCGCC 
GTCGATGTCG TCGAAGGCCG TGCCGTGCGC CTGGTACAGG GCAAGGCCGG CAGTGAAACC
GAGTACGGGT CGGCGCTCGA CGCCGCGCTC GGCTGGCAGC GCGACGGGGC CGAGTGGATC
CATCTGGTGG ATCTCGACGC CGCGTTCGGG CGGGGGTCGA ACCGCGAACT GCTCGCCGAC
GTCGTGGGCC GCCTCGATGT GGCGGTCGAA CTGTCCGGCG GCATCCGTGA CGACGAGTCG
CTCGAGGCAG CGCTGGCCAC CGGATGCGCC CGGGTCAACA TCGGCACCGC CGCGCTGGAG
AACCCGCAGT GGTGCGCGAA AGTCGTCGCC GAGTTCGGCG ACAAGGTGGC AGTGGGCCTC
GACGTCAAGA TCGTCGACGA TCAGCACCGC CTGCGCGGAC GGGGTTGGGA GACCGACGGC
GGGGACCTGT GGGAGGTGCT CGACCGCCTC GACTCCGAAG GCTGCTCGCG CTACGTCGTC
ACCGACGTGA CCAAGGACGG CACCCTTCAG GGGCCGAACC TCGATCTGCT CGGCCGCGTC
GCCGACCGCA CCGATGCGCC GGTGATCGCC TCCGGCGGGG TGTCCAGCCT CGACGATCTG
CGCGCGATCG CCACGTTGAC CGACCGGGGC GTCGAGGGTG CGATCGTCGG CAAAGCGCTG
TACGCCGGGC GCTTCACGCT GCCCGAGGCG CTGGCAGCGA TGGGGCAGTA G
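
The GC content field in the record can be recomputed from a sequence printed in this layout. The sketch below is one way to do it, assuming the sequence is supplied as plain text in blocks of ten bases separated by whitespace; the function name is hypothetical and the example uses only the first block of the gene sequence.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C among the bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First ten bases of the gene sequence, used only as a short example.
    print(f"{gc_fraction('TTGTCCGTCG'):.0%}")
```

Passing the full gene sequence, spaces and line breaks included, gives the whole-gene value to compare against the record.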
 
Protein sequence
MSVVPEKSVS EKRPLILLPA VDVVEGRAVR LVQGKAGSET EYGSALDAAL GWQRDGAEWI 
HLVDLDAAFG RGSNRELLAD VVGRLDVAVE LSGGIRDDES LEAALATGCA RVNIGTAALE
NPQWCAKVVA EFGDKVAVGL DVKIVDDQHR LRGRGWETDG GDLWEVLDRL DSEGCSRYVV
TDVTKDGTLQ GPNLDLLGRV ADRTDAPVIA SGGVSSLDDL RAIATLTDRG VEGAIVGKAL
YAGRFTLPEA LAAMGQ
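
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, where TTG can serve as an alternative start codon and an initiating codon is read as methionine. The sketch below illustrates that rule with a hand-built codon table; the set of start codons is written from memory of the NCBI table 11 definition and should be treated as an assumption, and the function name is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Start codons assumed for table 11 (assumption, not taken from this record).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading an allowed first codon as M, up to the first stop."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons of the gene sequence above.
    print(translate_cds("TTGTCCGTCG TGCCAGAGAA G"))
```

For the first seven codons this yields MSVVPEK, matching the start of the protein sequence listed above.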