Gene YpAngola_A1044 details


Gene Information

Locus tag: YpAngola_A1044
Symbol: guaC
ID: 5799507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 1069981
End bp: 1071024
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 47%
IMG OID: 641339032
Product: guanosine 5'-monophosphate oxidoreductase
Protein accession: YP_001605604
Protein GI: 162421031
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0516] IMP dehydrogenase/GMP reductase
TIGRFAM ID: [TIGR01305] guanosine monophosphate reductase, eukaryotic
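The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1069981, 1071024)
print(length, protein_length(length))  # prints: 1044 347
```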


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.0547285
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCATTG AAGAAGGTTT GAAATTAGGC TTTAAGGATG TGTTAATCCG TCCTAAACGT 
TCAACGCTGA AAAGCCGTTC TGAAGTTGCT CTGGAACGTC AGTTTACTTT CAAACATTCA
GGTTGGAATT GGTCTGGTGT CCCTATCATA GCCGCTAATA TGGATACCGT TGGTACCTTC
CGCATGGCTG AGGTTTTGGC TTCATTTGAT ATTCTTACTG CCGTTCACAA GCATTACACT
CTCGAACAAT GGGCTGAGTT CGTTAAGCGC TCACCAGAAT CAGTGTTACG CCATGTCATG
GTATCAACGG GCACTTCCTC TGCTGATTTC GACAAAATGA AACAAATTTT GGCGTTATCA
CCATCATTAA AATTTATCTG TATTGATGTC GCGAACGGCT ACTCAGAACA CTTTGTTTCT
TTCCTGCAAA GAGCGCGTGA AGCTTGTCCT GATAAAGTCA TTTGTGCGGG TAATGTCGTG
ACAGGTGAAA TGGTAGAGGA ACTGATCCTC TCTGGTGCTG ATATCGTTAA AGTCGGTATT
GGCCCTGGTT CTGTTTGTAC CACCCGTGTT AAGACTGGCG TTGGCTACCC ACAACTGTCT
GCTGTCATTG AGTGCGCCGA CGCTGCTCAT GGCCTTGGGG GCCAAATTGT CAGCGATGGT
GGCTGTTCTG TTCCAGGTGA TGTGGCTAAA GCTTTTGGTG GTGGTGCCGA TTTCGTGATG
CTAGGTGGCA TGTTGGCAGG CCATGATGAG TGTGAAGGGC GCGTTGTCGA AGAGAATGGC
GAGAAGTTCA TGCTGTTTTA CGGGATGAGT TCTGAATCTG CGATGAAACG CCATGTCGGT
GGTGTTGCAC AATACCGTGC GGCAGAAGGT AAAACGGTTA AGTTACCACT GCGTGGTTCA
GTCGATAATA CCGTGCGTGA CATCATGGGA GGCCTACGTT CTGCATGTAC TTATGTGGGC
GCTTCACATT TGAAAGAATT AACGAAGCGT ACGACGTTTA TTCGCGTAGC AGAGCAAGAA
AACCGCGTAT TTGGCACTGA TTGA
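
The GC content field can be recomputed from the gene sequence. This is a minimal sketch that assumes the reported figure is the percentage of G and C bases over the full gene, rounded to a whole number; `gc_content` is a hypothetical helper, and the whitespace between the 10-base blocks is stripped before counting.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block spacing and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Usage on the first line of the gene sequence above.
first_line = "ATGCGCATTG AAGAAGGTTT GAAATTAGGC TTTAAGGATG TGTTAATCCG TCCTAAACGT"
print(round(gc_content(first_line), 1))
```

Passing the whole gene sequence as one string gives the value to compare against the GC content field.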
 
Protein sequence
MRIEEGLKLG FKDVLIRPKR STLKSRSEVA LERQFTFKHS GWNWSGVPII AANMDTVGTF 
RMAEVLASFD ILTAVHKHYT LEQWAEFVKR SPESVLRHVM VSTGTSSADF DKMKQILALS
PSLKFICIDV ANGYSEHFVS FLQRAREACP DKVICAGNVV TGEMVEELIL SGADIVKVGI
GPGSVCTTRV KTGVGYPQLS AVIECADAAH GLGGQIVSDG GCSVPGDVAK AFGGGADFVM
LGGMLAGHDE CEGRVVEENG EKFMLFYGMS SESAMKRHVG GVAQYRAAEG KTVKLPLRGS
VDNTVRDIMG GLRSACTYVG ASHLKELTKR TTFIRVAEQE NRVFGTD
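
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so the standard assignments are used here. The `translate` function is illustrative and is not the database's own tooling.

```python
from itertools import product

# Amino acids for all 64 codons, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace in the input (block spacing, line breaks) is ignored.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first three blocks of the gene sequence give the first protein block.
print(translate("ATGCGCATTG AAGAAGGTTT GAAATTAGGC"))  # prints: MRIEEGLKLG
```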