Gene YpAngola_A3243 details


Gene Information

Locus tag: YpAngola_A3243
Symbol: tas
ID: 5801719
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 3440094
End bp: 3441134
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 50%
IMG OID: 641341071
Product: putative aldo-keto reductase
Protein accession: YP_001607594
Protein GI: 162420318
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
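
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below shows one way to check them, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical and are not part of any database API.

```python
# Hypothetical helpers for cross-checking the record's length fields.
# Assumption: start/end are 1-based, inclusive coordinates on the replicon.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3440094, 3441134)
print(length, protein_length_aa(length))  # prints: 1041 346
```

Both results agree with the Gene Length (1041 bp) and Protein Length (346 aa) fields of this record.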


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000690689
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.0202789
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATATC ATCGTATCCC CCACAGTTCA TTGGAAGTAA GCCTGCTGGG TCTGGGCACC 
ATGACGTTTG GTGAGCAAAA CAGTGAAGCC GATGCCCACG CTCAACTGGA TTATGCCGTT
GCAGCCGGTA TTAACTTGAT TGATACCGCA GAAATGTACC CGGTGCCTCC AAGGCCAGAA
ACTCAGGGAT TAACTGAGCA ATATATTGGT CGCTGGATAA AAGCACGCGG TTGCCGCGAA
AAAATTATTT TAGCCAGTAA AGTCTCCGGG CCATCACGCG GTGATGATCA GCCCATTCGC
CCGAATATGG CATTGGATCG GAAGAATATC CGCATCGCGC TGGAAGAGAG CCTTAAGCGC
CTTAATACCG ATTATCTTGA TATTTATCAG TTACATTGGC CTCAGCGGGA AACAAACTGT
TTCGGTAAGC TGAATTATCG CTATAGCGAG CAAACTGCCG TTGTGACCTT GCTGGAAACA
CTGGAAGCCC TGAACGAGCA AGTGCGGGCC GGTAAAATTC GTTATATCGG GGTATCCAAT
GAAACACCAT GGGGTGTCAT GCGTTATCTG CAACTGGCAG AAAAGCATGA TCTACCGCGT
ATCGTCTCTA TTCAGAACCC TTACAGCCTG TTAAACCGTA GCTTTGAAGT GGGTCTGGCA
GAGATTAGCC AGCACGAAGG CGTTGAGTTA TTAGCTTATT CCAGCCTGGC TTTTGGCACA
CTGAGCGGCA AATACCTTAA TGGCGCGAAA CCTGCCGGTG CACGCAACAC CTTGTTCAGC
CGTTTCACCC GTTACTCTGG GCCACAAACC CAATTAGCGG TGGCTGAATA TGTGTCGCTG
GCAAAACACC ATGGGCTGGA TCCGGCGCAG ATGGCTCTGG CCTTTGTGCG GCAACAGCCG
TTTGTTGCCA GTACGCTACT CGGCGCAACG TCGCTGGAAC AACTGAAAAG TAATATTGAT
AGCCAAAATA TCGTGCTGAG TCAGGAAGTA CTGGATGCAC TGGAAGCGAT CCATACCCGC
TATACCTTCC CCGCACCTTA A
 
Protein sequence
MQYHRIPHSS LEVSLLGLGT MTFGEQNSEA DAHAQLDYAV AAGINLIDTA EMYPVPPRPE 
TQGLTEQYIG RWIKARGCRE KIILASKVSG PSRGDDQPIR PNMALDRKNI RIALEESLKR
LNTDYLDIYQ LHWPQRETNC FGKLNYRYSE QTAVVTLLET LEALNEQVRA GKIRYIGVSN
ETPWGVMRYL QLAEKHDLPR IVSIQNPYSL LNRSFEVGLA EISQHEGVEL LAYSSLAFGT
LSGKYLNGAK PAGARNTLFS RFTRYSGPQT QLAVAEYVSL AKHHGLDPAQ MALAFVRQQP
FVASTLLGAT SLEQLKSNID SQNIVLSQEV LDALEAIHTR YTFPAP
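
The gene and protein sequences above are printed in blocks of 10 characters, and the protein is the translation of the gene under translation table 11. The sketch below is a minimal, hand-rolled illustration of that relationship, not the pipeline that produced this record. The names `CODON_TABLE`, `clean`, `translate` and `gc_fraction` are hypothetical. The amino-acid string is the codon assignment of NCBI table 11, which matches the standard code; table 11 differs mainly in its alternative start codons, which this sketch ignores because this gene begins with ATG.

```python
# Codon assignments for NCBI translation table 11, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCAATATC ATCGTATCCC CCACAGTTCA"))  # prints: MQYHRIPHSS
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be cross-checked in the same way.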