Gene YpAngola_A2217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2217 
Symbol 
ID5800687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2314030 
End bp2314986 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content53% 
IMG OID641340120 
Productsugar-binding domain-containing protein 
Protein accessionYP_001606665 
Protein GI162421120 
COG category[K] Transcription 
COG ID[COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0880863 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATAAGA GTGATAAGAA ACTGGATCAG GCGGCCAGAG CGGCCTGGAT GTATTATGTT 
GCTGGGCAAA CTCAGCATGA AATTGCCGAG GCGTTAGGCG TCTCTCGGCA GGTTGCACAG
CGGTTGGTGG CTTGCGCCAT TGATAATGGA TTGGTCAGTG TCAATATCAC GCACCCTGTG
GCTCGCTGTA TGGCGCTGGC CGAGCAATTA CAACAACGCT ATGGTTTACA CCGTTGTCAG
GTTGTCCCTA GCCAAGGCAT GGACAACGCG GGGGTTCAAC GTGCAATTGC CGTGGCGGGG
GCCGAGGTGA TGGCGCAGTT TCTACGACAA GAGCAACCCT TAGTTATTGG CGTCGGTTCA
GGGCGCAGCT TAAAGGCCGC CATCGATGAG TTGCCTGATG TTGAACGGCC GCAACATAGC
TGTGTATCAT TGATAGGGGC CATTGCCGCT GACGGCTCTT GCACCCGTTA CGACGTCCCG
TTGTGGATGG CTGAGAAAAC CCAAGGCCGC TATTTTATCC TGCCAGCCCC TTTATTTGCT
GACAGCGCCG CTGATCGTGA CCTGTGGTGT AACCATCGCA TTTATCGCAC CGTCACCGAA
AAAGCAGCGC AGGCCGATGT CACCTTTATC GGTATCGGTT CGATTGGCTA CCATTGCCCG
TTACACAAAG ATGGTTTTAT TTCGGCGCAA GATGTGGATG CCTTGCTCGC ATCGCATGTG
GTCGCGGAGA TGCTGGGTAA TTTTATTGAT GCTGATGGGC AACCCGTCCC GTACGCGTTG
GATCAATGCT TAACCAGCGT GAAATTACGT ATTCAGCCGG AAAAACCCGT GATCGCTATT
GCCGGCGGGA AAGAAAAGCA TCAGGCGATT AAAGCGGCTT TGAAGGGGCA GTGGCTGAAT
GGTTTAGTGA CTGATGAAGA GAGTGCCATG GTTCTGCTGG CGGAAGAAAA GGCATAA
 
Protein sequence
MHKSDKKLDQ AARAAWMYYV AGQTQHEIAE ALGVSRQVAQ RLVACAIDNG LVSVNITHPV 
ARCMALAEQL QQRYGLHRCQ VVPSQGMDNA GVQRAIAVAG AEVMAQFLRQ EQPLVIGVGS
GRSLKAAIDE LPDVERPQHS CVSLIGAIAA DGSCTRYDVP LWMAEKTQGR YFILPAPLFA
DSAADRDLWC NHRIYRTVTE KAAQADVTFI GIGSIGYHCP LHKDGFISAQ DVDALLASHV
VAEMLGNFID ADGQPVPYAL DQCLTSVKLR IQPEKPVIAI AGGKEKHQAI KAALKGQWLN
GLVTDEESAM VLLAEEKA