Gene YpAngola_A3017 details

Gene Information

Locus tag: YpAngola_A3017
Symbol: mglB
ID: 5801489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 3187072
End bp: 3188064
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 44%
IMG OID: 641340855
Product: galactose-binding protein
Protein accession: YP_001607385
Protein GI: 162418532
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
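
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so, but this is the reading that reproduces the stated 993 bp) and that the final codon is a stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates on the replicon.
START_BP = 3187072
END_BP = 3188064


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 993 330
```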


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.013396
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAGA AGGTTTTCAC ATTAGCAGCT TTGGTTACCA GCATGATGGT TGGCGCATAC 
GCTCAAGCTG AAACCCGTAT TGGCGTTACT ATTTATAAAT ATGATGACAA CTTTATGTCA
GTGGTCCGCA AAGCTATCGA AAAAGACGCG AAAGCTTCCC CTGAGATCAC TCTGCTGATG
AATGATTCCC AGAATGACCA ATCCAAGCAA AATGATCAGA TTGACGTATT GCTGGCTAAG
GGCGTGAAAG CTTTGGCAAT TAACCTGGTT GATCCCGCTG CGGCCCCAGT TGTAATTGAT
AAAGCACGTT CAAATGATAT TCCGATTGTA TTTTATAACA AAGAACCTTC TCGCAAGGCA
TTGGATAGCT ACGATAAAGC TTATTACGTC GGGACTGACT CGAAAGAATC TGGGGTTATT
CAGGGGGAGC TGATCGCTAA ACATTGGCAA GCTAATCCAG AGTGGGATCT GAACAAAGAT
GGTAAAATTC AGTTTGTGTT GCTGAAAGGT GAACCGGGTC ATCCAGATGC AGAGGCGCGT
ACTACCTATG TTATTAAGAC CCTGAATGAA AAAGGCTTGC CAACCCAACA ATTGCAGTTA
GACACCGCCA TGTGGGATAC CGCACAGGCT AAAGATAAGA TGGATGCATG GCTGTCTGGT
CCTAATGCAA ACAAAATTGA AGTAGTTATT GCCAACAATG ATGCGATGGC AATGGGTGCA
GTAGAAGCAC TGAAAGCACA CAATAAAACC AGCGTTCCAG TCTTTGGTGT CGATGCCTTA
CCAGAAGCGT TAGCGCTGGT TAAATCAGGC CAAATGGCGG GTACAGTGCT GAATGATGCC
AATAATCAGG CGAAAGCGAC CTTTGACTTG GCTAAAAATC TGGCGGCTGG CAAACCTGCA
GCAGAAGGGA CAACGTGGAA AATTGAAAAC AAAATCGTAC GTATTCCATA CGTAGGTGTT
GATAAAGATA ATCTGGCTGA ATTCACTAAA TAA
 
Protein sequence
MNKKVFTLAA LVTSMMVGAY AQAETRIGVT IYKYDDNFMS VVRKAIEKDA KASPEITLLM 
NDSQNDQSKQ NDQIDVLLAK GVKALAINLV DPAAAPVVID KARSNDIPIV FYNKEPSRKA
LDSYDKAYYV GTDSKESGVI QGELIAKHWQ ANPEWDLNKD GKIQFVLLKG EPGHPDAEAR
TTYVIKTLNE KGLPTQQLQL DTAMWDTAQA KDKMDAWLSG PNANKIEVVI ANNDAMAMGA
VEALKAHNKT SVPVFGVDAL PEALALVKSG QMAGTVLNDA NNQAKATFDL AKNLAAGKPA
AEGTTWKIEN KIVRIPYVGV DKDNLAEFTK