Gene YpAngola_A0339 details

Gene Information

Locus tag: YpAngola_A0339
Symbol: nagE
ID: 5798803
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 353355
End bp: 355388
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 49%
IMG OID: 641338347
Product: PTS system N-acetylgalactosamine-specific transporter subunit IIABC
Protein accession: YP_001604947
Protein GI: 162419497
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
        [COG1264] Phosphotransferase system IIB components
        [COG2190] Phosphotransferase system IIA components
TIGRFAM ID: [TIGR00826] PTS system, glucose-like IIB component
            [TIGR00830] PTS system, glucose subfamily, IIA component
            [TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component
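
The coordinate and length fields in this record agree with one another, and the relationship can be checked with a little arithmetic. The sketch below assumes that the Start bp and End bp values are 1-based, inclusive positions on the replicon and that the unspliced CDS ends in a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(353355, 355388)
print(length, protein_length_aa(length))  # prints: 2034 677
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.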


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.210629
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGTATTC TTGGTTACTT ACAAAAAGTA GGTCGAGCGT TGATGGTCCC TGTGGCCACA 
CTGCCTGCGG CCGCGATATT GATGGGTGTA GGCTACTGGA TAGACCCAGT CGGTTGGGGG
GCTGATAATG CACTGGCGGC ATTATTCATC AAATCTGGTG CTGCAATAAT CGAAAACATG
TCGGTACTGT TCGCAATTGG TGTTGCTTAC GGTATGTCAA AAGATAAAGA CGGTGCTGCG
GCACTGACCG GTTTTGTCGG TTTCCTGGTA TTAACCACGC TGTGTTCGCC AGCAGCGGTT
TCGATGATCA AACAGATTCC ACTTGATCAA GTCCCTGCCG CATTTGGCAA AATCGAAAAC
CAGTTCGTGG GTATTTTGGT CGGTATTATC TCTGCTGAGC TTTATAACCG TTTCAGCGGC
GTTGAACTGC CAAAAGCGCT CTCTTTCTTC AGTGGTCGTC GTTTGGTTCC AATCTTGACG
TCCTTCCTGA TGATCGCAGT GGCCTTTATG CTGATGTACA TATGGCCACT GATTTATAAC
GCATTAGTGA CCTTCGGTGA ATACATCAAA GATCTGGGTT CTGTTGGTGC GGGTATCTAT
GCTTTCTTCA ACCGCTTACT GATCCCTGTT GGTCTGCATC ACGCTCTGAA CTCGGTGTTC
TGGTTTGACG TCGCTGGGAT TAACGATATT CCTAACTTCC TCGGCGGCCA AGAATCGATC
AATAAAGGCA CCGGCATTGT CGGTATCACT GGCCGTTATC AGGCAGGTTT CTTCCCGATT
ATGATGTTTG GTTTACCGGG TGCTGCGTTG GCAATTTACC ACTGCGCACG TCCAGAAAAT
AAAGCAAAAG TGGCGGGTAT CATGATGGCG GGGGCATTTG CAGCCTTCTT CACAGGTATC
ACTGAGCCTC TTGAATTCTC CTTCATGTTT GTGGCACCTG TGCTGTACTT CTTGCACGCC
GTGTTGACCG GTATTTCCGT ATTCATCGCC GCCAGCATGC ATTGGATCGC GGGCTTTGGT
TTCAGTGCCG GTTTAGTGGA TATGGTGCTC TCTTCCCGTA ACCCGTTGGC TACGCAATGG
TACATGCTGA TCCCACAAGG TCTGATATTC TTTGTGATTT ACTACTTAGT ATTCCGTTTC
ACTATCCAGA AATTCAACTT ATTGACACCT GGCCGTGAGC TGGCGGTGGA AGGCAGTGAA
GAAGATGGTT ACGACGTAAA TGTTGATAAA ACGCCAGCAG TGAATGAAAG TGAAATCAAT
AGCCTTGCTC GTCGCTATAT TGGTGCCATC GGTGGTTCAG ACAACCTGAC GGCCATTGAT
GCCTGCATTA CCCGCCTACG CCTGAATGTT AAAGATTCAG CCTTGGTTAA TGACAGTGTA
GCCAAACGCT TAGGGGCCTC TGGTGTTATT CGTCTGAATA AGCAAAGTGT ACAAATTATC
GTTGGTACTC GCGCAGAACT GATTGCCGCA GCAATGCGCA CGGTGCTGGC GGGTGGCCCA
ATCCCCGCGG CAAGCAGCAA TGCTGCGCCT ACGGGTGCAA GACCGCAGGC GGTCATTAAC
ACCGCGAAAA CGGCTTCTTT AGTTCTGGTT TCGCCAATTA CCGGTGATGT TGTGCCATTG
GCGCAGGTTC CTGATGAGGC TTTTGCTAGC AAAGCGGTAG GTGAGGGGGT TGCTATCCGT
CCTACAGACA AAATAGTTGT TTCTCCCGCT AGCGGCACTA TTGTGAAAAT CTTCAATACG
GATCATGCGT TCTGCCTGGA AACGGAAACC GGTGCTGAGA TTGTGGTTCA TATCGGGATT
GATACCGTGA AGCTTAACGG CCAGGGCTTT ACCCGTTTAG TTGAAGAGGG GACGACAGTG
GTTGCTGGTC AGCCGGTGTT AGAACTGGAT CTGGCTTATC TGAATGCCAA CGCGCATTCA
ATGATCAGCC CAGTTGTCGT CAGTAATATT GATGACTACG CGGGGATCTC GCTGTTGGCG
AGCGGTTCAG TCGTTGCCGG TCAAAGCCAA TTATTTGAGA TTCGTGGCAA ATAA
 
Protein sequence
MSILGYLQKV GRALMVPVAT LPAAAILMGV GYWIDPVGWG ADNALAALFI KSGAAIIENM 
SVLFAIGVAY GMSKDKDGAA ALTGFVGFLV LTTLCSPAAV SMIKQIPLDQ VPAAFGKIEN
QFVGILVGII SAELYNRFSG VELPKALSFF SGRRLVPILT SFLMIAVAFM LMYIWPLIYN
ALVTFGEYIK DLGSVGAGIY AFFNRLLIPV GLHHALNSVF WFDVAGINDI PNFLGGQESI
NKGTGIVGIT GRYQAGFFPI MMFGLPGAAL AIYHCARPEN KAKVAGIMMA GAFAAFFTGI
TEPLEFSFMF VAPVLYFLHA VLTGISVFIA ASMHWIAGFG FSAGLVDMVL SSRNPLATQW
YMLIPQGLIF FVIYYLVFRF TIQKFNLLTP GRELAVEGSE EDGYDVNVDK TPAVNESEIN
SLARRYIGAI GGSDNLTAID ACITRLRLNV KDSALVNDSV AKRLGASGVI RLNKQSVQII
VGTRAELIAA AMRTVLAGGP IPAASSNAAP TGARPQAVIN TAKTASLVLV SPITGDVVPL
AQVPDEAFAS KAVGEGVAIR PTDKIVVSPA SGTIVKIFNT DHAFCLETET GAEIVVHIGI
DTVKLNGQGF TRLVEEGTTV VAGQPVLELD LAYLNANAHS MISPVVVSNI DDYAGISLLA
SGSVVAGQSQ LFEIRGK
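
The sequence blocks above are printed in space-separated groups of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how one might strip the whitespace, compute GC content, and translate the gene. It assumes the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ in which start codons they permit). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only the first line of the gene sequence is included as an excerpt.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0 and stop at the first stop codon.

    Alternative start codons (for example GTG or TTG) are not converted to
    methionine here; this gene starts with ATG, so that case does not arise.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown in this record.
excerpt = clean("ATGAGTATTC TTGGTTACTT ACAAAAAGTA GGTCGAGCGT TGATGGTCCC TGTGGCCACA")
print(translate(excerpt))  # prints: MSILGYLQKVGRALMVPVAT
```

The translated excerpt matches the first twenty residues of the protein sequence listed in the record. Passing the full cleaned gene sequence to `gc_percent` and `translate` is a way to cross-check the GC content and protein fields.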