Gene YpAngola_A2082 details


Gene Information

Locus tag: YpAngola_A2082
Symbol:
ID: 5800552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 2157186
End bp: 2158454
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 52%
IMG OID: 641339996
Product: carbohydrate ABC transporter periplasmic-binding protein
Protein accession: YP_001606542
Protein GI: 162419038
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
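
The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is a minimal consistency check, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon excluded from the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2157186
end_bp = 2158454
gene_length_bp = 1269
protein_length_aa = 422

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons. The last codon is taken to be
# the stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(span, 3)

print(span == gene_length_bp)           # span matches the Gene Length field
print(remainder == 0)                   # length is a multiple of three
print(codons - 1 == protein_length_aa)  # codons minus stop matches Protein Length
```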


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.890991
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0137332
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATAA AAAAAATAGG TATCGCAGGT ATTATCGGCA CGTTGCTGAT GGCGGGTAAC 
GCCAGCGCAC AGGAAACCCT CCGTGTACTG CTCGAAGGGC ACAGCACCAG CGACTCGATA
AAAGCACTGT TACCCGAATT CGAAAAGCAG ACCGGTATTA AGGTTCAGGC AGAGATAGTA
CCTTACAGCG ATCTGACCTC TAAAGCCCTG CTGGCCTTCT CCTCGCACAG TGGACGTTAC
GACGTGGTTA TGGATGACTG GGTGCATGCG GTAGGTTACG CCTCTGCTGG TTATATCACA
CCTGTAGATC AGTGGATGGA GAGTGATACC GCCTTCTACG ATGGTGCGGA TTTCGTCAAA
AGCTATGCTG ATACGCTGCG TTATAAAGAC GGTTATTACG GGCTGCCAGT CTATGGTGAA
AGTACCTTCC TGATGTACCG CAAAGACCTG TTTGAACAGT ACGGTATCGC CGTGCCGAAA
ACCTTTGATG AGCTGACCGC TGCGGCAAAA ACCATCAAAG AGAAGACCGA AGGTAAGGTG
GCGGGTATTA CGCTCCGTGG AGCTCAGGGG ATCCAGAACA CCTTTGCATG GGCGTCATTC
CTCTGGGGTT ACGGCGGCCA GTGGATTGAC GACAACGGAA AATCTGCAAT TACTTCGCCA
CAGGCGGTAG AAGCCACCAA GTCATTCGTC AATATCCTGA AAAACTACGG GCCGATCGGC
GCGGCTAACT TCGGCTGGCA GGAAAACCGC TTGGTATTCC AGCAGGGCAA AGCGGCAATG
ACTATCGATT CGACAGTGAA CGGGGGCTTC AACGAAGACC CGAAAGAGTC CACGGTCGTC
GGTAAAGTGG GCTATGCCCC GGTACCGGTA CAGCCAGGCG ATCATCCGGG TAACAGCGGC
GCACTTCAGG TGCATGGCTT GTATATCTCC AGCGACAGTA AGAAGCAGGA TGCTGCCTGG
AAATTTATCA GTTGGGCAAC GGACAAACAG ACGCAGATGA AGTCGGTCGA ACTGAATCCT
AACGCCGGTG TGAGTTCACT CAGTGCCATC AACAGTGATG CCTTCACCAA GCGTTACGGG
GCCTTTAAGG ATGGTATGCT CGCAGCATTG CAAAACGGCA ATGCGAAATA CCTCCCAACC
ATTCCGCAGT CTACACAGAT TATCAACATA ACCGGTATTG CTCTATCCGA GGCACTGGCA
GGTACTCAGA CAGTAGAAAA TGCCCTTCAG CAAGCCAACA CCCGTAATGA TAAAGCGTTG
TCCCGTTAA
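
A GC content figure such as the one listed in the Gene Information fields can be recomputed from the gene sequence. The sketch below is one plausible way to do it, assuming GC content means the percentage of G and C among the A, C, G and T bases; the helper name `gc_content` is hypothetical, and the record's own rounding or method may differ.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq.

    Whitespace and any other characters (for example the spaces that
    separate the 10-base blocks above) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Example on the first 10-base block of the gene sequence only.
first_block = "ATGAGCATAA"
print(gc_content(first_block))
```

To check the full gene, pass the complete gene sequence text (including spaces and line breaks) to `gc_content` and compare the rounded result with the GC content field.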
 
Protein sequence
MSIKKIGIAG IIGTLLMAGN ASAQETLRVL LEGHSTSDSI KALLPEFEKQ TGIKVQAEIV 
PYSDLTSKAL LAFSSHSGRY DVVMDDWVHA VGYASAGYIT PVDQWMESDT AFYDGADFVK
SYADTLRYKD GYYGLPVYGE STFLMYRKDL FEQYGIAVPK TFDELTAAAK TIKEKTEGKV
AGITLRGAQG IQNTFAWASF LWGYGGQWID DNGKSAITSP QAVEATKSFV NILKNYGPIG
AANFGWQENR LVFQQGKAAM TIDSTVNGGF NEDPKESTVV GKVGYAPVPV QPGDHPGNSG
ALQVHGLYIS SDSKKQDAAW KFISWATDKQ TQMKSVELNP NAGVSSLSAI NSDAFTKRYG
AFKDGMLAAL QNGNAKYLPT IPQSTQIINI TGIALSEALA GTQTVENALQ QANTRNDKAL
SR
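
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, assuming the amino acid assignments of NCBI translation table 11 (which are the same as the standard code; table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; first base
# varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last three codons of the gene sequence above.
print(translate("ATGAGCATAA AAAAAATAGG T"))
print(translate("TCCCGTTAA"))
```

Passing the full gene sequence to `translate` should yield a string that can be compared against the protein sequence once its spaces and line breaks are removed.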