Gene YpAngola_A1070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1070 
Symbol 
ID5799533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1094883 
End bp1098170 
Gene Length3288 bp 
Protein Length1095 aa 
Translation table11 
GC content53% 
IMG OID641339055 
Producthypothetical protein 
Protein accessionYP_001605627 
Protein GI162421844 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.243584 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGGCA ACATGTTGAA AAACATGCTG GCCGGCGACA TCATGCTCAA CCTGTTTGAC 
ACCCCAAGGC TGGATATGGC GATTGCCAAT ATCTTCCTGC GCCAGTTGGA CGATAGCGGT
ATTGTGCGGG TTACCCCACT GTTGTTTCAC AATGACCAAC TCGTCACCTA TAAGAACGCG
CAGGATGAAA TCATCTGGCA GACCACGGCT CCAGACTTCA CTGCCTTTGT GACTCTCTCC
TTTAGCCATG AGCAGGAAGA GACTTACTAC TACAGTGTGC GGGTTGAAAA TCACAGCGCG
CAGGCGCTAC GCTATGATCT GATCTACGGT CAGGATCTCT CACTGTCCGA TGCCGGTGCG
ACCAAGACCA ATGAATCCTA CTGCAGTCAG TATCTGGATC ATAAGGTCTT CTCGCTGGAT
AAGTATGGCT ATACCGTATC CTCTCGGCAA AACCTACCAC AGAGCACCGG TAATCCCCTG
CTACAACTCG GTAGCTTCTC ACCCGCAGTA GGTTTCTCTA CCGATGGTTA CCAGTTCTTT
GCCAAGCAGT ACAAGTTTAG CCACTTGCCC ACCATCGTCA CTGAACCATC GCTGGAGAAC
CGGAACTACC AGTATGAAAT GGCCTATGTC GCTCTACAGC TACAGCCAGT CACGCTGTCC
GCGGGTGACA GCGCAGACAG CGTATTTTAC GGTTTCTATC TCAGCCACCA GCCAGAGGCC
AATATTGCAC AGGCCTTTGA TGTTGCGCGG ATCCGGGCTA ACTATCGCCA GCCACAGCGA
GAACCCGCTA CCGAGCAGGC ATCCCCAACT CAGCAGGGCT ATGATCAGCG GCCACTGTCG
GGAGACAAGC TAACGGCAGA AGAGATCGAA CAACTGTTTG ACGGTGAAAA ACAGTTTGTT
GAGCAGTTGG ACGGTGAACT GCTATCGTTT TTCTATCAGG AGGCTAACTA CGTCACGTTG
GCAGAAAAAG AGCGCCATCT GGAGCGACCG ACCGGCCATA TTATTTCCTC TGGCAATAAT
ATCGACTTCA CTAATCCCAT CATGAATTCT ACCCACTATA TCTATGGGGT ATTCAACTCG
CATCTGACGC TGGGTAACAC CTCCTTTAAT AAGCTGTTGG GCGTCAATCG TAATATGCTC
AATCAGTTCA AGAGCAGTGG TCAGCGCATT CTGGTACAGA TCGGCGGCGA ATATCGCATT
CTGGCGATGC CCTCTGCATA TGAAGTGGGT GCCAACTTCT CTCGCTGGAT ATACAAGCTG
GCCGAGGGCA TGATTCAGGT CCGCGCCTTC GCCAGCCAGA GTGAGCCGGT TATTCAATTG
GATATTGCTG TCTCAGGCCA TAGCCAACCG CTCAACGTTA TTGTCAGCCA CCAACTGATC
ATGGGCAATC TGGAGGAGGA AGCGACCGTA CAGGTTGAAC GCCATGGCGA TCTGCTGCAG
ATCAGCCGCA CGGGCGATGA TCATGGCCCC CGCTTTAGCA TCGCCACTCG CGGCGGGTTC
ACGGGTGTCG AAAGATACGT CGATAGCGAC AGCCAAGGGG TACAGTACCT GTTGCTACAA
GGACAGATTG CCCAGCAGGC CAGCATCGCC TTCGGTGGCG TGCTAAACGG CGTAGATAGC
CGCGGAAAGT GGCTGGATTT TGAACACGAG CGGCAGGCTT ATCACGCACA GTATCGTGCA
CTGCTAAACG ACTTCTCGGT GAGTTTTTCT GCTGCACCGC AGCAAGCGCA GAAGCTGAAT
CACGCCATGC ACTGGTTCAC CCATAATGCG CTCACCCACT ACAGCTCACC GCATGGTTTA
GAGCAGCCAG CCGGTGCCGC TTGGGGGACC CGCGACGTTT CACAGGGGCC GATAGAGTTC
TTCATGGCCA TGGGGCGCTA TCAGCAGGTT GAAGCGATTC TGTGCCAGAC TTATCGCCAT
CAGTATCTGG AAACCGGCAC CTGGCCGCAG TGGTTTATGT TTGATGAATA TGCTCAGGTA
CAACAGCAGG AATCGCACGG CGATATTGTG GTCTGGCCGC TGAAAGCGTT GGCCGATTAT
CTGTTAGCCA CGGATCGCGT CGCGTTATTG GACACGCGTC TGTCCTACAC CAGCATCAAA
CAGAATTTCG CCTTTACCGG CGAGCAGGAG ACGCTGCTGC AACATGTTCA GCGGCAGATA
GACCATATCG TGGCGCATCT GGTACCCGGT ACCTATCTTT CCAGCTACGG TGACGGTGAC
TGGGATGATA CGTTGCAGCC CGCCAATCAG TCGCTGCGGG AAAATATGGT CAGTGGCTGG
ACTATTCCGC TGACGCTGCA AACGCTGAAA ACCTTGACCA AGGCGCTACA GGCTTATCCG
CAGTTTGCCG ATTTTATCGC GCGTATCGTG ACGCTAACCA GCAACATGGA AGCGGATTAC
CATAAGTATT TGATCAAGGA CGGCGTGATC AGTGGCTTTA TTCACTTTAA TCAGGGGGAG
GCGGAATACC TACTGCATCC TACCGATACC ACGACCCAGA TCAAGTACCG CCTGTTGCCC
GCCAAGCGCT CAATTATCTC CGAGTCGTTC GATAAAGAGA TGGCCGAGCA GCACATGAAG
ATCATCATGG ATAACCTAAT GTATCCTGAT GGCGTGCGCC TGATGGACCG GATGGCGGAG
TACAAGGCCG GTAAGCAGAC TTACTTCAAA CGGGCCGAAC TGTCCGCTAA TCTGGGGCGT
GAAATCGGCC TACAATACTG CCACGCCCAT ATTCGCTTTA TAGAAGCACT CTGCAAAATG
GGGATGGCGC AGGAGCTGTA CGATAACCTG TTTAAAACCA TCCCTGTGGG GATCCAGGAG
AGCGTGCCTA ACGCCGAGCT GCGCCAGGCA AACAGCTACT TCTCCAGTTC CGATGCCAAG
TTTGATGATC GCTACCAGGC TTATAACAAC TTCGATCAAT TGAAAACCGG TGCCGTAGCC
GCGAAAGCGG GCTGGCGTAT TTACTCCAGC GGCCCTGGGA TCTATATCAA CCAGATCGTT
TCCAACGTAC TTGGTGTGCG TTATCAGGCC GGCGATCTGT TGCTGGATCC GGTGATCAGT
CGGCAGTTTG GTGATGTGAC GCTAAACTAT CAACTCTATA ACCTTCCGGT CACGCTGCGC
ATCTATCCAC AACAGGGGGA GTTTACCCCG AAGCGTGTGC TACTCGATGG TCAGTCGCTG
GCGTTTACGT TGCAGGATAA TCCCTATCGT AGTGGGGCCG CACTGATTCA CCGCCAGGAG
ATAGAAGGGC GTCTGACGGC ACACAGTCAG CTAGAGATTT ACCTGTAG
 
Protein sequence
MQGNMLKNML AGDIMLNLFD TPRLDMAIAN IFLRQLDDSG IVRVTPLLFH NDQLVTYKNA 
QDEIIWQTTA PDFTAFVTLS FSHEQEETYY YSVRVENHSA QALRYDLIYG QDLSLSDAGA
TKTNESYCSQ YLDHKVFSLD KYGYTVSSRQ NLPQSTGNPL LQLGSFSPAV GFSTDGYQFF
AKQYKFSHLP TIVTEPSLEN RNYQYEMAYV ALQLQPVTLS AGDSADSVFY GFYLSHQPEA
NIAQAFDVAR IRANYRQPQR EPATEQASPT QQGYDQRPLS GDKLTAEEIE QLFDGEKQFV
EQLDGELLSF FYQEANYVTL AEKERHLERP TGHIISSGNN IDFTNPIMNS THYIYGVFNS
HLTLGNTSFN KLLGVNRNML NQFKSSGQRI LVQIGGEYRI LAMPSAYEVG ANFSRWIYKL
AEGMIQVRAF ASQSEPVIQL DIAVSGHSQP LNVIVSHQLI MGNLEEEATV QVERHGDLLQ
ISRTGDDHGP RFSIATRGGF TGVERYVDSD SQGVQYLLLQ GQIAQQASIA FGGVLNGVDS
RGKWLDFEHE RQAYHAQYRA LLNDFSVSFS AAPQQAQKLN HAMHWFTHNA LTHYSSPHGL
EQPAGAAWGT RDVSQGPIEF FMAMGRYQQV EAILCQTYRH QYLETGTWPQ WFMFDEYAQV
QQQESHGDIV VWPLKALADY LLATDRVALL DTRLSYTSIK QNFAFTGEQE TLLQHVQRQI
DHIVAHLVPG TYLSSYGDGD WDDTLQPANQ SLRENMVSGW TIPLTLQTLK TLTKALQAYP
QFADFIARIV TLTSNMEADY HKYLIKDGVI SGFIHFNQGE AEYLLHPTDT TTQIKYRLLP
AKRSIISESF DKEMAEQHMK IIMDNLMYPD GVRLMDRMAE YKAGKQTYFK RAELSANLGR
EIGLQYCHAH IRFIEALCKM GMAQELYDNL FKTIPVGIQE SVPNAELRQA NSYFSSSDAK
FDDRYQAYNN FDQLKTGAVA AKAGWRIYSS GPGIYINQIV SNVLGVRYQA GDLLLDPVIS
RQFGDVTLNY QLYNLPVTLR IYPQQGEFTP KRVLLDGQSL AFTLQDNPYR SGAALIHRQE
IEGRLTAHSQ LEIYL