Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A1070 |
Symbol | |
ID | 5799533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | - |
Start bp | 1094883 |
End bp | 1098170 |
Gene Length | 3288 bp |
Protein Length | 1095 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641339055 |
Product | hypothetical protein |
Protein accession | YP_001605627 |
Protein GI | 162421844 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.243584 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGGCA ACATGTTGAA AAACATGCTG GCCGGCGACA TCATGCTCAA CCTGTTTGAC ACCCCAAGGC TGGATATGGC GATTGCCAAT ATCTTCCTGC GCCAGTTGGA CGATAGCGGT ATTGTGCGGG TTACCCCACT GTTGTTTCAC AATGACCAAC TCGTCACCTA TAAGAACGCG CAGGATGAAA TCATCTGGCA GACCACGGCT CCAGACTTCA CTGCCTTTGT GACTCTCTCC TTTAGCCATG AGCAGGAAGA GACTTACTAC TACAGTGTGC GGGTTGAAAA TCACAGCGCG CAGGCGCTAC GCTATGATCT GATCTACGGT CAGGATCTCT CACTGTCCGA TGCCGGTGCG ACCAAGACCA ATGAATCCTA CTGCAGTCAG TATCTGGATC ATAAGGTCTT CTCGCTGGAT AAGTATGGCT ATACCGTATC CTCTCGGCAA AACCTACCAC AGAGCACCGG TAATCCCCTG CTACAACTCG GTAGCTTCTC ACCCGCAGTA GGTTTCTCTA CCGATGGTTA CCAGTTCTTT GCCAAGCAGT ACAAGTTTAG CCACTTGCCC ACCATCGTCA CTGAACCATC GCTGGAGAAC CGGAACTACC AGTATGAAAT GGCCTATGTC GCTCTACAGC TACAGCCAGT CACGCTGTCC GCGGGTGACA GCGCAGACAG CGTATTTTAC GGTTTCTATC TCAGCCACCA GCCAGAGGCC AATATTGCAC AGGCCTTTGA TGTTGCGCGG ATCCGGGCTA ACTATCGCCA GCCACAGCGA GAACCCGCTA CCGAGCAGGC ATCCCCAACT CAGCAGGGCT ATGATCAGCG GCCACTGTCG GGAGACAAGC TAACGGCAGA AGAGATCGAA CAACTGTTTG ACGGTGAAAA ACAGTTTGTT GAGCAGTTGG ACGGTGAACT GCTATCGTTT TTCTATCAGG AGGCTAACTA CGTCACGTTG GCAGAAAAAG AGCGCCATCT GGAGCGACCG ACCGGCCATA TTATTTCCTC TGGCAATAAT ATCGACTTCA CTAATCCCAT CATGAATTCT ACCCACTATA TCTATGGGGT ATTCAACTCG CATCTGACGC TGGGTAACAC CTCCTTTAAT AAGCTGTTGG GCGTCAATCG TAATATGCTC AATCAGTTCA AGAGCAGTGG TCAGCGCATT CTGGTACAGA TCGGCGGCGA ATATCGCATT CTGGCGATGC CCTCTGCATA TGAAGTGGGT GCCAACTTCT CTCGCTGGAT ATACAAGCTG GCCGAGGGCA TGATTCAGGT CCGCGCCTTC GCCAGCCAGA GTGAGCCGGT TATTCAATTG GATATTGCTG TCTCAGGCCA TAGCCAACCG CTCAACGTTA TTGTCAGCCA CCAACTGATC ATGGGCAATC TGGAGGAGGA AGCGACCGTA CAGGTTGAAC GCCATGGCGA TCTGCTGCAG ATCAGCCGCA CGGGCGATGA TCATGGCCCC CGCTTTAGCA TCGCCACTCG CGGCGGGTTC ACGGGTGTCG AAAGATACGT CGATAGCGAC AGCCAAGGGG TACAGTACCT GTTGCTACAA GGACAGATTG CCCAGCAGGC CAGCATCGCC TTCGGTGGCG TGCTAAACGG CGTAGATAGC CGCGGAAAGT GGCTGGATTT TGAACACGAG CGGCAGGCTT ATCACGCACA GTATCGTGCA CTGCTAAACG ACTTCTCGGT GAGTTTTTCT GCTGCACCGC AGCAAGCGCA GAAGCTGAAT CACGCCATGC ACTGGTTCAC CCATAATGCG CTCACCCACT ACAGCTCACC GCATGGTTTA GAGCAGCCAG CCGGTGCCGC TTGGGGGACC CGCGACGTTT CACAGGGGCC GATAGAGTTC TTCATGGCCA TGGGGCGCTA TCAGCAGGTT GAAGCGATTC TGTGCCAGAC TTATCGCCAT CAGTATCTGG AAACCGGCAC CTGGCCGCAG TGGTTTATGT TTGATGAATA TGCTCAGGTA CAACAGCAGG AATCGCACGG CGATATTGTG GTCTGGCCGC TGAAAGCGTT GGCCGATTAT CTGTTAGCCA CGGATCGCGT CGCGTTATTG GACACGCGTC TGTCCTACAC CAGCATCAAA CAGAATTTCG CCTTTACCGG CGAGCAGGAG ACGCTGCTGC AACATGTTCA GCGGCAGATA GACCATATCG TGGCGCATCT GGTACCCGGT ACCTATCTTT CCAGCTACGG TGACGGTGAC TGGGATGATA CGTTGCAGCC CGCCAATCAG TCGCTGCGGG AAAATATGGT CAGTGGCTGG ACTATTCCGC TGACGCTGCA AACGCTGAAA ACCTTGACCA AGGCGCTACA GGCTTATCCG CAGTTTGCCG ATTTTATCGC GCGTATCGTG ACGCTAACCA GCAACATGGA AGCGGATTAC CATAAGTATT TGATCAAGGA CGGCGTGATC AGTGGCTTTA TTCACTTTAA TCAGGGGGAG GCGGAATACC TACTGCATCC TACCGATACC ACGACCCAGA TCAAGTACCG CCTGTTGCCC GCCAAGCGCT CAATTATCTC CGAGTCGTTC GATAAAGAGA TGGCCGAGCA GCACATGAAG ATCATCATGG ATAACCTAAT GTATCCTGAT GGCGTGCGCC TGATGGACCG GATGGCGGAG TACAAGGCCG GTAAGCAGAC TTACTTCAAA CGGGCCGAAC TGTCCGCTAA TCTGGGGCGT GAAATCGGCC TACAATACTG CCACGCCCAT ATTCGCTTTA TAGAAGCACT CTGCAAAATG GGGATGGCGC AGGAGCTGTA CGATAACCTG TTTAAAACCA TCCCTGTGGG GATCCAGGAG AGCGTGCCTA ACGCCGAGCT GCGCCAGGCA AACAGCTACT TCTCCAGTTC CGATGCCAAG TTTGATGATC GCTACCAGGC TTATAACAAC TTCGATCAAT TGAAAACCGG TGCCGTAGCC GCGAAAGCGG GCTGGCGTAT TTACTCCAGC GGCCCTGGGA TCTATATCAA CCAGATCGTT TCCAACGTAC TTGGTGTGCG TTATCAGGCC GGCGATCTGT TGCTGGATCC GGTGATCAGT CGGCAGTTTG GTGATGTGAC GCTAAACTAT CAACTCTATA ACCTTCCGGT CACGCTGCGC ATCTATCCAC AACAGGGGGA GTTTACCCCG AAGCGTGTGC TACTCGATGG TCAGTCGCTG GCGTTTACGT TGCAGGATAA TCCCTATCGT AGTGGGGCCG CACTGATTCA CCGCCAGGAG ATAGAAGGGC GTCTGACGGC ACACAGTCAG CTAGAGATTT ACCTGTAG
|
Protein sequence | MQGNMLKNML AGDIMLNLFD TPRLDMAIAN IFLRQLDDSG IVRVTPLLFH NDQLVTYKNA QDEIIWQTTA PDFTAFVTLS FSHEQEETYY YSVRVENHSA QALRYDLIYG QDLSLSDAGA TKTNESYCSQ YLDHKVFSLD KYGYTVSSRQ NLPQSTGNPL LQLGSFSPAV GFSTDGYQFF AKQYKFSHLP TIVTEPSLEN RNYQYEMAYV ALQLQPVTLS AGDSADSVFY GFYLSHQPEA NIAQAFDVAR IRANYRQPQR EPATEQASPT QQGYDQRPLS GDKLTAEEIE QLFDGEKQFV EQLDGELLSF FYQEANYVTL AEKERHLERP TGHIISSGNN IDFTNPIMNS THYIYGVFNS HLTLGNTSFN KLLGVNRNML NQFKSSGQRI LVQIGGEYRI LAMPSAYEVG ANFSRWIYKL AEGMIQVRAF ASQSEPVIQL DIAVSGHSQP LNVIVSHQLI MGNLEEEATV QVERHGDLLQ ISRTGDDHGP RFSIATRGGF TGVERYVDSD SQGVQYLLLQ GQIAQQASIA FGGVLNGVDS RGKWLDFEHE RQAYHAQYRA LLNDFSVSFS AAPQQAQKLN HAMHWFTHNA LTHYSSPHGL EQPAGAAWGT RDVSQGPIEF FMAMGRYQQV EAILCQTYRH QYLETGTWPQ WFMFDEYAQV QQQESHGDIV VWPLKALADY LLATDRVALL DTRLSYTSIK QNFAFTGEQE TLLQHVQRQI DHIVAHLVPG TYLSSYGDGD WDDTLQPANQ SLRENMVSGW TIPLTLQTLK TLTKALQAYP QFADFIARIV TLTSNMEADY HKYLIKDGVI SGFIHFNQGE AEYLLHPTDT TTQIKYRLLP AKRSIISESF DKEMAEQHMK IIMDNLMYPD GVRLMDRMAE YKAGKQTYFK RAELSANLGR EIGLQYCHAH IRFIEALCKM GMAQELYDNL FKTIPVGIQE SVPNAELRQA NSYFSSSDAK FDDRYQAYNN FDQLKTGAVA AKAGWRIYSS GPGIYINQIV SNVLGVRYQA GDLLLDPVIS RQFGDVTLNY QLYNLPVTLR IYPQQGEFTP KRVLLDGQSL AFTLQDNPYR SGAALIHRQE IEGRLTAHSQ LEIYL
|
| |