Gene Information |
Locus tag | YpAngola_A1733 |
Symbol | |
ID | 5800204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | - |
Start bp | 1789561 |
End bp | 1790616 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641339671 |
Product | hypothetical protein |
Protein accession | YP_001606226 |
Protein GI | 162421001 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2152] Predicted glycosylase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.625558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.000641846 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGAGGT ATTCCGGCAA TCCGTTAATC ACACCCCAGG ATATTACGCC AAGCCATCCG GGATTTAAGG TTGAATGCGC CTTTAATGCG GGCATTACCA AACTGGGCGA TGAGACGATT ATGTTGATTC GTATCGCGGA GAGCGTTATC CCTGAAAGTG AGGGTATTAC CGTCGTGCCT TTGCTTATGG AAATTTCCGG GCACTGGCAG GTGACCACGC GGGTATTTAA GCACGATGAC CCCGCCTACG ATTTTAGTGA TCCTCGGCTG ATCAGTGATA TTGCCGATCC ATCGACCGTG TACCTGACAT CTCTCTCCCA TCTTCGTGTG GCAAGAAGCC ATGATGGCGT TACTTTCGTC GTTGACGATC AACCCTTTAT CTTCCCGGCC AACGGTGATG AAGCTTTTGG TTGTGAGGAT GCTCGCATTA CTCAGATCGA CGGCGCTTAC TATATTAACT ATTCCGCAGT GTCAGCAAAG GGCATTTGCA CCGCGCTGGC GGTTACCCAG GATTTCACCG ACGTTAAGCG TTTGGGGCTG ATTTTCTGCC CTGATAATCG CGATGCGTGT CTGTTCCCGG AAAAAATCAA CGGTAAATAT ATGACGTTGC ACCGCCCGGC ACCTAAACAC TTCGGTAAAC CCGAGATTTG GCTGGCCAGC TCTCTGGATC TGCTGCATTG GGGAGATCAT CGGCATTTAC TGGGGACATC GCAAGACCCT TGGGATGCCT TGAAACTGGG GGGCGGGGCG CAAATGCTGA AAACAGAAAA AGGGTGGCTG CAAATTTACC ATGGGGTAGA TGCGACCCAA CGCTATTCTC TCGGTGCGCT GCTGCTAGAC CTGGAACAGC CTACTAAAAT CCTTGCTAAA TCGCCGGTAC CGTTGCTTCA ACCGCAGGCT CCTTATGAGC TGCACGGCTT TTTTGGTAAC GTCGTCTTTA CCTGCGGCGC ACTCATTGAG AACGATAGCC TACGGGTCTA TTACGGTGCG GCGGATGAAT GTATGTGCCT GGCTGAAATG CCGCTGGCTC AACTGTGGCA GCACCTGTCG GTGTAA
Protein sequence | MMRYSGNPLI TPQDITPSHP GFKVECAFNA GITKLGDETI MLIRIAESVI PESEGITVVP LLMEISGHWQ VTTRVFKHDD PAYDFSDPRL ISDIADPSTV YLTSLSHLRV ARSHDGVTFV VDDQPFIFPA NGDEAFGCED ARITQIDGAY YINYSAVSAK GICTALAVTQ DFTDVKRLGL IFCPDNRDAC LFPEKINGKY MTLHRPAPKH FGKPEIWLAS SLDLLHWGDH RHLLGTSQDP WDALKLGGGA QMLKTEKGWL QIYHGVDATQ RYSLGALLLD LEQPTKILAK SPVPLLQPQA PYELHGFFGN VVFTCGALIE NDSLRVYYGA ADECMCLAEM PLAQLWQHLS V