Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A1043 |
Symbol | |
ID | 5799506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | + |
Start bp | 1068355 |
End bp | 1069554 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641339031 |
Product | type IV pilin biogenesis protein |
Protein accession | YP_001605603 |
Protein GI | 162421450 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1459] Type II secretory pathway, component PulF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.0343376 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCCGCC ATCGTTTATT CAATTGGACA GCTCTCAACA AAACAGGGGA GCTACAGACG GGCATGCTAC TGGCAACTGA GAGAAACAGT GTCTATGAAC ATATCATCCA ACATGGCTTA CAGCCCCTAG GCGTGAAAGG GGGAAGGCGG TTATCTGCAC GCTACTGGCA AGGGGAACGG TTGGTGGCAA TGACCCGCCA ATTAGCGACT TTATTGCAAG CGGGTTTACC GTTGGTCAAC AGTTTACAAT TATTGGCAAA AGAGGCAGAT GACTCAGCAT GGCGTTGCCT ATTAGATGAG ATAAGTCAGC AAGTTGCACA AGGCCAGTCA CTCTCGGAAG TGATGGAACA GTATCCACAC GTATTCCCCC GACTGTATCC TCCAGTGGTT GCCGTCGGGG AGCTTACGGG TAATCTTGAG CAATGTTGTA CTCAATTAGT ACACCATCAG GAACGGCAGC AAAATTTACA CAAAAAAGTC ATAAAAGCGC TGAAATACCC CGTTGTGGTC TGCATCGTCG CATTGGTAGT CAGTGTCATT ATGTTAGTCA TGGTGTTACC CGAATTTGCA CAAATATATC AATCGTTTGA TACCCCCCTG CCGGGGCTAA CTGCAAGCTT ACTGTGGCTA TCAACTTTTC TCACTTTTTA TGGCCCCTAT CTGGCGCTGA TAATAGCAAT AGTGTGTATT GGGTATTTCT ATACATTACG AAAAAAATCT CGCTGGCAGC AATGGGAACA GACCATTCTA TTAAGCATTC CCTTAGTCTC AACATTAATC CGTGGTAGCT GCCTCAGCCA AATTTTTCAA ACGTTAGCTA TCACACAACA AGCCGGGCTA CCCCTGTCAG CCGGGTTAGA TGCAGCGGCT CGATCCATTC ACAACTACAA TTATCAGCAA GCCTTAAGGT GTATTCAAAA ACAAATTAGC CAAGGTATAC CGCTGTATAC CACTCTTAAT CAGCACCCTT TATTTCCTGC CATTTGTCAG CAGCTCATTA GGGTTGGTGA AGAATCAGGC TCACAGGATG TGTTGCTGGA AAAGTTAGCC TGTTGGCATC AGCAACAAAC TCAGAATTTG GCAGATAACG TCACTCAAAT GCTAGAGCCA CTTCTTATGC TGATTATTGG CAGTATTGTC GGCGTACTGG TGATCGCCAT GTATTTACCC ATATTCCAGT TAGGTGATGT TATTGGATAA
|
Protein sequence | MSRHRLFNWT ALNKTGELQT GMLLATERNS VYEHIIQHGL QPLGVKGGRR LSARYWQGER LVAMTRQLAT LLQAGLPLVN SLQLLAKEAD DSAWRCLLDE ISQQVAQGQS LSEVMEQYPH VFPRLYPPVV AVGELTGNLE QCCTQLVHHQ ERQQNLHKKV IKALKYPVVV CIVALVVSVI MLVMVLPEFA QIYQSFDTPL PGLTASLLWL STFLTFYGPY LALIIAIVCI GYFYTLRKKS RWQQWEQTIL LSIPLVSTLI RGSCLSQIFQ TLAITQQAGL PLSAGLDAAA RSIHNYNYQQ ALRCIQKQIS QGIPLYTTLN QHPLFPAICQ QLIRVGEESG SQDVLLEKLA CWHQQQTQNL ADNVTQMLEP LLMLIIGSIV GVLVIAMYLP IFQLGDVIG
|
| |