Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A2374 |
Symbol | |
ID | 5800844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | - |
Start bp | 2491614 |
End bp | 2494817 |
Gene Length | 3204 bp |
Protein Length | 1067 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641340256 |
Product | fibronectin type III domain-containing protein |
Protein accession | YP_001606800 |
Protein GI | 162421384 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.213672 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.639425 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACGTA AACCAATTAA AGGCCGCAAA GGTGGGGGCA GCAATGCCAC AACGCCAGTT GAGTCACCGG ACAGTATTCA ATCGACGGCA AGAGCTAAAA TACTCATTGC TTTGGGTGAG GGGGAGTTCG CCGGAGGTTT GGATGGAACC AATATCTATC TGGACGGCAC ACCTATAAAG AACTCTGACG GTACTAGTAA TTTCACTGGG GTTACTTGGG AGTATCGTCC CGGCACGCAG GCTCAGGACT ACATTCAAGG AATGCCAAAT GTCGAGAATG AGATAACGGT TAACACAGAG CTTAAATCAG ATACGCCATG GGTGCGCTCC ATCACAAATA CCCAACTCTC GGCTACACGT GTTCGTCTTG GATGGCCCTC ATTACAGCGT CAGGCGGACA ATGGTGATGT TGGCGGTTAT CGCATTGAGT ACGCCGTCGA TGTGGCAACG GATGGTGGCG CATATCAAAC ACTGCTTGAT ACAGCCATTG ATGGGAAAAC AACAACTTTA TATGAACGCT CGCACAGAAT AAACCTACCC AAGGCCACAG CTGGTTGGCA GGTTCGTACA AGGCGAAAAA CAGCCAATGC CAACTCTGGC CGCATTGCCG ACAAGATGAA TGTCGAAGCT ATTTCTGAAG TCATCGATGC CAAGTTACGT TACCCAAATA CCGCGCTTCT CTATATAGAA TTCGACGCAA CTCAATTTCA GAATATCCCT ATTATCTCAT GTGAGCCTAA AGGCCGGATT ATCCGCGTAC CTACTACATA TGATCCAGTA ACGCGTACCT ACTCTGGTGT GTGGGATGGT TCATTTAAAT GGGCTCATAC CAACAACCCA GCCTGGGTAT TCTACAACAT TGTATTAGCA GATCGCTTTG GCCTTGGTCA TCGGATTGAG GTCAGCCAGG TAGATAAGTG GGAGCTGTAC CGAATTGGTC AATACTGCGA TCAGCTTATT CCTGATGGTC GGGGCGGTAG TGGTACTGAG CCTCGTTTTA CCTGCGATGT GTATATTCAG TCTCAGGCCG AGGCATTTAC TGTATTGCGT GATTTGGCCG CCATTTTTCG GGGCATGACC TATTGGGGAA ATAATCAGCT TTGCACCCTG GCAGATATGC CACGAGATGT GGACTATATA TTTACCCGTG CCAGTGTGAT TGACGGACGA TTCACTTACG GTGGTGGTTC CGAGAAAAAG CGCTATACAA CCGCAATGGT GAGCTGGAGT GACCCCGCAA ATAACTGTCA GGATGCAATC GAGGCAGTGT CAGATAACGA CTTGGTTCGT CGCTACGGTG TCAATCAGCT TGATATGACG GCTATCGGCT GTATCCGGCA AACTGAGGCG AATAGGCGTG GACGTTGGGC GCTACTGACA AACAGTAAAG ACCGGACTGT TAATTTTAAT GTAGGGTTAG ACGGGGCCAT TCCGTTGCCC GGTCATATCA TTGGTGTTGC GGATGATATG CTCTCTGGTC GGAAGATGGG CGGTCGCATT AGCTCAGTAT CGGGCCGGAA TATCACTCTT GACCGTGTTG CTGATGTGAA AGCAGGTGAC CGGCTACTTG TTAACTTACC AAACGGTGTA GCTCAGGGCA GAACGGTGCA AGTGGTCAAC GGGAAAGTAA TCACTGTCAC AACGGCTTAC AGTGAAGTGC CAGCAGCGGA AAGCGGTTGG TCTGTTGATG CGGATGATTT AGCTATCCAG CAATATCGGG TTACTGGTAT TTCTGACAAT GACGACAATA CATACAGTAT CTCATCTGTT CAGCATGATC CGGACAAATA TGAGCGAATT GATACGGGCG CTCGGATTGA TGAAAGACCC ATCAGCGTAA TCCCGCCCGG CGTCCAGCCA CCTCCGACAA ATGTTGTTAT TGATAGCTTC TCAGCACTTT CACAAGGGCT CGCAATAACC ACCCTACGTG TTACGTGGGA ACCAGCAGCC AGCGCGATAG CATACGAGGC TGAATGGCGA CGTGATAACG GAAACTGGAT ATCAGCACCG CGCACATCTG CTCAGGGATT TCAGGTTGAA GGTATTTATG CTGGACAATA TCAGGCTCGC GTTCGTGCTA TTAACCCCTC AGAAATATCC AGTATTTGGG CTAATGCTCA GGAAACCACA TTAAACGGTA AAGAGGGAAA TCCTCCAATG CCAGTTGGAT TTACAGCTAC AGGCATTCTC TTTGGCATCA CTCTCAATTG GGGATACCCT GAAGGAGCCG AAGATGCGTT AAAAACAGAG ATTGAATATA GCCTGTCTGC TGACGGCACC GATGCCATGC TGTTGAGTGA TGTGCCGCAT CCGCAACGGA ACTACACTAT GCAGGGGTTG AGAGCAGGGC AGGTGTTCTG GTTCCGTGCT CGGATAGTTG ATAAATCCGG TAATCAGTCG CCATGGATTG ATTGGGTTCG TGGCATGTCC AGCACAGACA CAAGCGCTAT TCTCGAAGCG ATTGGCGACG ACTTTATCAA TAACACAGTT GCGGGTCAGC AACTGATTAA TGATGACTTC ATGAATGCAG AGGGCATTCT CGAAACAGCG AAGGCCAATA ACGCCAGCAT CTGGCAGCAA TGGGCTCAAC ACGGAGAGAA TAAAGCCGGT GTTATCCACT TAACGACCAC TGTTGCCGAT GCTGAAAGAG CATTTGCTGA GTTTGAAACC CTTGTTACAG CAACATTTGA AGACCAGACA GCAGCGATAG ACCAAAAAAT GACAGCAGTT GTTGATGCCA ACGGGGCTAG TGCTACTTAT AGTTTAAGGG CCGGACTGAA TTATAACGGC CAGTTTGTCA GCGCAGGCAT GGTAATTGGT GCAGAGTTTA TTAATGGTGT AGCTAAATCC TCAATTGGTT TTACTGCCGA TCAATTTATA TTGCTCTCCG GTCCAACTGG TAATTTATTT TCGCCTTTTG CAGTGGTAAA TGGTCAAGTG TTTATGAATG ATGCATTTAT TGCAAAGGCA TCAATTGGGC GAGGAAAAAT AACAGATACC CTTGACTCAG ATAATTACGT GCAAGGAATA TCCGGTCTAA AACTGGATTT TAAAAATGGT AATGCTGAAT TTAACAATGT AAATCTCAGG GGGAATATAA CTATGGATAA CACGATTAAT GGTATTCGCA CCATAGTAGA TTATCGTGGG CAGAGGACAT ATCACGCAAA TGGTCAGCCA GCGATAATAT GCGGGTACTT CTAA
|
Protein sequence | MARKPIKGRK GGGSNATTPV ESPDSIQSTA RAKILIALGE GEFAGGLDGT NIYLDGTPIK NSDGTSNFTG VTWEYRPGTQ AQDYIQGMPN VENEITVNTE LKSDTPWVRS ITNTQLSATR VRLGWPSLQR QADNGDVGGY RIEYAVDVAT DGGAYQTLLD TAIDGKTTTL YERSHRINLP KATAGWQVRT RRKTANANSG RIADKMNVEA ISEVIDAKLR YPNTALLYIE FDATQFQNIP IISCEPKGRI IRVPTTYDPV TRTYSGVWDG SFKWAHTNNP AWVFYNIVLA DRFGLGHRIE VSQVDKWELY RIGQYCDQLI PDGRGGSGTE PRFTCDVYIQ SQAEAFTVLR DLAAIFRGMT YWGNNQLCTL ADMPRDVDYI FTRASVIDGR FTYGGGSEKK RYTTAMVSWS DPANNCQDAI EAVSDNDLVR RYGVNQLDMT AIGCIRQTEA NRRGRWALLT NSKDRTVNFN VGLDGAIPLP GHIIGVADDM LSGRKMGGRI SSVSGRNITL DRVADVKAGD RLLVNLPNGV AQGRTVQVVN GKVITVTTAY SEVPAAESGW SVDADDLAIQ QYRVTGISDN DDNTYSISSV QHDPDKYERI DTGARIDERP ISVIPPGVQP PPTNVVIDSF SALSQGLAIT TLRVTWEPAA SAIAYEAEWR RDNGNWISAP RTSAQGFQVE GIYAGQYQAR VRAINPSEIS SIWANAQETT LNGKEGNPPM PVGFTATGIL FGITLNWGYP EGAEDALKTE IEYSLSADGT DAMLLSDVPH PQRNYTMQGL RAGQVFWFRA RIVDKSGNQS PWIDWVRGMS STDTSAILEA IGDDFINNTV AGQQLINDDF MNAEGILETA KANNASIWQQ WAQHGENKAG VIHLTTTVAD AERAFAEFET LVTATFEDQT AAIDQKMTAV VDANGASATY SLRAGLNYNG QFVSAGMVIG AEFINGVAKS SIGFTADQFI LLSGPTGNLF SPFAVVNGQV FMNDAFIAKA SIGRGKITDT LDSDNYVQGI SGLKLDFKNG NAEFNNVNLR GNITMDNTIN GIRTIVDYRG QRTYHANGQP AIICGYF
|
| |