Gene YpAngola_A2374 details

Gene Information

Locus tag: YpAngola_A2374
Symbol:
ID: 5800844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 2491614
End bp: 2494817
Gene Length: 3204 bp
Protein Length: 1067 aa
Translation table: 11
GC content: 48%
IMG OID: 641340256
Product: fibronectin type III domain-containing protein
Protein accession: YP_001606800
Protein GI: 162421384
COG category: [S] Function unknown
COG ID: [COG4733] Phage-related protein, tail component
TIGRFAM ID:
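
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields.
length = gene_length_bp(2491614, 2494817)
print(length)                     # 3204
print(protein_length_aa(length))  # 1067
```

Both results agree with the Gene Length and Protein Length fields listed for this locus.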


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.213672
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.639425
Fosmid hitchhiking: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACGTA AACCAATTAA AGGCCGCAAA GGTGGGGGCA GCAATGCCAC AACGCCAGTT 
GAGTCACCGG ACAGTATTCA ATCGACGGCA AGAGCTAAAA TACTCATTGC TTTGGGTGAG
GGGGAGTTCG CCGGAGGTTT GGATGGAACC AATATCTATC TGGACGGCAC ACCTATAAAG
AACTCTGACG GTACTAGTAA TTTCACTGGG GTTACTTGGG AGTATCGTCC CGGCACGCAG
GCTCAGGACT ACATTCAAGG AATGCCAAAT GTCGAGAATG AGATAACGGT TAACACAGAG
CTTAAATCAG ATACGCCATG GGTGCGCTCC ATCACAAATA CCCAACTCTC GGCTACACGT
GTTCGTCTTG GATGGCCCTC ATTACAGCGT CAGGCGGACA ATGGTGATGT TGGCGGTTAT
CGCATTGAGT ACGCCGTCGA TGTGGCAACG GATGGTGGCG CATATCAAAC ACTGCTTGAT
ACAGCCATTG ATGGGAAAAC AACAACTTTA TATGAACGCT CGCACAGAAT AAACCTACCC
AAGGCCACAG CTGGTTGGCA GGTTCGTACA AGGCGAAAAA CAGCCAATGC CAACTCTGGC
CGCATTGCCG ACAAGATGAA TGTCGAAGCT ATTTCTGAAG TCATCGATGC CAAGTTACGT
TACCCAAATA CCGCGCTTCT CTATATAGAA TTCGACGCAA CTCAATTTCA GAATATCCCT
ATTATCTCAT GTGAGCCTAA AGGCCGGATT ATCCGCGTAC CTACTACATA TGATCCAGTA
ACGCGTACCT ACTCTGGTGT GTGGGATGGT TCATTTAAAT GGGCTCATAC CAACAACCCA
GCCTGGGTAT TCTACAACAT TGTATTAGCA GATCGCTTTG GCCTTGGTCA TCGGATTGAG
GTCAGCCAGG TAGATAAGTG GGAGCTGTAC CGAATTGGTC AATACTGCGA TCAGCTTATT
CCTGATGGTC GGGGCGGTAG TGGTACTGAG CCTCGTTTTA CCTGCGATGT GTATATTCAG
TCTCAGGCCG AGGCATTTAC TGTATTGCGT GATTTGGCCG CCATTTTTCG GGGCATGACC
TATTGGGGAA ATAATCAGCT TTGCACCCTG GCAGATATGC CACGAGATGT GGACTATATA
TTTACCCGTG CCAGTGTGAT TGACGGACGA TTCACTTACG GTGGTGGTTC CGAGAAAAAG
CGCTATACAA CCGCAATGGT GAGCTGGAGT GACCCCGCAA ATAACTGTCA GGATGCAATC
GAGGCAGTGT CAGATAACGA CTTGGTTCGT CGCTACGGTG TCAATCAGCT TGATATGACG
GCTATCGGCT GTATCCGGCA AACTGAGGCG AATAGGCGTG GACGTTGGGC GCTACTGACA
AACAGTAAAG ACCGGACTGT TAATTTTAAT GTAGGGTTAG ACGGGGCCAT TCCGTTGCCC
GGTCATATCA TTGGTGTTGC GGATGATATG CTCTCTGGTC GGAAGATGGG CGGTCGCATT
AGCTCAGTAT CGGGCCGGAA TATCACTCTT GACCGTGTTG CTGATGTGAA AGCAGGTGAC
CGGCTACTTG TTAACTTACC AAACGGTGTA GCTCAGGGCA GAACGGTGCA AGTGGTCAAC
GGGAAAGTAA TCACTGTCAC AACGGCTTAC AGTGAAGTGC CAGCAGCGGA AAGCGGTTGG
TCTGTTGATG CGGATGATTT AGCTATCCAG CAATATCGGG TTACTGGTAT TTCTGACAAT
GACGACAATA CATACAGTAT CTCATCTGTT CAGCATGATC CGGACAAATA TGAGCGAATT
GATACGGGCG CTCGGATTGA TGAAAGACCC ATCAGCGTAA TCCCGCCCGG CGTCCAGCCA
CCTCCGACAA ATGTTGTTAT TGATAGCTTC TCAGCACTTT CACAAGGGCT CGCAATAACC
ACCCTACGTG TTACGTGGGA ACCAGCAGCC AGCGCGATAG CATACGAGGC TGAATGGCGA
CGTGATAACG GAAACTGGAT ATCAGCACCG CGCACATCTG CTCAGGGATT TCAGGTTGAA
GGTATTTATG CTGGACAATA TCAGGCTCGC GTTCGTGCTA TTAACCCCTC AGAAATATCC
AGTATTTGGG CTAATGCTCA GGAAACCACA TTAAACGGTA AAGAGGGAAA TCCTCCAATG
CCAGTTGGAT TTACAGCTAC AGGCATTCTC TTTGGCATCA CTCTCAATTG GGGATACCCT
GAAGGAGCCG AAGATGCGTT AAAAACAGAG ATTGAATATA GCCTGTCTGC TGACGGCACC
GATGCCATGC TGTTGAGTGA TGTGCCGCAT CCGCAACGGA ACTACACTAT GCAGGGGTTG
AGAGCAGGGC AGGTGTTCTG GTTCCGTGCT CGGATAGTTG ATAAATCCGG TAATCAGTCG
CCATGGATTG ATTGGGTTCG TGGCATGTCC AGCACAGACA CAAGCGCTAT TCTCGAAGCG
ATTGGCGACG ACTTTATCAA TAACACAGTT GCGGGTCAGC AACTGATTAA TGATGACTTC
ATGAATGCAG AGGGCATTCT CGAAACAGCG AAGGCCAATA ACGCCAGCAT CTGGCAGCAA
TGGGCTCAAC ACGGAGAGAA TAAAGCCGGT GTTATCCACT TAACGACCAC TGTTGCCGAT
GCTGAAAGAG CATTTGCTGA GTTTGAAACC CTTGTTACAG CAACATTTGA AGACCAGACA
GCAGCGATAG ACCAAAAAAT GACAGCAGTT GTTGATGCCA ACGGGGCTAG TGCTACTTAT
AGTTTAAGGG CCGGACTGAA TTATAACGGC CAGTTTGTCA GCGCAGGCAT GGTAATTGGT
GCAGAGTTTA TTAATGGTGT AGCTAAATCC TCAATTGGTT TTACTGCCGA TCAATTTATA
TTGCTCTCCG GTCCAACTGG TAATTTATTT TCGCCTTTTG CAGTGGTAAA TGGTCAAGTG
TTTATGAATG ATGCATTTAT TGCAAAGGCA TCAATTGGGC GAGGAAAAAT AACAGATACC
CTTGACTCAG ATAATTACGT GCAAGGAATA TCCGGTCTAA AACTGGATTT TAAAAATGGT
AATGCTGAAT TTAACAATGT AAATCTCAGG GGGAATATAA CTATGGATAA CACGATTAAT
GGTATTCGCA CCATAGTAGA TTATCGTGGG CAGAGGACAT ATCACGCAAA TGGTCAGCCA
GCGATAATAT GCGGGTACTT CTAA
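
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any statistic such as the GC content field can be recomputed. The sketch below shows one way to do this in Python; the helper names are hypothetical, and only the first line of the listing is used here as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-base blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above; paste the full listing to compare
# the whole gene against the reported GC content.
first_line = "ATGGCACGTA AACCAATTAA AGGCCGCAAA GGTGGGGGCA GCAATGCCAC AACGCCAGTT"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```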
 
Protein sequence
MARKPIKGRK GGGSNATTPV ESPDSIQSTA RAKILIALGE GEFAGGLDGT NIYLDGTPIK 
NSDGTSNFTG VTWEYRPGTQ AQDYIQGMPN VENEITVNTE LKSDTPWVRS ITNTQLSATR
VRLGWPSLQR QADNGDVGGY RIEYAVDVAT DGGAYQTLLD TAIDGKTTTL YERSHRINLP
KATAGWQVRT RRKTANANSG RIADKMNVEA ISEVIDAKLR YPNTALLYIE FDATQFQNIP
IISCEPKGRI IRVPTTYDPV TRTYSGVWDG SFKWAHTNNP AWVFYNIVLA DRFGLGHRIE
VSQVDKWELY RIGQYCDQLI PDGRGGSGTE PRFTCDVYIQ SQAEAFTVLR DLAAIFRGMT
YWGNNQLCTL ADMPRDVDYI FTRASVIDGR FTYGGGSEKK RYTTAMVSWS DPANNCQDAI
EAVSDNDLVR RYGVNQLDMT AIGCIRQTEA NRRGRWALLT NSKDRTVNFN VGLDGAIPLP
GHIIGVADDM LSGRKMGGRI SSVSGRNITL DRVADVKAGD RLLVNLPNGV AQGRTVQVVN
GKVITVTTAY SEVPAAESGW SVDADDLAIQ QYRVTGISDN DDNTYSISSV QHDPDKYERI
DTGARIDERP ISVIPPGVQP PPTNVVIDSF SALSQGLAIT TLRVTWEPAA SAIAYEAEWR
RDNGNWISAP RTSAQGFQVE GIYAGQYQAR VRAINPSEIS SIWANAQETT LNGKEGNPPM
PVGFTATGIL FGITLNWGYP EGAEDALKTE IEYSLSADGT DAMLLSDVPH PQRNYTMQGL
RAGQVFWFRA RIVDKSGNQS PWIDWVRGMS STDTSAILEA IGDDFINNTV AGQQLINDDF
MNAEGILETA KANNASIWQQ WAQHGENKAG VIHLTTTVAD AERAFAEFET LVTATFEDQT
AAIDQKMTAV VDANGASATY SLRAGLNYNG QFVSAGMVIG AEFINGVAKS SIGFTADQFI
LLSGPTGNLF SPFAVVNGQV FMNDAFIAKA SIGRGKITDT LDSDNYVQGI SGLKLDFKNG
NAEFNNVNLR GNITMDNTIN GIRTIVDYRG QRTYHANGQP AIICGYF
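
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration in Python rather than the database's own method: it builds the standard codon table, which assigns the same amino acids as translation table 11 (the two differ only in which codons may act as start codons), and it assumes the CDS begins with ATG, as this gene does. The names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; translation
# table 11 assigns the same amino acids and differs only in start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGCACGTA AACCAATTAA AGGCCGCAAA"))  # MARKPIKGRK
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence to `translate` allows the same comparison for the whole protein.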