Gene YpAngola_B0043 details


Gene Information

Locus tag: YpAngola_B0043
Symbol: yscP
ID: 5798302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010157
Strand:
Start bp: 27606
End bp: 28973
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 49%
IMG OID: 641337854
Product: type III secretion system needle length determinant YscP
Protein accession: YP_001604474
Protein GI: 162417754
COG category:
COG ID:
TIGRFAM ID: [TIGR02514] type III secretion system needle length determinant
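
The start, end, and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, not part of the record itself. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(27606, 28973)
print(length)                     # 1368
print(protein_length_aa(length))  # 455
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.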


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 5.35952e-19
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 86
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAAA TCACCACTCG TTCCCCATTA GAACCTGAGT ATCAACCTCT GGGGAAGCCG 
CATCATGCTT TGCAAGCATG TGTCGATTTT GAGCAAGCGC TGTTGCATAA TAATAAGGGG
AATTGTCATC CCAAAGAAGA GTCGCTTAAA CCTGTACGTC CGCATGACCT TGGCAAAAAA
GAAGGTCAGA AAGGAGATGG CTTGCGTGCG CATGCGCCAT TAGCGGCGAC GTCTCAACCC
GGAAGGAAAG AGGTAGGATT AAAACCTCAA CACAACCATC AGAATAATCA TGATTTCAAC
TTATCTCCGC TTGCCGAAGG TGCTACCAAT AGAGCGCACT TATACCAGCA GGATAGCCGT
TTTGATGACC GCGTAGAGAG TATTATTAAT GCTCTCATGC CATTGGCGCC CTTTTTAGAG
GGGGTGACTT GTGAAACGGG GACATCAAGT GAATCCCCCT GCGAGCCGTC TGGACATGAT
GAGTTATTTG TTCAGCAATC GCCTATCGAT TCCGCTCAAC CAGTTCAATT GAATAGCAAG
CCGACTGTTC AGCCATTGAA TCCGGCTGCT GACGGCGCAG AGGTTATTGT ATGGTCTGTC
GGTAGGGAAA CTCCGGCCAG TATAGCAAAA AACCAGCGCG ATAGCAGGCA AAAACGCCTT
GCAGAAGAAC CGTTAGCTCT TCATCAAAAA GCATTGCCAG AGATATGTCC CCCGGCAGTT
AGTGCCACAC CGGATGATCA TTTGGTAGCA AGATGGTGTG CTACTCCTGT GACTGAGGTA
GCAGAAAAAT CTGCTCGTTT TCCGTACAAA GCGACAGTGC AGTCAGAGCA ACTGGACATG
ACCGAGCTGG CGGATCGGTC CCAACATCTT ACTGATGGCG TTGATAGCAG CAAAGATACC
ATCGAACCAC CGCGACCAGA AAAACTGTTA CTTCCGCGCG AAGAAACCTT GCCGGAGATG
TATTCCTTGT CTTTTACAGC ACCGGTTGTC ACGCCGGGTG ATCACCTATT AGCAACAATG
CGCGCGACCA GGCTGGCATC AGTCTCAGAG CAACTTATAC AGTTAGCACA GCGACTAGCG
GTAGAACTAG AACTGCGCGG CGGCTCATCC CAAGTAACCC AATTACACCT TAACTTACCT
GAATTGGGGG CTATTATGGT TCGTATTGCT GAGATTCCGG GAAAACTGCA TGTAGAACTG
ATCGCCAGTC GGGAAGCTTT AAGAATTTTA GCGCAGGGAA GTTATGATCT TCTTGAGCGA
TTACAACGCA TTGAGCCAAC ACAACTTGAT TTTCAAGCTA GCGATGACAG TGAACAGGAG
TCACGTCAGA AACGCCACGT CTATGAGGAG TGGGAGGCTG AAGAATGA
 
Protein sequence
MNKITTRSPL EPEYQPLGKP HHALQACVDF EQALLHNNKG NCHPKEESLK PVRPHDLGKK 
EGQKGDGLRA HAPLAATSQP GRKEVGLKPQ HNHQNNHDFN LSPLAEGATN RAHLYQQDSR
FDDRVESIIN ALMPLAPFLE GVTCETGTSS ESPCEPSGHD ELFVQQSPID SAQPVQLNSK
PTVQPLNPAA DGAEVIVWSV GRETPASIAK NQRDSRQKRL AEEPLALHQK ALPEICPPAV
SATPDDHLVA RWCATPVTEV AEKSARFPYK ATVQSEQLDM TELADRSQHL TDGVDSSKDT
IEPPRPEKLL LPREETLPEM YSLSFTAPVV TPGDHLLATM RATRLASVSE QLIQLAQRLA
VELELRGGSS QVTQLHLNLP ELGAIMVRIA EIPGKLHVEL IASREALRIL AQGSYDLLER
LQRIEPTQLD FQASDDSEQE SRQKRHVYEE WEAEE
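
The sequences above are laid out in blocks of ten characters, so they need to be joined before any analysis. The sketch below is one possible way to do that in Python, then compute a GC fraction and translate the gene. It is an illustration only: the function names are hypothetical, and the codon table is the standard genetic code, used here on the assumption that translation table 11 assigns amino acids identically and differs only in the start codons it permits.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed to match table 11's
# amino acid assignments; alternative start codons are not handled).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks of the blocked layout."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence as displayed in the record.
first_row = "ATGAATAAAA TCACCACTCG TTCCCCATTA GAACCTGAGT ATCAACCTCT GGGGAAGCCG"
print(translate(clean_sequence(first_row)))  # MNKITTRSPLEPEYQPLGKP
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` or `translate` gives values that can be compared against the GC content field and the protein sequence listed in the record.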