Gene YPK_2300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_2300 
Symbol 
ID6089012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp2545760 
End bp2546929 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content53% 
IMG OID641597362 
Producttail sheath protein 
Protein accessionYP_001721030 
Protein GI170024525 
COG category[R] General function prediction only 
COG ID[COG3497] Phage tail sheath protein FI 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGATT ATCATCACGG CGCGCGCGTC ATCGAAATCA ACGACGGTAC TCGCGTTATT 
TCCACTGTTT CCACCGCCAT TATCGGCATG GTCTGTACTT CCGATGATGC TGACCCCACT
CTGTTCCCAC TCAATACCCC GGTATTACTC ACCGATGTGC TGGCCGCCAG CGGCAAGGCC
GGTGAAACCG GCACATTAGC CCATTCACTG GATGCTATCA GCGACCAAAC CAAACCACTG
ACCGTCGTTG TCCGGGTGGC GCAGGGTGAA ACCGAAGCTG AAACCACGTC AAATATTATT
GGCGGAATAA CACCGGATGG CCGTTATACC GGCATGAAAG CGCTGTTAGC GGCGCAGGGT
AAGTTTGACG TCAAGCCCCG TATTTTAGGG GTTCCCGGTC ATGACACTCT GGCGGTATCC
ACTGAGCTAC TTTCCATCGC TCAGAGCCTA CGTGCCTTTG CCTACATCAG CGCCTATGGT
TGCAAAACCA AAGAAGAGGC CATTATCTAC CGCGATAATT TCAGTCAGCG CGAAGCGATG
GTGATTTGGC CCGATTTCCT CAGTTGGGAC ACGGTCACTA ACGCCGAAAC CACCGCTTAC
GCCACGGCTC GCGCCCTCGG CTTACGTGCC AAGATTGATA ATGATGTTGG CTGGCATAAA
ACGCTGTCTA ACGTCGGGGT GAATGGCGTC ACCGGTATCA GTGCGGATGT GTTCTGGGAT
CTGCAAAACA GCGCTACCGA TGCCAATTTG CTCAACAGTA AAGACGTCAC CACGCTGATC
CGCAAAGATG GTTACCGTTT TTGGGGTTCC CGTTCTTGTT CTGACGATCC GTTATTTGCC
TTTGAGAACT ACACCCGCAC CGCACAGGTA CTGGCTGACA CCCTGGCCGA GGCCCATATG
TGGGCTAACG ATAAGCCGCT TACCCCGTCA CTGGCAAAAG ACATTATTGA GGGTATTCGC
GCCAAAATGC GCGAGCTGAA ATCATTGGGT TATCTGATTG ATGGTGACTG CTGGTACGAC
GACAGCGTAA ACGATAAAGA CACACTAAAG GCTGGCCGCC TGTTTATTGA TTACGACTAT
ACGCCGGTGC CGCCGCTGGA AGATTTAACC CTGCGTCAAC GCATTACTGA TCGTTATCTG
GCTAATTTCG CCGCCGCCGT TAACAGCTAA
 
Protein sequence
MSDYHHGARV IEINDGTRVI STVSTAIIGM VCTSDDADPT LFPLNTPVLL TDVLAASGKA 
GETGTLAHSL DAISDQTKPL TVVVRVAQGE TEAETTSNII GGITPDGRYT GMKALLAAQG
KFDVKPRILG VPGHDTLAVS TELLSIAQSL RAFAYISAYG CKTKEEAIIY RDNFSQREAM
VIWPDFLSWD TVTNAETTAY ATARALGLRA KIDNDVGWHK TLSNVGVNGV TGISADVFWD
LQNSATDANL LNSKDVTTLI RKDGYRFWGS RSCSDDPLFA FENYTRTAQV LADTLAEAHM
WANDKPLTPS LAKDIIEGIR AKMRELKSLG YLIDGDCWYD DSVNDKDTLK AGRLFIDYDY
TPVPPLEDLT LRQRITDRYL ANFAAAVNS