Gene YPK_3149 details


Gene Information

Locus tag: YPK_3149
Symbol:
ID: 6089188
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 3443988
End bp: 3445196
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 48%
IMG OID: 641598227
Product: integrase family protein
Protein accession: YP_001721873
Protein GI: 170025368
COG category: [L] Replication, recombination and repair
COG ID: [COG0582] Integrase
TIGRFAM ID:
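
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3443988
end_bp = 3445196

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the sketch gives 1209 bp and 402 aa, matching the Gene Length and Protein Length fields.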


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATTAA CCGATGTGAA GGTAAGAAGT GCAAAACCTA CTGACAAGCC TTATAAGCTT 
ACCGATGGCG AAGGCATGCA TTTAATGGTT CATCCCAATG GATCTAAATA CTGGCGCTTA
CAGTACCGCT TTGACGGTAA GCAAAAGATG TTGGCACTGG GTATCTACCC TGAGATTTCG
CTATCTGAAG CCAGGCAGCG GAGAGATGAA GCTAAGCGAC AAGTCGCTAA CGCTATCGAT
CCCAGCGAAC AGAAAAAGGT TGAGAAACAA GCTCGCAAAG CGACAGTGGA AAATACCTTT
AAAGCTATTG CTCTGGAATG GCATGAGTAC AAACGGCCAA ACTGGTCCAA AGGCTACGCG
GAAGACATCA TGGAAGCGTT TGAAAACGAT ATCTTCCCGG ATATTGGTAA GCGCCCAATA
GCAGAGATAA AGCCTCTGGA GATCCTCAGT TCACTTCGTA AACTCGAAAA ACGTGGCGTC
CTCGACAAGT TGCGCAAAAT CCGCCAGGCC TGCAACCAAG TGTTTCGTTA CGCCATTGTC
ACTGGTCGCG CTGAATCTAA CCCTGCTTCA GAACTGGCAA GTGCACTAAC TCCGCCAAAA
TCTGTGCACT ATCCTCACCT GTTAGCGGAT GAACTTCCAG CTTTCTTAGA GGCTCTCGCC
GCTTATTCTG GTAGCCCAAT AACTCGGCTA GCAACAAAGA TTCTGATGCT AACTGGTGTT
CGTACCATTG AGCTCCGCAT GGCTGAATGG AAAGAGTTCG ATTTCGACCA GCGGGTTTGG
GAAGTACCGG TACAAAGGAT GAAGATGCGG CGACCTCACC TTGTACCACT ATCTGATCAA
GTTGTGACAG CGCTACGTGA AATTCAGGCG GTAACTGGCC GTTATAACTT GGTGTTTCCT
GGACGTAACG ACATCACCAA ACCAATGAGC GAAGCGAGTA TCAATCAGGT ACTGAAAAGG
ATTGGGTATC ACGGGAAGGC GACAGGTCAT GGCTTCAGGC ACACGATGAG TACTATCTTG
CATGAACAAG GCTATAACAC GGCTTGGATC GAGTTACAGC TGGCTCATGT GGATAAGAAT
ACCATTCGTG GCACATACAA CCATGCGCAG TATCTGGAGC AACGTCGAGA GATGTTGCAG
TGGTATGGGG AATATGTGGA TGGGTTGGCG GCTAGGGGAA TCGCAATAAA TATTCGTAAA
AAAGCATAA
 
Protein sequence
MALTDVKVRS AKPTDKPYKL TDGEGMHLMV HPNGSKYWRL QYRFDGKQKM LALGIYPEIS 
LSEARQRRDE AKRQVANAID PSEQKKVEKQ ARKATVENTF KAIALEWHEY KRPNWSKGYA
EDIMEAFEND IFPDIGKRPI AEIKPLEILS SLRKLEKRGV LDKLRKIRQA CNQVFRYAIV
TGRAESNPAS ELASALTPPK SVHYPHLLAD ELPAFLEALA AYSGSPITRL ATKILMLTGV
RTIELRMAEW KEFDFDQRVW EVPVQRMKMR RPHLVPLSDQ VVTALREIQA VTGRYNLVFP
GRNDITKPMS EASINQVLKR IGYHGKATGH GFRHTMSTIL HEQGYNTAWI ELQLAHVDKN
TIRGTYNHAQ YLEQRREMLQ WYGEYVDGLA ARGIAINIRK KA
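
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the database's own method: the `translate` helper and `CODON_TABLE` name are hypothetical, and the table is built by hand on the basis that translation table 11 assigns amino acids to codons in the same way as the standard code (the two differ in permitted start codons). Only the first and last lines of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codons enumerated in TCAG order; amino-acid assignments are those of the
# standard code, which translation table 11 shares (assumption stated above).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(grouped_dna: str) -> str:
    """Translate a DNA string (spaces/newlines allowed) up to the first stop."""
    dna = "".join(grouped_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, as printed in this record.
first_line = "ATGGCATTAA CCGATGTGAA GGTAAGAAGT GCAAAACCTA CTGACAAGCC TTATAAGCTT"
last_line = "AAAGCATAA"

print(translate(first_line))
print(translate(last_line))
```

The first call yields MALTDVKVRSAKPTDKPYKL and the second yields KA, which agree with the start and end of the protein sequence above. Passing the full 1209-base gene sequence to the same helper would be the natural way to check the whole record.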