Gene YPK_3998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_3998 
Symbol 
ID6088639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp4409288 
End bp4411081 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content52% 
IMG OID641599092 
ProductATP-dependent DNA helicase RecQ 
Protein accessionYP_001722714 
Protein GI170026209 
COG category[L] Replication, recombination and repair 
COG ID[COG0514] Superfamily II DNA helicase 
TIGRFAM ID[TIGR00614] ATP-dependent DNA helicase, RecQ family
[TIGR01389] ATP-dependent DNA helicase RecQ 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0373347 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCAGGTAT TGCGAGATAC TTTTGGTTAT CAGCAATTTC GGCCAGGGCA GCAGGAGATC 
ATCAACGCGA CATTGTCAGG CCAGGATTGT CTGGTTGTGA TGCCAACTGG CGGCGGTAAG
TCCTTGTGTT ATCAGATCCC TGCCTTGGTG ACCGATGGGC TGACATTGGT GGTTTCACCG
CTTATCTCTC TGATGAAAGA TCAGGTTGAC CAACTGTTGG CCTATGGCGT GGGCGCGGGT
TGTCTAAACT CATCGCAGAC CCGTGAACAG CAACTGGCCG TTATGGATGG TTGCCGCAGT
GGGCAGATTA AATTGCTGTA TATCGCGCCA GAACGGTTGG TGATGGAGAG TTTTCTTGAT
CAACTGTATC AATGGCGACC CGCCTTGCTG GCAGTGGATG AGGCTCACTG TATTTCCCAA
TGGGGGCATG ATTTCCGCCC GGAATACCGT GCTCTTGGTC AGTTAAAGCA GCGTTTCCCT
GATCTACCGG TTATTGCATT GACCGCTACG GCTGATGAAG CGACGCGTGG CGATATTGTG
CGCCTGTTGA ATCTGGATCA ACCTCTGATT CAGATCAGTA GTTTTGATCG CCCGAATATC
CGCTATACCT TAGTTGAGAA GTTTAAACCT CTCGATCAAC TGTGGCGTTT TGTACAAGAC
CAGCGCGGCA AGAGCGGCAT TATTTACTGT AATAGCCGAG CGAAAGTGGA AGATACGACC
GCCCGTTTGC AAAGCCGTGG CTTGAGTGTT GCCGCCTATC ACGCCGGGCT GGATAACGAG
CGCCGTGCTC AGGTGCAAGA GGCTTTCCAG CGCGATGACT TACAAGTGGT GGTCGCCACC
GTAGCGTTTG GAATGGGGAT TAACAAACCG AACGTCCGCT TTGTCGTGCA TTTTGATATC
CCACGCACCA TCGAATCCTA CTATCAGGAA ACGGGGCGCG CCGGGCGTGA TGGCTTACCG
GCTGAGGCCG TGTTGTTGTA CGATCCGGCA GATATGGCGT GGTTACGCCG TTGTCTTGAA
GAGAAACCTG CGGGTGCTCA GCAAGATATC GAACGGCATA AACTCAATGC GATGGGGGCA
TTTGCCGAAG CGCAAACCTG TCGCCGTTTG GTGCTGCTCA ACTATTTTGG TGAAGGTAAA
CAGCAGCCAT GCGGTAACTG CGATATTTGT CTGGATCCGC CAAAGCGCTA CGATGGTCTG
GCAGATGCGC AAAAGGCGCT CTCTTGCGTC TATCGGGTAG GGCAACGCTT TGGCTTAGGG
TACATCGTTG AGGTGCTGCG TGGGGCGAAT AACCAGCGTA TTCGTGAGAT GGGCCATGAC
AAGCTTTCGG TGTACGGTAT TGGCCGTGAG CAAACCCATG AACACTGGGT CAGCGTCTTG
CGCCAGTTGA TCCATTTGGG GCTGCTCAGC CAGAATATTG CCCAGTTCTC TGCCTTGCAA
CTGACTGAAG CGGCGCGCCC GGTCTTACGC GCAGAGCTGC CATTACAACT GGCGGTACCG
CGTATTCAGA GTCTGAAAGT GCGTAGCAGT GCCAATCAGA AATCCTATGG TGGCAATTAT
GATCGCAAGT TGTTCGCTAA GCTGCGCAAA CTGCGTAAAT CAATTGCTGA TGAAGGCAAT
ATCCCGCCTT ATGTGGTGTT TAACGATGCG ACCTTGCTGG AGATGGCTGA GCAGATGCCG
ATTACTGCCA GCGAGCTATT GAGCGTTAAT GGTGTCGGTC AGCGTAAACT AGAACGTTTT
GGTGCACCAT TTATGGCCCT GATCCGTGAT CATGTGGATA ACAACGATGA CTAA
 
Protein sequence
MQVLRDTFGY QQFRPGQQEI INATLSGQDC LVVMPTGGGK SLCYQIPALV TDGLTLVVSP 
LISLMKDQVD QLLAYGVGAG CLNSSQTREQ QLAVMDGCRS GQIKLLYIAP ERLVMESFLD
QLYQWRPALL AVDEAHCISQ WGHDFRPEYR ALGQLKQRFP DLPVIALTAT ADEATRGDIV
RLLNLDQPLI QISSFDRPNI RYTLVEKFKP LDQLWRFVQD QRGKSGIIYC NSRAKVEDTT
ARLQSRGLSV AAYHAGLDNE RRAQVQEAFQ RDDLQVVVAT VAFGMGINKP NVRFVVHFDI
PRTIESYYQE TGRAGRDGLP AEAVLLYDPA DMAWLRRCLE EKPAGAQQDI ERHKLNAMGA
FAEAQTCRRL VLLNYFGEGK QQPCGNCDIC LDPPKRYDGL ADAQKALSCV YRVGQRFGLG
YIVEVLRGAN NQRIREMGHD KLSVYGIGRE QTHEHWVSVL RQLIHLGLLS QNIAQFSALQ
LTEAARPVLR AELPLQLAVP RIQSLKVRSS ANQKSYGGNY DRKLFAKLRK LRKSIADEGN
IPPYVVFNDA TLLEMAEQMP ITASELLSVN GVGQRKLERF GAPFMALIRD HVDNNDD