Gene YPK_3547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_3547 
Symbol 
ID6091002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp3902173 
End bp3905079 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content52% 
IMG OID641598631 
ProductATP-dependent helicase HepA 
Protein accessionYP_001722267 
Protein GI170025762 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.212031 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTTTA CACTTGGTCA ACGCTGGATC AGCGACACAG AAAGCGAACT TGGATTGGGT 
ACTGTCGTTG CCATTGACGT GCGCATGATC ACCTTACTGT TCCCTGCAAC CGGTGAAAAC
CGCCTTTACG CCAGAAATGA CTCGCCAATC ACCCGGGTCA TGTTCAATCC GAGCGATACC
ATCACTCACC ACGAAGGCTG GCAGCTGAAA GTGGAAGAAG TGACTCAAGA AAATGGGCTG
ATTACCTATA TCGGTACCCG TTTGGATACT GAGGAAACCG GCGTCGCCAT GCGTGAAGTT
TTGCTCGATA GCAAACTGAC TTTTAGTAAA CCACAGGATC GCTTATTCGC CGGTCAGATC
GATCGCATGG ATCGCTTTGC CCTGCGTTTT CGCGCCCGTA AATATCAAAG CGAGCAGTTC
CGGTTGCCGT GGAGCGGCCT GCGTGGTATC CGTGCCAGCC TGATCCCACA CCAGTTACAT
ATCGCTTATG AAGTGGGTCA GCGCCACGCG CCACGGGTCT TACTGGCCGA TGAAGTGGGG
TTGGGTAAAA CCATCGAAGC CGGGATGATT ATCCACCAGC AACTGCTGGC TGGCCGTGCT
GAACGGGTGC TGATCGTGGT ACCGGAGAGC CTACAACATC AGTGGCTAGT AGAGATGTTA
CGCCGCTTCA ACCTGCGCTT CTCACTGTTT GATGACAGCC GTTATTCCGA AGCCTTGTTG
GACAGTAGCA ATCCGTTTGA TACCGAACAG ATGGTGATTT GTTCGCTGGA TTTTGTTCGT
CGGAATAAGC AGCGGTTAGA ACAACTGGCC GATGCTTCCT GGGATCTGCT GGTGGTGGAT
GAAGCGCACC ATCTGGCCTG GAGTGAAGAG GCTCCAAGCC GCGAATATCA AGTGATTGAA
CAGTTGGCCG AGCATATCCC CGGCGTGCTG TTATTGACAG CAACACCAGA GCAATTGGGC
CAACAGAGTC ACTTTGCCCG CCTGCGTTTG CTGGATCCAG ATCGCTTCCA TGATTACGAA
GAGTTCGTCA ATGAACAACA GAAATACCGG CCTATCGCCG ATGCCGTCAC CTTGCTGTTA
GGGGGCGAGC GTTTAACTGA TGATAAGCTG AATCTGTTAG GTGAACTGAT TGATGAACAA
GATATTGAGC CACTACTGAA AGCGGCCAAT AGCCAAAGTG AAGACAGCGA AGCGGCGCGT
CAAGAATTAG TTACCATGCT GATGGATAGG CACGGTACTA GCCGGGTGCT GTTCCGTAAT
ACCCGTAATG GGGTAAAAGG CTTCCCGCAC CGCGTCTTAC ACCAAATTAA GCTGCCATTG
CCAACACAGT ACCAAACTGC AATTAAGGTC TCTGGCATCA TGGGGGCGAA GAAAACCCTC
GACGCTCGTG CAAAAGATAT GCTTTATCCC GAGCAAATCT ATCAGGAGTT TGAAGGCGAA
AACGCCACCT GGTGGAACTT TGACCCACGG GTTGAATGGC TACTGAACTA TTTGGTTGCC
AACCGGGGTG AGAAGGTGTT GGTGATCTGC GCACAAGCCG CTACAGCTTT GCAGCTTGAA
CAAGTGTTAC GTGAGCGTGA AGCTATTCGG GCCGCCGTCT TCCATGAAGG TTTATCGCTG
ATTGAACGTG ACCGTGCCGC CGCCTATTTT GCCTCAGAAG AAGATGGTGC TCAGGTCTTA
CTGTGTTCCG AAATTGGTTC AGAGGGGCGT AACTTCCAAT TTGCCTGCCA ACTGGTCATG
TTCGATTTGC CGTTCAACCC CGATCTCCTG GAGCAACGTA TTGGCCGTTT GGATCGTATC
GGCCAGAACC GTGAAATCCA GATTATGGTG CCTTATTTGG AAGACACCGC GCAGGCAATA
CTGGTTCGTT GGTACCACGA AGGGTTGGAT GCGTTCGAAC ACACCTGCCC AACAGGCCGT
ACCATTTATG ACAGCAGCTA TCAGGAACTG ATTAGCTATT TGGCCACGCC AAGTGAGCAG
GAAGGGTTGG ATGAGTTTAT CCACACCTGC CGTCAGCAGC ACGAAGGGTT AAAACTTCAG
TTGGAACAGG GCCGTGACCG CTTACTGGAG ATGCACTCTA ACGGTGGCGA ACATGGGCAG
GAGCTGGCAC AGAGCATTGC CGAACAAGAT AATGACATCA ATTTAGTCAG TTTTGCACTC
AACCTGTTCG ATATTGTCGG GATCAACCAG GAAGATCGTA GCGATAACCT GATCGTGCTG
ACCCCGTCCG ATCACATGCT GGTGCCCGAT TTCCCCGGTT TGCCACCAGA TGGCTGCACC
GTCACCTTTG ATCGTGAACA GGCACTCTCG CGGGAAGATG CCCAGTTTGT CAGTTGGGAA
CACCCCATCA TCCGCAATGG GTTGGATTTG ATCCTGTCTG GCGATACCGG CAGTTGCGCG
GTGTCTTTAT TGAAAAATAA AGCCTTACCC GTCGGTACCC TGCTGGCTGA GTTAGTTTAC
GTGGTCGAGG CACAAGCACC GAAACACTTG CAACTGACCC GTTTCTTGCC ACCAACACCC
GTGCGTATGC TCATGGATCG CAACGGCACT AACCTGGCGG CACAGGTTGA ATTTGAAAGT
TTCAATCGCC AACTGAATGC GGTAAACCGC CATACCTCCA GCAAGCTGGT CAATGCGGTT
CAGCAAGAAG TTCACACCAT GTTGCAACAA GCCGAAGCAC TGGTAGAAGC GCAAGCGCAG
GCTCTGATTG AAACGGCAAA ACGCGAGGCC GATGATAAGT TGAGTACTGA ACTGGCGCGT
CTGGAAGCGT TGAAAGCGGT TAACCCGAAT ATTCGTGATG ACGAAATAGA GGCGCTTGAG
CATAACCGTA AGATGGTGCT GGAAAACCTG AATCAAGCAG GCTGGCGTTT AGATGCTATC
CGGCTGGTGG TGGTAACACA TCAGTAG
 
Protein sequence
MPFTLGQRWI SDTESELGLG TVVAIDVRMI TLLFPATGEN RLYARNDSPI TRVMFNPSDT 
ITHHEGWQLK VEEVTQENGL ITYIGTRLDT EETGVAMREV LLDSKLTFSK PQDRLFAGQI
DRMDRFALRF RARKYQSEQF RLPWSGLRGI RASLIPHQLH IAYEVGQRHA PRVLLADEVG
LGKTIEAGMI IHQQLLAGRA ERVLIVVPES LQHQWLVEML RRFNLRFSLF DDSRYSEALL
DSSNPFDTEQ MVICSLDFVR RNKQRLEQLA DASWDLLVVD EAHHLAWSEE APSREYQVIE
QLAEHIPGVL LLTATPEQLG QQSHFARLRL LDPDRFHDYE EFVNEQQKYR PIADAVTLLL
GGERLTDDKL NLLGELIDEQ DIEPLLKAAN SQSEDSEAAR QELVTMLMDR HGTSRVLFRN
TRNGVKGFPH RVLHQIKLPL PTQYQTAIKV SGIMGAKKTL DARAKDMLYP EQIYQEFEGE
NATWWNFDPR VEWLLNYLVA NRGEKVLVIC AQAATALQLE QVLREREAIR AAVFHEGLSL
IERDRAAAYF ASEEDGAQVL LCSEIGSEGR NFQFACQLVM FDLPFNPDLL EQRIGRLDRI
GQNREIQIMV PYLEDTAQAI LVRWYHEGLD AFEHTCPTGR TIYDSSYQEL ISYLATPSEQ
EGLDEFIHTC RQQHEGLKLQ LEQGRDRLLE MHSNGGEHGQ ELAQSIAEQD NDINLVSFAL
NLFDIVGINQ EDRSDNLIVL TPSDHMLVPD FPGLPPDGCT VTFDREQALS REDAQFVSWE
HPIIRNGLDL ILSGDTGSCA VSLLKNKALP VGTLLAELVY VVEAQAPKHL QLTRFLPPTP
VRMLMDRNGT NLAAQVEFES FNRQLNAVNR HTSSKLVNAV QQEVHTMLQQ AEALVEAQAQ
ALIETAKREA DDKLSTELAR LEALKAVNPN IRDDEIEALE HNRKMVLENL NQAGWRLDAI
RLVVVTHQ