Gene Information |
Locus tag | YPK_3547 |
Symbol | |
ID | 6091002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis YPIII |
Kingdom | Bacteria |
Replicon accession | NC_010465 |
Strand | + |
Start bp | 3902173 |
End bp | 3905079 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641598631 |
Product | ATP-dependent helicase HepA |
Protein accession | YP_001722267 |
Protein GI | 170025762 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
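The coordinate and length fields above are mutually consistent, which a short check makes explicit. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record for locus tag YPK_3547.
length = gene_length_bp(3902173, 3905079)
print(length)                     # 2907
print(protein_length_aa(length))  # 968
```

Both results match the Gene Length (2907 bp) and Protein Length (968 aa) fields listed in the record.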
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.212031 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTTTA CACTTGGTCA ACGCTGGATC AGCGACACAG AAAGCGAACT TGGATTGGGT ACTGTCGTTG CCATTGACGT GCGCATGATC ACCTTACTGT TCCCTGCAAC CGGTGAAAAC CGCCTTTACG CCAGAAATGA CTCGCCAATC ACCCGGGTCA TGTTCAATCC GAGCGATACC ATCACTCACC ACGAAGGCTG GCAGCTGAAA GTGGAAGAAG TGACTCAAGA AAATGGGCTG ATTACCTATA TCGGTACCCG TTTGGATACT GAGGAAACCG GCGTCGCCAT GCGTGAAGTT TTGCTCGATA GCAAACTGAC TTTTAGTAAA CCACAGGATC GCTTATTCGC CGGTCAGATC GATCGCATGG ATCGCTTTGC CCTGCGTTTT CGCGCCCGTA AATATCAAAG CGAGCAGTTC CGGTTGCCGT GGAGCGGCCT GCGTGGTATC CGTGCCAGCC TGATCCCACA CCAGTTACAT ATCGCTTATG AAGTGGGTCA GCGCCACGCG CCACGGGTCT TACTGGCCGA TGAAGTGGGG TTGGGTAAAA CCATCGAAGC CGGGATGATT ATCCACCAGC AACTGCTGGC TGGCCGTGCT GAACGGGTGC TGATCGTGGT ACCGGAGAGC CTACAACATC AGTGGCTAGT AGAGATGTTA CGCCGCTTCA ACCTGCGCTT CTCACTGTTT GATGACAGCC GTTATTCCGA AGCCTTGTTG GACAGTAGCA ATCCGTTTGA TACCGAACAG ATGGTGATTT GTTCGCTGGA TTTTGTTCGT CGGAATAAGC AGCGGTTAGA ACAACTGGCC GATGCTTCCT GGGATCTGCT GGTGGTGGAT GAAGCGCACC ATCTGGCCTG GAGTGAAGAG GCTCCAAGCC GCGAATATCA AGTGATTGAA CAGTTGGCCG AGCATATCCC CGGCGTGCTG TTATTGACAG CAACACCAGA GCAATTGGGC CAACAGAGTC ACTTTGCCCG CCTGCGTTTG CTGGATCCAG ATCGCTTCCA TGATTACGAA GAGTTCGTCA ATGAACAACA GAAATACCGG CCTATCGCCG ATGCCGTCAC CTTGCTGTTA GGGGGCGAGC GTTTAACTGA TGATAAGCTG AATCTGTTAG GTGAACTGAT TGATGAACAA GATATTGAGC CACTACTGAA AGCGGCCAAT AGCCAAAGTG AAGACAGCGA AGCGGCGCGT CAAGAATTAG TTACCATGCT GATGGATAGG CACGGTACTA GCCGGGTGCT GTTCCGTAAT ACCCGTAATG GGGTAAAAGG CTTCCCGCAC CGCGTCTTAC ACCAAATTAA GCTGCCATTG CCAACACAGT ACCAAACTGC AATTAAGGTC TCTGGCATCA TGGGGGCGAA GAAAACCCTC GACGCTCGTG CAAAAGATAT GCTTTATCCC GAGCAAATCT ATCAGGAGTT TGAAGGCGAA AACGCCACCT GGTGGAACTT TGACCCACGG GTTGAATGGC TACTGAACTA TTTGGTTGCC AACCGGGGTG AGAAGGTGTT GGTGATCTGC GCACAAGCCG CTACAGCTTT GCAGCTTGAA CAAGTGTTAC GTGAGCGTGA AGCTATTCGG GCCGCCGTCT TCCATGAAGG TTTATCGCTG ATTGAACGTG ACCGTGCCGC CGCCTATTTT GCCTCAGAAG AAGATGGTGC TCAGGTCTTA CTGTGTTCCG AAATTGGTTC AGAGGGGCGT AACTTCCAAT TTGCCTGCCA ACTGGTCATG TTCGATTTGC CGTTCAACCC CGATCTCCTG GAGCAACGTA TTGGCCGTTT GGATCGTATC 
GGCCAGAACC GTGAAATCCA GATTATGGTG CCTTATTTGG AAGACACCGC GCAGGCAATA CTGGTTCGTT GGTACCACGA AGGGTTGGAT GCGTTCGAAC ACACCTGCCC AACAGGCCGT ACCATTTATG ACAGCAGCTA TCAGGAACTG ATTAGCTATT TGGCCACGCC AAGTGAGCAG GAAGGGTTGG ATGAGTTTAT CCACACCTGC CGTCAGCAGC ACGAAGGGTT AAAACTTCAG TTGGAACAGG GCCGTGACCG CTTACTGGAG ATGCACTCTA ACGGTGGCGA ACATGGGCAG GAGCTGGCAC AGAGCATTGC CGAACAAGAT AATGACATCA ATTTAGTCAG TTTTGCACTC AACCTGTTCG ATATTGTCGG GATCAACCAG GAAGATCGTA GCGATAACCT GATCGTGCTG ACCCCGTCCG ATCACATGCT GGTGCCCGAT TTCCCCGGTT TGCCACCAGA TGGCTGCACC GTCACCTTTG ATCGTGAACA GGCACTCTCG CGGGAAGATG CCCAGTTTGT CAGTTGGGAA CACCCCATCA TCCGCAATGG GTTGGATTTG ATCCTGTCTG GCGATACCGG CAGTTGCGCG GTGTCTTTAT TGAAAAATAA AGCCTTACCC GTCGGTACCC TGCTGGCTGA GTTAGTTTAC GTGGTCGAGG CACAAGCACC GAAACACTTG CAACTGACCC GTTTCTTGCC ACCAACACCC GTGCGTATGC TCATGGATCG CAACGGCACT AACCTGGCGG CACAGGTTGA ATTTGAAAGT TTCAATCGCC AACTGAATGC GGTAAACCGC CATACCTCCA GCAAGCTGGT CAATGCGGTT CAGCAAGAAG TTCACACCAT GTTGCAACAA GCCGAAGCAC TGGTAGAAGC GCAAGCGCAG GCTCTGATTG AAACGGCAAA ACGCGAGGCC GATGATAAGT TGAGTACTGA ACTGGCGCGT CTGGAAGCGT TGAAAGCGGT TAACCCGAAT ATTCGTGATG ACGAAATAGA GGCGCTTGAG CATAACCGTA AGATGGTGCT GGAAAACCTG AATCAAGCAG GCTGGCGTTT AGATGCTATC CGGCTGGTGG TGGTAACACA TCAGTAG
|
Protein sequence | MPFTLGQRWI SDTESELGLG TVVAIDVRMI TLLFPATGEN RLYARNDSPI TRVMFNPSDT ITHHEGWQLK VEEVTQENGL ITYIGTRLDT EETGVAMREV LLDSKLTFSK PQDRLFAGQI DRMDRFALRF RARKYQSEQF RLPWSGLRGI RASLIPHQLH IAYEVGQRHA PRVLLADEVG LGKTIEAGMI IHQQLLAGRA ERVLIVVPES LQHQWLVEML RRFNLRFSLF DDSRYSEALL DSSNPFDTEQ MVICSLDFVR RNKQRLEQLA DASWDLLVVD EAHHLAWSEE APSREYQVIE QLAEHIPGVL LLTATPEQLG QQSHFARLRL LDPDRFHDYE EFVNEQQKYR PIADAVTLLL GGERLTDDKL NLLGELIDEQ DIEPLLKAAN SQSEDSEAAR QELVTMLMDR HGTSRVLFRN TRNGVKGFPH RVLHQIKLPL PTQYQTAIKV SGIMGAKKTL DARAKDMLYP EQIYQEFEGE NATWWNFDPR VEWLLNYLVA NRGEKVLVIC AQAATALQLE QVLREREAIR AAVFHEGLSL IERDRAAAYF ASEEDGAQVL LCSEIGSEGR NFQFACQLVM FDLPFNPDLL EQRIGRLDRI GQNREIQIMV PYLEDTAQAI LVRWYHEGLD AFEHTCPTGR TIYDSSYQEL ISYLATPSEQ EGLDEFIHTC RQQHEGLKLQ LEQGRDRLLE MHSNGGEHGQ ELAQSIAEQD NDINLVSFAL NLFDIVGINQ EDRSDNLIVL TPSDHMLVPD FPGLPPDGCT VTFDREQALS REDAQFVSWE HPIIRNGLDL ILSGDTGSCA VSLLKNKALP VGTLLAELVY VVEAQAPKHL QLTRFLPPTP VRMLMDRNGT NLAAQVEFES FNRQLNAVNR HTSSKLVNAV QQEVHTMLQQ AEALVEAQAQ ALIETAKREA DDKLSTELAR LEALKAVNPN IRDDEIEALE HNRKMVLENL NQAGWRLDAI RLVVVTHQ
|
| |
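The sequences above are displayed in blocks of ten characters separated by spaces, so they need to be normalized before any downstream use. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; the helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full 2907 bp.

```python
def normalize_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already normalized DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First two display blocks of the gene sequence from the record.
first_blocks = "ATGCCTTTTA CACTTGGTCA"
seq = normalize_sequence(first_blocks)
print(seq)               # ATGCCTTTTACACTTGGTCA
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.4
```

Applied to the complete gene sequence, the same two helpers would give a value to compare against the GC content field of the record.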