Gene Information |
Locus tag | SeD_A0101 |
Symbol | rapA |
ID | 6874087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 105501 |
End bp | 108407 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642783355 |
Product | ATP-dependent helicase HepA |
Protein accession | YP_002214049 |
Protein GI | 198245326 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTTTA CACTTGGTCA ACGCTGGATC AGCGATACAG AAAGCGAGCT GGGACTTGGA ACCGTTGTTG CGATGGATGC GCGAACCGTC ACCTTACTTT TCCCGTCCAC GGGGGAAAAC CGCCTGTATG CGCGCAGTGA TTCTCCCGTG ACCCGCGTCA TGTTCAACCC TGGCGATACG ATTACAAGCC ATGAAGGCTG GCAGCTACAT ATCGATGAAG TAAAAGAAGA AAATGGCCTG CTTGTTTATG TCGGCACCCG CCTGGATACC GAAGAGACCA ATGTGACGTT GCGCGAGGTT CTGCTCGACA GCAAGTTGGT TTTCAGTAAG CCCCAGGATC GTCTGTTCGC CGGTCAAATT GATCGAATGG ATCGGTTCGC GCTGCGCTAT CGCGCCCGTA AATTTCAGAG CGAGCAGTAC CGGATGCCGT ACAGCGGCCT GCGCGGTCAG CGGACCAACC TGATCCCGCA TCAGCTTAAT ATCGCTCATG ATGTGGGTCG TCGCCACGCG CCACGCGTAC TGCTGGCGGA TGAAGTGGGC TTAGGTAAAA CCATTGAAGC CGGGATGATT CTGCATCAAC AGTTATTATC CGGCGCGGCG GAACGCGTAT TGATCATTGT TCCGGAAACC CTGCAACACC AGTGGCTGGT AGAAATGCTG CGCCGTTTCA ACCTGCGCTT CGCGCTGTTC GATGACGAAC GCTATACCGA AGCGCAGCAC GACGCGTATA ACCCGTTTGA AACCGAACAA CTGGTGATCT GCTCGCTGGA TTTCGCCCGC CGTAATAAGC AGCGTCTGGA ACATTTGTGC GACGCCGAGT GGGATCTGCT GGTGGTCGAC GAAGCGCATC ATCTGGTGTG GAGTACCGAT GCGCCGAGTC GTGAATATAT GGCCATCGAA CAGTTAGCTG AACGCGTGCC GGGCGTGCTG CTGCTGACCG CCACGCCAGA ACAGCTGGGG ATGGAAAGCC ATTTCGCCCG TCTGCGCCTG CTCGATCCGA ACCGTTTCCA CGATTTCGAA CAGTTTGTCG AAGAACAGAA AAACTACCGT CCTGTCGCCG ATGCGGTGGC CATGCTGCTG GCGGGCAATA AACTCAGCAA CGACGAACTG AACAGGCTGG GCGATCTGAT CGGCGAACAG GATATTGAAC CGCTGTTGCA GGCCGCTAAT AGCGATCGCG ACGACGCGCA GGCCGCCCGT GATGAGCTGG TGTCGATGCT GATGGATCGC CACGGCACCA GCCGCGTTCT GTTCCGCAAC ACCCGTAACG GCGTCAAGGG GTTCCCGAAA CGTGAACTGC ATACGGTAAA ACTGCCGCTG CCGACCCAGT ATCAGACCGC CATTAAGGTC TCCGGCATTA TGGGCGCGCG TAAAAGCGCG GAAGATCGCG CCCGCGATAT GCTCTATCCG GAACAAATTT ATCAGGAGTT CGAAGGCGAT ACTGGCACCT GGTGGAACTT CGACCCACGC GTTGAGTGGC TAATGGGCTA TCTGACCAGC CATCGTTCGC AAAAAGTGCT GGTGATCTGC GCCAAAGCGA CCACCGCGTT ACAGCTGGAG CAGGTGCTGC GCGAACGCGA AGGCATCCGC GCCGCCGTGT TCCATGAGGG CATGTCGATT ATCGAACGCG ACCGCGCCGC CGCCTGGTTC GCCGAAGAAG ATACCGGCGC GCAGGTGCTG TTATGTTCCG AAATCGGCTC CGAAGGACGT AACTTCCAGT TTGCCAGCAA TCTGGTGATG TTCGACTTGC CGTTTAACCC GGATCTGCTG GAACAGCGTA TTGGTCGTCT GGATCGCATC 
GGCCAGGCGC ATGATATCCA GATCCACGTC CCGTACCTGG AAAAAACCGC CCAGTCGGTG CTGGTTCGCT GGTATCACGA AGGGCTGGAC GCCTTTGAGC ACACCTGTCC AACCGGCCGC GCGATTTATG ATTCAGCCTA CGCCAGTCTG ATTAACTATC TGGCCGCGCC TGAAGAAACC GACGGGTTTG ACGATCTGAT CAAATCCTGC CGCGAGCAAC ATGAAGCGCT AAAAGCCCAG TTAGAACAGG GCCGCGACCG CCTGCTGGAG ATCCACTCCA ACGGTGGTGA AAAAGCCCAA CAGCTTGCAC AAAGCATTGA AGAACAGGAC GACGACACCA ACCTGATCGC GTTCGCCATG AACCTGTTCG ATATCGTCGG CATTAACCAG GACGATCGCG GCGACAACCT GATCGTGCTG ACGCCGTCCG ACCACATGTT GGTGCCGGAT TTCCCCGGCC TGCCGGAAGA CGGCTGTACT ATCACGTTTG AACGTGACGT AGCCCTGTCT CGCGAAGATG CGCAGTTTAT TACCTGGGAA CATCCGCTCA TCCGTAACGG ACTGGATCTG ATCCTCTCCG GCGATACCGG CAGCAGCACC ATTTCGCTGT TGAAAAATAA AGCGCTGCCA GTCGGCACGC TGCTGGTCGA ACTGGTTTAC GTCGTGGAAG CGCAGGCGCC GAAACAGCTA CAACTGAACC GCTTCCTGCC GCCGACGCCG GTACGTATGC TGTTAGATAA AAACGGCAAC AATCTGGCCG CCCAGGTCGA GTTTGAAACC TTCAACCGCC AGCTCAGCGC CGTGAATCGC CACACCGGCA GCAAACTGGT GAACGCCGTG CAACAGGACG TCCACGCCAT TTTGCAACTG GGCGAGACGC AGATTGAAAA GTCCGCCAGA GCGCTGATTG ATAACGCGCG TCGCGAGGCG GATGAAAAAC TGTCCGGGGA ACTGTCGCGT CTGGAAGCGC TGCGCGCCGT CAACCCGAAC ATTCGCGACG ACGAACTTGC CGCTATTGAC AGCAACCGTC AGCAGGTACT GGAAAGCCTG AATCAGGCAG GCTGGCGTCT GGACGCGCTG CGTCTTATCG TCGTCACGCA CCAATAA
|
Protein sequence | MPFTLGQRWI SDTESELGLG TVVAMDARTV TLLFPSTGEN RLYARSDSPV TRVMFNPGDT ITSHEGWQLH IDEVKEENGL LVYVGTRLDT EETNVTLREV LLDSKLVFSK PQDRLFAGQI DRMDRFALRY RARKFQSEQY RMPYSGLRGQ RTNLIPHQLN IAHDVGRRHA PRVLLADEVG LGKTIEAGMI LHQQLLSGAA ERVLIIVPET LQHQWLVEML RRFNLRFALF DDERYTEAQH DAYNPFETEQ LVICSLDFAR RNKQRLEHLC DAEWDLLVVD EAHHLVWSTD APSREYMAIE QLAERVPGVL LLTATPEQLG MESHFARLRL LDPNRFHDFE QFVEEQKNYR PVADAVAMLL AGNKLSNDEL NRLGDLIGEQ DIEPLLQAAN SDRDDAQAAR DELVSMLMDR HGTSRVLFRN TRNGVKGFPK RELHTVKLPL PTQYQTAIKV SGIMGARKSA EDRARDMLYP EQIYQEFEGD TGTWWNFDPR VEWLMGYLTS HRSQKVLVIC AKATTALQLE QVLREREGIR AAVFHEGMSI IERDRAAAWF AEEDTGAQVL LCSEIGSEGR NFQFASNLVM FDLPFNPDLL EQRIGRLDRI GQAHDIQIHV PYLEKTAQSV LVRWYHEGLD AFEHTCPTGR AIYDSAYASL INYLAAPEET DGFDDLIKSC REQHEALKAQ LEQGRDRLLE IHSNGGEKAQ QLAQSIEEQD DDTNLIAFAM NLFDIVGINQ DDRGDNLIVL TPSDHMLVPD FPGLPEDGCT ITFERDVALS REDAQFITWE HPLIRNGLDL ILSGDTGSST ISLLKNKALP VGTLLVELVY VVEAQAPKQL QLNRFLPPTP VRMLLDKNGN NLAAQVEFET FNRQLSAVNR HTGSKLVNAV QQDVHAILQL GETQIEKSAR ALIDNARREA DEKLSGELSR LEALRAVNPN IRDDELAAID SNRQQVLESL NQAGWRLDAL RLIVVTHQ
|
| |
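The coordinate and length fields in the Gene Information block are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; neither convention is stated in the record itself, although both are consistent with its values. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


# Values copied from the record (Start bp, End bp).
length_bp = gene_length(105501, 108407)
print(length_bp)                  # 2907, matches "Gene Length"
print(protein_length(length_bp))  # 968, matches "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length (2907 bp) and Protein Length (968 aa) fields. The strand does not affect the length calculation, because the start and end coordinates are given on the replicon regardless of orientation.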