Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A0102 |
Symbol | rapA |
ID | 6483324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 109968 |
End bp | 112874 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642735545 |
Product | ATP-dependent helicase HepA |
Protein accession | YP_002039327 |
Protein GI | 194446131 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0648061 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.111163 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTTTA CACTTGGTCA ACGCTGGATC AGCGATACAG AAAGCGAGCT GGGACTTGGA ACCGTTGTTG CGATGGATGC GCGAACCGTC ACCTTACTTT TCCCGTCCAC GGGGGAAAAC CGCCTGTATG CGCGCAGTGA TTCTCCCGTG ACCCGCGTCA TGTTCAACCC TGGCGATACG ATTACAAGCC ATGAAGGCTG GCAGCTACAT ATCGATGAAG TAAAAGAAGA AAATGGCCTG CTTGTTTATG TCGGCACCCG CCTGGATACC GAAGAGACCA ATGTGACGTT GCGCGAGGTT CTGCTCGACA GCAAGTTGGT TTTCAGTAAG CCCCAGGATC GTCTGTTCGC CGGTCAAATC GATCGAATGG ATCGGTTCGC GCTGCGCTAT CGCGCCCGTA AATTTCAGAG CGAGCAGTAC CGGATGCCAT ACAGCGGCCT GCGCGGTCAG CGGACCAATC TGATCCCGCA TCAGCTTAAT ATCGCTCATG ATGTAGGTCG TCGCCACGCG CCGCGCGTAC TGCTGGCGGA TGAAGTGGGC TTAGGTAAAA CCATTGAAGC CGGGATGATT CTGCATCAAC AGTTATTATC CGGCGCGGCG GAACGCGTAT TGATCATCGT TCCGGAAACC CTGCAACACC AGTGGCTGGT AGAAATGCTG CGCCGTTTCA ACCTGCGCTT CGCGCTGTTC GATGACGAAC GCTATACCGA AGCGCAGCAC GACGCGTATA ACCCGTTTGA AACCGAACAA CTAGTGATCT GTTCGCTGGA TTTCGCCCGC CGTAATAAGC AGCGTCTGGA ACATTTGTGC GACGCCGAGT GGGATTTGCT GGTGGTCGAC GAAGCGCATC ATCTGGTGTG GAGTACCGAT GCGCCGAGCC GTGAATATAT GGCCATCGAA CAATTAGCTG AACGCGTACC GGGCGTGCTG CTGCTGACCG CCACGCCAGA ACAGCTGGGG ATGGAAAGCC ATTTCGCCCG TCTGCGCCTG CTCGATCCGA ACCGTTTCCA CGATTTCGAA CAGTTTGTCG AAGAACAGAA AAACTACCGC CCTGTCGCCG ATGCGGTGGC CATGCTGCTG GCGGGCAATA AACTCAGCAA CGACGAACTG AACAGGCTGG GCGATCTGAT CGGCGAACAG GATATTGAAC CGCTGTTGCA GGCCGCTAAT AGCGATCGCG ACGATGCGCA GGCCGCCCGT GATGAGCTGG TGTCGATGCT GATGGATCGC CACGGCACCA GCCGCGTTCT GTTCCGCAAC ACCCGTAACG GCGTCAAGGG GTTCCCGAAA CGTGAACTGC ATACGGTAAA ACTGCCGCTG CCGACCCAGT ATCAGACCGC CATTAAGGTC TCCGGCATTA TGGGCGCGCG TAAAAGCGCG GAAGATCGCG CCCGCGATAT GCTCTATCCG GAACAAATTT ATCAGGAGTT CGAAGGCGAT ACTGGCACCT GGTGGAACTT CGACCCACGC GTTGAGTGGC TAATGGGCTA TCTGACCAGC CATCGTTCGC AAAAAGTGCT GGTGATCTGC GCCAAAGCGA CCACCGCGTT ACAGCTGGAG CAGGTGCTGC GCGAACGCGA AGGCATCCGC GCCGCCGTGT TCCATGAGGG CATGTCGATT ATCGAACGCG ACCGCGCCGC CGCCTGGTTC GCCGAAGAAG ATACCGGCGC GCAGGTGCTG TTATGTTCCG AAATCGGCTC CGAAGGACGT AACTTCCAGT TTGCCAGCAA TCTGGTGATG TTCGACTTGC CGTTTAACCC GGATCTGCTG GAACAGCGTA TTGGTCGTCT GGATCGTATC GGCCAGGCGC ATGATATCCA GATCCACGTC CCGTACCTGG AAAAAACCGC CCAGTCGGTG CTGGTTCGCT GGTATCACGA AGGGCTGGAC GCCTTTGAGC ACACCTGCCC AACCGGTCGC GCGATTTATG ATTCAGCCTA CGCCAGTCTG ATTAACTATC TGGCCGCGCC TGAAGAAACC GACGGGTTTG ACGATCTGAT CAAATCCTGC CGCGAGCAAC ATGAAGCGCT AAAAGCCCAG TTAGAACAGG GCCGCGACCG CCTGCTGGAG ATCCACTCCA ACGGCGGAGA AAAAGCCCAA CAGCTCGCAC AAAGCATTGA AGAACAGGAC GACGACACCA ACCTGATCGC GTTCGCCATG AACCTGTTCG ATATTGTCGG CATTAACCAG GACGATCGCG GCGACAACCT GATCGTGCTG ACGCCGTCCG ACCACATGTT GGTGCCGGAT TTCCCCGGCC TGCCGGAAGA CGGCTGTACT ATCACGTTTG AACGTGACGT AGCCCTGTCT CGCGAAGATG CGCAGTTTAT TACCTGGGAA CATCCGCTCA TCCGTAACGG ACTGGATCTG ATCCTCTCCG GCGATACCGG CAGCAGCACC ATTTCGCTGT TGAAAAATAA AGCGCTGCCA GTCGGCACGC TGCTGGTCGA ACTGGTTTAC GTCGTGGAAG CGCAGGCGCC GAAACAGCTA CAACTGAACC GCTTCCTGCC GCCGACGCCG GTACGTATGC TGTTAGATAA AAACGGCAAC AATCTGGCCG CCCAGGTCGA GTTTGAAACC TTCAACCGTC AGCTCAGCGC CGTGAATCGC CACACCGGCA GCAAACTGGT TAACGCCGTG CAACAGGACG TCTATGCCAT TTTGCAACTG GGCGAGACGC AGATTGAACA GTCCGCCAGA GCGCTGATTG ATAACGCGCG TCGCGAGGCG GATGAAAAAC TGTCCGGGGA ACTGTCGCGT CTGGAAGCGC TGCGCGCCGT CAACCCGAAC ATTCGCGACG ACGAACTTGC CGCTATCGAC AGCAACCGTC AGCAGGTGCT GGAAAGCCTG AATCAGGCAG GCTGGCGTCT GGACGCGCTG CGTCTTATCG TCGTCACGCA CCAATAA
|
Protein sequence | MPFTLGQRWI SDTESELGLG TVVAMDARTV TLLFPSTGEN RLYARSDSPV TRVMFNPGDT ITSHEGWQLH IDEVKEENGL LVYVGTRLDT EETNVTLREV LLDSKLVFSK PQDRLFAGQI DRMDRFALRY RARKFQSEQY RMPYSGLRGQ RTNLIPHQLN IAHDVGRRHA PRVLLADEVG LGKTIEAGMI LHQQLLSGAA ERVLIIVPET LQHQWLVEML RRFNLRFALF DDERYTEAQH DAYNPFETEQ LVICSLDFAR RNKQRLEHLC DAEWDLLVVD EAHHLVWSTD APSREYMAIE QLAERVPGVL LLTATPEQLG MESHFARLRL LDPNRFHDFE QFVEEQKNYR PVADAVAMLL AGNKLSNDEL NRLGDLIGEQ DIEPLLQAAN SDRDDAQAAR DELVSMLMDR HGTSRVLFRN TRNGVKGFPK RELHTVKLPL PTQYQTAIKV SGIMGARKSA EDRARDMLYP EQIYQEFEGD TGTWWNFDPR VEWLMGYLTS HRSQKVLVIC AKATTALQLE QVLREREGIR AAVFHEGMSI IERDRAAAWF AEEDTGAQVL LCSEIGSEGR NFQFASNLVM FDLPFNPDLL EQRIGRLDRI GQAHDIQIHV PYLEKTAQSV LVRWYHEGLD AFEHTCPTGR AIYDSAYASL INYLAAPEET DGFDDLIKSC REQHEALKAQ LEQGRDRLLE IHSNGGEKAQ QLAQSIEEQD DDTNLIAFAM NLFDIVGINQ DDRGDNLIVL TPSDHMLVPD FPGLPEDGCT ITFERDVALS REDAQFITWE HPLIRNGLDL ILSGDTGSST ISLLKNKALP VGTLLVELVY VVEAQAPKQL QLNRFLPPTP VRMLLDKNGN NLAAQVEFET FNRQLSAVNR HTGSKLVNAV QQDVYAILQL GETQIEQSAR ALIDNARREA DEKLSGELSR LEALRAVNPN IRDDELAAID SNRQQVLESL NQAGWRLDAL RLIVVTHQ
|
| |