Gene SNSL254_A0102 details


Gene Information

Locus tag: SNSL254_A0102
Symbol: rapA
ID: 6483324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 109968
End bp: 112874
Gene Length: 2907 bp
Protein Length: 968 aa
Translation table: 11
GC content: 57%
IMG OID: 642735545
Product: ATP-dependent helicase HepA
Protein accession: YP_002039327
Protein GI: 194446131
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
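
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the CDS ends with a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 109968
end_bp = 112874
gene_length_bp = 2907
protein_length_aa = 968

# Assumption: coordinates are 1-based and inclusive,
# so the span covered by the gene is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons. The final stop codon is
# not translated, so the protein has one residue fewer than codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

With the values listed in this record, all three checks agree: 112874 - 109968 + 1 = 2907, and 2907 / 3 - 1 = 968.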


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0648061
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value: 0.111163
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTTTA CACTTGGTCA ACGCTGGATC AGCGATACAG AAAGCGAGCT GGGACTTGGA 
ACCGTTGTTG CGATGGATGC GCGAACCGTC ACCTTACTTT TCCCGTCCAC GGGGGAAAAC
CGCCTGTATG CGCGCAGTGA TTCTCCCGTG ACCCGCGTCA TGTTCAACCC TGGCGATACG
ATTACAAGCC ATGAAGGCTG GCAGCTACAT ATCGATGAAG TAAAAGAAGA AAATGGCCTG
CTTGTTTATG TCGGCACCCG CCTGGATACC GAAGAGACCA ATGTGACGTT GCGCGAGGTT
CTGCTCGACA GCAAGTTGGT TTTCAGTAAG CCCCAGGATC GTCTGTTCGC CGGTCAAATC
GATCGAATGG ATCGGTTCGC GCTGCGCTAT CGCGCCCGTA AATTTCAGAG CGAGCAGTAC
CGGATGCCAT ACAGCGGCCT GCGCGGTCAG CGGACCAATC TGATCCCGCA TCAGCTTAAT
ATCGCTCATG ATGTAGGTCG TCGCCACGCG CCGCGCGTAC TGCTGGCGGA TGAAGTGGGC
TTAGGTAAAA CCATTGAAGC CGGGATGATT CTGCATCAAC AGTTATTATC CGGCGCGGCG
GAACGCGTAT TGATCATCGT TCCGGAAACC CTGCAACACC AGTGGCTGGT AGAAATGCTG
CGCCGTTTCA ACCTGCGCTT CGCGCTGTTC GATGACGAAC GCTATACCGA AGCGCAGCAC
GACGCGTATA ACCCGTTTGA AACCGAACAA CTAGTGATCT GTTCGCTGGA TTTCGCCCGC
CGTAATAAGC AGCGTCTGGA ACATTTGTGC GACGCCGAGT GGGATTTGCT GGTGGTCGAC
GAAGCGCATC ATCTGGTGTG GAGTACCGAT GCGCCGAGCC GTGAATATAT GGCCATCGAA
CAATTAGCTG AACGCGTACC GGGCGTGCTG CTGCTGACCG CCACGCCAGA ACAGCTGGGG
ATGGAAAGCC ATTTCGCCCG TCTGCGCCTG CTCGATCCGA ACCGTTTCCA CGATTTCGAA
CAGTTTGTCG AAGAACAGAA AAACTACCGC CCTGTCGCCG ATGCGGTGGC CATGCTGCTG
GCGGGCAATA AACTCAGCAA CGACGAACTG AACAGGCTGG GCGATCTGAT CGGCGAACAG
GATATTGAAC CGCTGTTGCA GGCCGCTAAT AGCGATCGCG ACGATGCGCA GGCCGCCCGT
GATGAGCTGG TGTCGATGCT GATGGATCGC CACGGCACCA GCCGCGTTCT GTTCCGCAAC
ACCCGTAACG GCGTCAAGGG GTTCCCGAAA CGTGAACTGC ATACGGTAAA ACTGCCGCTG
CCGACCCAGT ATCAGACCGC CATTAAGGTC TCCGGCATTA TGGGCGCGCG TAAAAGCGCG
GAAGATCGCG CCCGCGATAT GCTCTATCCG GAACAAATTT ATCAGGAGTT CGAAGGCGAT
ACTGGCACCT GGTGGAACTT CGACCCACGC GTTGAGTGGC TAATGGGCTA TCTGACCAGC
CATCGTTCGC AAAAAGTGCT GGTGATCTGC GCCAAAGCGA CCACCGCGTT ACAGCTGGAG
CAGGTGCTGC GCGAACGCGA AGGCATCCGC GCCGCCGTGT TCCATGAGGG CATGTCGATT
ATCGAACGCG ACCGCGCCGC CGCCTGGTTC GCCGAAGAAG ATACCGGCGC GCAGGTGCTG
TTATGTTCCG AAATCGGCTC CGAAGGACGT AACTTCCAGT TTGCCAGCAA TCTGGTGATG
TTCGACTTGC CGTTTAACCC GGATCTGCTG GAACAGCGTA TTGGTCGTCT GGATCGTATC
GGCCAGGCGC ATGATATCCA GATCCACGTC CCGTACCTGG AAAAAACCGC CCAGTCGGTG
CTGGTTCGCT GGTATCACGA AGGGCTGGAC GCCTTTGAGC ACACCTGCCC AACCGGTCGC
GCGATTTATG ATTCAGCCTA CGCCAGTCTG ATTAACTATC TGGCCGCGCC TGAAGAAACC
GACGGGTTTG ACGATCTGAT CAAATCCTGC CGCGAGCAAC ATGAAGCGCT AAAAGCCCAG
TTAGAACAGG GCCGCGACCG CCTGCTGGAG ATCCACTCCA ACGGCGGAGA AAAAGCCCAA
CAGCTCGCAC AAAGCATTGA AGAACAGGAC GACGACACCA ACCTGATCGC GTTCGCCATG
AACCTGTTCG ATATTGTCGG CATTAACCAG GACGATCGCG GCGACAACCT GATCGTGCTG
ACGCCGTCCG ACCACATGTT GGTGCCGGAT TTCCCCGGCC TGCCGGAAGA CGGCTGTACT
ATCACGTTTG AACGTGACGT AGCCCTGTCT CGCGAAGATG CGCAGTTTAT TACCTGGGAA
CATCCGCTCA TCCGTAACGG ACTGGATCTG ATCCTCTCCG GCGATACCGG CAGCAGCACC
ATTTCGCTGT TGAAAAATAA AGCGCTGCCA GTCGGCACGC TGCTGGTCGA ACTGGTTTAC
GTCGTGGAAG CGCAGGCGCC GAAACAGCTA CAACTGAACC GCTTCCTGCC GCCGACGCCG
GTACGTATGC TGTTAGATAA AAACGGCAAC AATCTGGCCG CCCAGGTCGA GTTTGAAACC
TTCAACCGTC AGCTCAGCGC CGTGAATCGC CACACCGGCA GCAAACTGGT TAACGCCGTG
CAACAGGACG TCTATGCCAT TTTGCAACTG GGCGAGACGC AGATTGAACA GTCCGCCAGA
GCGCTGATTG ATAACGCGCG TCGCGAGGCG GATGAAAAAC TGTCCGGGGA ACTGTCGCGT
CTGGAAGCGC TGCGCGCCGT CAACCCGAAC ATTCGCGACG ACGAACTTGC CGCTATCGAC
AGCAACCGTC AGCAGGTGCT GGAAAGCCTG AATCAGGCAG GCTGGCGTCT GGACGCGCTG
CGTCTTATCG TCGTCACGCA CCAATAA
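
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The sketch below shows one way to strip the layout and compute a GC fraction that could be compared with the GC content field of the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as a sample.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used here as a small sample.
sample = "ATGCCTTTTA CACTTGGTCA ACGCTGGATC AGCGATACAG AAAGCGAGCT GGGACTTGGA"
seq = clean_sequence(sample)
print(len(seq), "bases, GC percent:", round(gc_fraction(seq) * 100, 1))
```

Passing the full gene sequence block to `clean_sequence` instead of the sample gives a value for the whole gene.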
 
Protein sequence
MPFTLGQRWI SDTESELGLG TVVAMDARTV TLLFPSTGEN RLYARSDSPV TRVMFNPGDT 
ITSHEGWQLH IDEVKEENGL LVYVGTRLDT EETNVTLREV LLDSKLVFSK PQDRLFAGQI
DRMDRFALRY RARKFQSEQY RMPYSGLRGQ RTNLIPHQLN IAHDVGRRHA PRVLLADEVG
LGKTIEAGMI LHQQLLSGAA ERVLIIVPET LQHQWLVEML RRFNLRFALF DDERYTEAQH
DAYNPFETEQ LVICSLDFAR RNKQRLEHLC DAEWDLLVVD EAHHLVWSTD APSREYMAIE
QLAERVPGVL LLTATPEQLG MESHFARLRL LDPNRFHDFE QFVEEQKNYR PVADAVAMLL
AGNKLSNDEL NRLGDLIGEQ DIEPLLQAAN SDRDDAQAAR DELVSMLMDR HGTSRVLFRN
TRNGVKGFPK RELHTVKLPL PTQYQTAIKV SGIMGARKSA EDRARDMLYP EQIYQEFEGD
TGTWWNFDPR VEWLMGYLTS HRSQKVLVIC AKATTALQLE QVLREREGIR AAVFHEGMSI
IERDRAAAWF AEEDTGAQVL LCSEIGSEGR NFQFASNLVM FDLPFNPDLL EQRIGRLDRI
GQAHDIQIHV PYLEKTAQSV LVRWYHEGLD AFEHTCPTGR AIYDSAYASL INYLAAPEET
DGFDDLIKSC REQHEALKAQ LEQGRDRLLE IHSNGGEKAQ QLAQSIEEQD DDTNLIAFAM
NLFDIVGINQ DDRGDNLIVL TPSDHMLVPD FPGLPEDGCT ITFERDVALS REDAQFITWE
HPLIRNGLDL ILSGDTGSST ISLLKNKALP VGTLLVELVY VVEAQAPKQL QLNRFLPPTP
VRMLLDKNGN NLAAQVEFET FNRQLSAVNR HTGSKLVNAV QQDVYAILQL GETQIEQSAR
ALIDNARREA DEKLSGELSR LEALRAVNPN IRDDELAAID SNRQQVLESL NQAGWRLDAL
RLIVVTHQ
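
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own pipeline. It relies on translation table 11 using the same amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted), and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The `translate` function and `CODON_TABLE` name are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored, unknown codons become 'X', and a trailing
    partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence in this record.
print(translate("ATGCCTTTTA CACTTGGTCA ACGCTGGATC"))  # MPFTLGQRWI
print(translate("CGTCTTATCG TCGTCACGCA CCAATAA"))     # RLIVVTHQ
```

The two printed fragments match the first ten and last eight residues of the protein sequence listed above.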