Gene SeD_A0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0101 
SymbolrapA 
ID6874087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp105501 
End bp108407 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content57% 
IMG OID642783355 
ProductATP-dependent helicase HepA 
Protein accessionYP_002214049 
Protein GI198245326 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTTTA CACTTGGTCA ACGCTGGATC AGCGATACAG AAAGCGAGCT GGGACTTGGA 
ACCGTTGTTG CGATGGATGC GCGAACCGTC ACCTTACTTT TCCCGTCCAC GGGGGAAAAC
CGCCTGTATG CGCGCAGTGA TTCTCCCGTG ACCCGCGTCA TGTTCAACCC TGGCGATACG
ATTACAAGCC ATGAAGGCTG GCAGCTACAT ATCGATGAAG TAAAAGAAGA AAATGGCCTG
CTTGTTTATG TCGGCACCCG CCTGGATACC GAAGAGACCA ATGTGACGTT GCGCGAGGTT
CTGCTCGACA GCAAGTTGGT TTTCAGTAAG CCCCAGGATC GTCTGTTCGC CGGTCAAATT
GATCGAATGG ATCGGTTCGC GCTGCGCTAT CGCGCCCGTA AATTTCAGAG CGAGCAGTAC
CGGATGCCGT ACAGCGGCCT GCGCGGTCAG CGGACCAACC TGATCCCGCA TCAGCTTAAT
ATCGCTCATG ATGTGGGTCG TCGCCACGCG CCACGCGTAC TGCTGGCGGA TGAAGTGGGC
TTAGGTAAAA CCATTGAAGC CGGGATGATT CTGCATCAAC AGTTATTATC CGGCGCGGCG
GAACGCGTAT TGATCATTGT TCCGGAAACC CTGCAACACC AGTGGCTGGT AGAAATGCTG
CGCCGTTTCA ACCTGCGCTT CGCGCTGTTC GATGACGAAC GCTATACCGA AGCGCAGCAC
GACGCGTATA ACCCGTTTGA AACCGAACAA CTGGTGATCT GCTCGCTGGA TTTCGCCCGC
CGTAATAAGC AGCGTCTGGA ACATTTGTGC GACGCCGAGT GGGATCTGCT GGTGGTCGAC
GAAGCGCATC ATCTGGTGTG GAGTACCGAT GCGCCGAGTC GTGAATATAT GGCCATCGAA
CAGTTAGCTG AACGCGTGCC GGGCGTGCTG CTGCTGACCG CCACGCCAGA ACAGCTGGGG
ATGGAAAGCC ATTTCGCCCG TCTGCGCCTG CTCGATCCGA ACCGTTTCCA CGATTTCGAA
CAGTTTGTCG AAGAACAGAA AAACTACCGT CCTGTCGCCG ATGCGGTGGC CATGCTGCTG
GCGGGCAATA AACTCAGCAA CGACGAACTG AACAGGCTGG GCGATCTGAT CGGCGAACAG
GATATTGAAC CGCTGTTGCA GGCCGCTAAT AGCGATCGCG ACGACGCGCA GGCCGCCCGT
GATGAGCTGG TGTCGATGCT GATGGATCGC CACGGCACCA GCCGCGTTCT GTTCCGCAAC
ACCCGTAACG GCGTCAAGGG GTTCCCGAAA CGTGAACTGC ATACGGTAAA ACTGCCGCTG
CCGACCCAGT ATCAGACCGC CATTAAGGTC TCCGGCATTA TGGGCGCGCG TAAAAGCGCG
GAAGATCGCG CCCGCGATAT GCTCTATCCG GAACAAATTT ATCAGGAGTT CGAAGGCGAT
ACTGGCACCT GGTGGAACTT CGACCCACGC GTTGAGTGGC TAATGGGCTA TCTGACCAGC
CATCGTTCGC AAAAAGTGCT GGTGATCTGC GCCAAAGCGA CCACCGCGTT ACAGCTGGAG
CAGGTGCTGC GCGAACGCGA AGGCATCCGC GCCGCCGTGT TCCATGAGGG CATGTCGATT
ATCGAACGCG ACCGCGCCGC CGCCTGGTTC GCCGAAGAAG ATACCGGCGC GCAGGTGCTG
TTATGTTCCG AAATCGGCTC CGAAGGACGT AACTTCCAGT TTGCCAGCAA TCTGGTGATG
TTCGACTTGC CGTTTAACCC GGATCTGCTG GAACAGCGTA TTGGTCGTCT GGATCGCATC
GGCCAGGCGC ATGATATCCA GATCCACGTC CCGTACCTGG AAAAAACCGC CCAGTCGGTG
CTGGTTCGCT GGTATCACGA AGGGCTGGAC GCCTTTGAGC ACACCTGTCC AACCGGCCGC
GCGATTTATG ATTCAGCCTA CGCCAGTCTG ATTAACTATC TGGCCGCGCC TGAAGAAACC
GACGGGTTTG ACGATCTGAT CAAATCCTGC CGCGAGCAAC ATGAAGCGCT AAAAGCCCAG
TTAGAACAGG GCCGCGACCG CCTGCTGGAG ATCCACTCCA ACGGTGGTGA AAAAGCCCAA
CAGCTTGCAC AAAGCATTGA AGAACAGGAC GACGACACCA ACCTGATCGC GTTCGCCATG
AACCTGTTCG ATATCGTCGG CATTAACCAG GACGATCGCG GCGACAACCT GATCGTGCTG
ACGCCGTCCG ACCACATGTT GGTGCCGGAT TTCCCCGGCC TGCCGGAAGA CGGCTGTACT
ATCACGTTTG AACGTGACGT AGCCCTGTCT CGCGAAGATG CGCAGTTTAT TACCTGGGAA
CATCCGCTCA TCCGTAACGG ACTGGATCTG ATCCTCTCCG GCGATACCGG CAGCAGCACC
ATTTCGCTGT TGAAAAATAA AGCGCTGCCA GTCGGCACGC TGCTGGTCGA ACTGGTTTAC
GTCGTGGAAG CGCAGGCGCC GAAACAGCTA CAACTGAACC GCTTCCTGCC GCCGACGCCG
GTACGTATGC TGTTAGATAA AAACGGCAAC AATCTGGCCG CCCAGGTCGA GTTTGAAACC
TTCAACCGCC AGCTCAGCGC CGTGAATCGC CACACCGGCA GCAAACTGGT GAACGCCGTG
CAACAGGACG TCCACGCCAT TTTGCAACTG GGCGAGACGC AGATTGAAAA GTCCGCCAGA
GCGCTGATTG ATAACGCGCG TCGCGAGGCG GATGAAAAAC TGTCCGGGGA ACTGTCGCGT
CTGGAAGCGC TGCGCGCCGT CAACCCGAAC ATTCGCGACG ACGAACTTGC CGCTATTGAC
AGCAACCGTC AGCAGGTACT GGAAAGCCTG AATCAGGCAG GCTGGCGTCT GGACGCGCTG
CGTCTTATCG TCGTCACGCA CCAATAA
 
Protein sequence
MPFTLGQRWI SDTESELGLG TVVAMDARTV TLLFPSTGEN RLYARSDSPV TRVMFNPGDT 
ITSHEGWQLH IDEVKEENGL LVYVGTRLDT EETNVTLREV LLDSKLVFSK PQDRLFAGQI
DRMDRFALRY RARKFQSEQY RMPYSGLRGQ RTNLIPHQLN IAHDVGRRHA PRVLLADEVG
LGKTIEAGMI LHQQLLSGAA ERVLIIVPET LQHQWLVEML RRFNLRFALF DDERYTEAQH
DAYNPFETEQ LVICSLDFAR RNKQRLEHLC DAEWDLLVVD EAHHLVWSTD APSREYMAIE
QLAERVPGVL LLTATPEQLG MESHFARLRL LDPNRFHDFE QFVEEQKNYR PVADAVAMLL
AGNKLSNDEL NRLGDLIGEQ DIEPLLQAAN SDRDDAQAAR DELVSMLMDR HGTSRVLFRN
TRNGVKGFPK RELHTVKLPL PTQYQTAIKV SGIMGARKSA EDRARDMLYP EQIYQEFEGD
TGTWWNFDPR VEWLMGYLTS HRSQKVLVIC AKATTALQLE QVLREREGIR AAVFHEGMSI
IERDRAAAWF AEEDTGAQVL LCSEIGSEGR NFQFASNLVM FDLPFNPDLL EQRIGRLDRI
GQAHDIQIHV PYLEKTAQSV LVRWYHEGLD AFEHTCPTGR AIYDSAYASL INYLAAPEET
DGFDDLIKSC REQHEALKAQ LEQGRDRLLE IHSNGGEKAQ QLAQSIEEQD DDTNLIAFAM
NLFDIVGINQ DDRGDNLIVL TPSDHMLVPD FPGLPEDGCT ITFERDVALS REDAQFITWE
HPLIRNGLDL ILSGDTGSST ISLLKNKALP VGTLLVELVY VVEAQAPKQL QLNRFLPPTP
VRMLLDKNGN NLAAQVEFET FNRQLSAVNR HTGSKLVNAV QQDVHAILQL GETQIEKSAR
ALIDNARREA DEKLSGELSR LEALRAVNPN IRDDELAAID SNRQQVLESL NQAGWRLDAL
RLIVVTHQ