Gene EcSMS35_0061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0061 
SymbolrapA 
ID6144155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp65507 
End bp68413 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content55% 
IMG OID641614962 
ProductATP-dependent helicase HepA 
Protein accessionYP_001742178 
Protein GI170680507 
COG category[L] Replication, recombination and repair
[K] Transcription 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00130615 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.800173 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTTTA CACTTGGTCA ACGCTGGATC AGCGATACAG AAAGCGAATT GGGACTTGGA 
ACCGTTGTCG CGGTGGATGC GCGAACTGTC ACTTTACTTT TCCCATCTAC TGGTGAAAAC
CGTCTGTACG CACGCAGTGA TTCCCCCGTG ACCCGCGTGA TGTTCAACCC TGGTGATACT
ATTACCAGCC ATGATGGCTG GCAGATGCAA GTCGAAGAAG TAAAAGAAGA AAATGGCTTG
CTGACCTATA TCGGTACTCG CCTGGATACT GAAGAGTCCG GCGTGGCCCT GCGTGAAGTT
TTCCTTGATA GCAAACTGGT GTTCAGCAAA CCGCAGGACC GTCTGTTTGC CGGGCAGATT
GACCGTATGG ACCGCTTTGC GCTGCGTTAT CGCGCGCGTA AGTATTCCAG CGAACAGTTC
CGTATGCCGT ACAGCGGCCT GCGCGGTCAG CGTACCAGCC TGATCCCGCA CCAGCTCAAC
ATCGCTCATG ATGTTGGTCG CCGCCACGCG CCGCGCGTCC TGTTGGCTGA CGAAGTGGGT
TTAGGGAAAA CCATTGAAGC CGGGATGATC CTGCATCAGC AACTGCTCTC TGGCGCTGCT
GAACGTGTGC TGATTATCGT CCCGGAAACC TTACAGCATC AGTGGCTGGT AGAAATGCTG
CGCCGTTTCA ACCTGCGCTT TGCTCTGTTT GATGATGAGC GTTATGCCGA AGCTCAGCAC
GATGCTTACA ACCCGTTCGA TACCGAGCAG CTGGTGATTT GCTCGCTGGA TTTTGCCCGT
CGTAGCAAAC AGCGTCTGGA ACATCTCTGT GAAGCCGAGT GGGATCTACT GGTGGTCGAT
GAAGCGCATC ACCTGGTGTG GAGCGAAGAT GCGCCGAGCC GTGAATATCA GGCCATTGAA
CAACTGGCAG AGCACGTGCC GGGCGTTCTG CTGCTGACCG CAACCCCGGA ACAGCTGGGG
ATGGAAAGCC ACTTCGCCCG TCTGCGTCTG CTGGACCCGA ACCGTTTCCA CGATTTTGCG
CAATTCGTTG AAGAGCAGAA AAATTATCGT CCGGTTGCGG ACGCCGTTGC CATGCTGCTG
GCAGGTAACA AACTGAGCAA TGATGAACTG AACATGCTCG GTGAGATGAT CGGCGAGCAG
GATATCGAGC CGCTGTTGCA AGCAGCAAAC AGCGACAGCG AAGATGCACA GAGCGCCCGT
CAGGAGCTGG TTTCGATGCT GATGGATCGC CACGGCACCA GCCGCGTGCT GTTCCGTAAC
ACGCGTAACG GTGTGAAAGG CTTCCCGAAA CGCGAGCTGC ACACCATTAA GCTGCCGCTG
CCGACACAGT ATCAGACGGC TATTAAAGTC TCCGGCATTA TGGGCGCACG CAAAAGTGCG
GAAGACCGCG CTCGCGATAT GCTCTACCCG GAGCGTATTT ATCAGGAATT TGAAGGTGAT
AACGCCACCT GGTGGAACTT CGATCCGCGC GTTGAGTGGC TGATGGGCTA CCTGACCAGC
CATCGCTCTC AGAAAGTGCT GGTGATCTGT GCCAAAGCTG CCACTGCGCT GCAACTGGAG
CAGGTACTGC GCGAACGTGA AGGTATTCGC GCCGCGGTAT TCCACGAAGG TATGTCGATT
ATTGAACGTG ACCGCGCTGC CGCCTGGTTT GCTGAAGAAG ACACCGGCGC ACAGGTACTG
CTGTGCTCGG AAATCGGTTC TGAAGGACGT AACTTCCAGT TCGCCAGCCA CATGGTGATG
TTTGACCTGC CATTCAACCC GGATCTGCTG GAGCAGCGTA TTGGTCGTCT GGATCGTATC
GGTCAGGCGC ACGATATTCA GATCCATGTG CCTTATCTGG AGAAAACCGC TCAGTCGGTG
CTGGTGCGCT GGTATCACGA AGGTCTGGAT GCATTCGAGC ACACCTGCCC GACCGGACGC
ACTATCTACG ATAGCGTATA CAACGATCTG ATTAACTATC TGGCTTCACC GGATGAGACC
GAAGGCTTTG ACGATCTGAT CAAAAACTGC CGCGAGCAAC ATGAAGCGCT GAAAGCACAG
CTGGAACAGG GCCGTGACCG CCTGCTGGAA ATCCACTCCA ACGGTGGCGA AAAAGCCCAG
GCACTGGCAG AAAGCATTGA AGAGCAGGAT GACGATACCA ACCTGATCGC CTTCGCCATG
AACCTGTTCG ATATTATCGG TATCAATCAG GACGATCGCG GCGACAACAT GATCGTACTG
ACGCCGTCCG ATCATATGCT GGTGCCGGAC TTCCCTGGCT TGTCGGAAGA TGGCATCACC
ATCACCTTTG ATCGTGAAGT GGCGCTGGCG CGTGAAGATG CGCAGTTTAT TACCTGGGAA
CATCCGCTGA TCCGCAACGG TCTGGATCTG ATCCTTTCTG GCGATACCGG TAGCAGCACG
ATTTCACTGT TGAAAAACAA AGCGTTGCCG GTAGGTACGC TGTTGGTGGA ACTGATTTAC
GTGGTCGAAG CCCAGGCTCC GAAGCAGTTG CAGCTCAACC GCTTCCTGCC ACCGACGCCG
GTACGTATGC TGCTGGATAA AAACGGCAAC AACCTGGCGG CGCAGGTGGA GTTTGAAACC
TTTAACCGTC AGCTTAACGC GGTTAACCGT CACACCGGCA GCAAACTGGT TAACGCCGTG
CAGCAGGATG TTCACGCGAT CCTTCAGCTG GGTGAAGCAC AGATCGAGAA ATCTGCCCGT
GCACTGATTG ATGCCGCACG TAACGAAGCC GACGAAAAAC TGTCTGCCGA GCTGTCTCGT
CTGGAAGCTC TGCGTGCAGT GAACCCGAAC ATTCGTGACG ACGAACTGAC CGCCATTGAG
AGCAACCGTC AGCAGGTAAT GGAAAGCCTG GATCAGGCAG GCTGGCGTCT GGATGCCCTG
CGTTTGATCG TTGTAACGCA TCAGTAA
 
Protein sequence
MPFTLGQRWI SDTESELGLG TVVAVDARTV TLLFPSTGEN RLYARSDSPV TRVMFNPGDT 
ITSHDGWQMQ VEEVKEENGL LTYIGTRLDT EESGVALREV FLDSKLVFSK PQDRLFAGQI
DRMDRFALRY RARKYSSEQF RMPYSGLRGQ RTSLIPHQLN IAHDVGRRHA PRVLLADEVG
LGKTIEAGMI LHQQLLSGAA ERVLIIVPET LQHQWLVEML RRFNLRFALF DDERYAEAQH
DAYNPFDTEQ LVICSLDFAR RSKQRLEHLC EAEWDLLVVD EAHHLVWSED APSREYQAIE
QLAEHVPGVL LLTATPEQLG MESHFARLRL LDPNRFHDFA QFVEEQKNYR PVADAVAMLL
AGNKLSNDEL NMLGEMIGEQ DIEPLLQAAN SDSEDAQSAR QELVSMLMDR HGTSRVLFRN
TRNGVKGFPK RELHTIKLPL PTQYQTAIKV SGIMGARKSA EDRARDMLYP ERIYQEFEGD
NATWWNFDPR VEWLMGYLTS HRSQKVLVIC AKAATALQLE QVLREREGIR AAVFHEGMSI
IERDRAAAWF AEEDTGAQVL LCSEIGSEGR NFQFASHMVM FDLPFNPDLL EQRIGRLDRI
GQAHDIQIHV PYLEKTAQSV LVRWYHEGLD AFEHTCPTGR TIYDSVYNDL INYLASPDET
EGFDDLIKNC REQHEALKAQ LEQGRDRLLE IHSNGGEKAQ ALAESIEEQD DDTNLIAFAM
NLFDIIGINQ DDRGDNMIVL TPSDHMLVPD FPGLSEDGIT ITFDREVALA REDAQFITWE
HPLIRNGLDL ILSGDTGSST ISLLKNKALP VGTLLVELIY VVEAQAPKQL QLNRFLPPTP
VRMLLDKNGN NLAAQVEFET FNRQLNAVNR HTGSKLVNAV QQDVHAILQL GEAQIEKSAR
ALIDAARNEA DEKLSAELSR LEALRAVNPN IRDDELTAIE SNRQQVMESL DQAGWRLDAL
RLIVVTHQ