Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0061 |
Symbol | rapA |
ID | 6144155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 65507 |
End bp | 68413 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641614962 |
Product | ATP-dependent helicase HepA |
Protein accession | YP_001742178 |
Protein GI | 170680507 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00130615 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.800173 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTTTA CACTTGGTCA ACGCTGGATC AGCGATACAG AAAGCGAATT GGGACTTGGA ACCGTTGTCG CGGTGGATGC GCGAACTGTC ACTTTACTTT TCCCATCTAC TGGTGAAAAC CGTCTGTACG CACGCAGTGA TTCCCCCGTG ACCCGCGTGA TGTTCAACCC TGGTGATACT ATTACCAGCC ATGATGGCTG GCAGATGCAA GTCGAAGAAG TAAAAGAAGA AAATGGCTTG CTGACCTATA TCGGTACTCG CCTGGATACT GAAGAGTCCG GCGTGGCCCT GCGTGAAGTT TTCCTTGATA GCAAACTGGT GTTCAGCAAA CCGCAGGACC GTCTGTTTGC CGGGCAGATT GACCGTATGG ACCGCTTTGC GCTGCGTTAT CGCGCGCGTA AGTATTCCAG CGAACAGTTC CGTATGCCGT ACAGCGGCCT GCGCGGTCAG CGTACCAGCC TGATCCCGCA CCAGCTCAAC ATCGCTCATG ATGTTGGTCG CCGCCACGCG CCGCGCGTCC TGTTGGCTGA CGAAGTGGGT TTAGGGAAAA CCATTGAAGC CGGGATGATC CTGCATCAGC AACTGCTCTC TGGCGCTGCT GAACGTGTGC TGATTATCGT CCCGGAAACC TTACAGCATC AGTGGCTGGT AGAAATGCTG CGCCGTTTCA ACCTGCGCTT TGCTCTGTTT GATGATGAGC GTTATGCCGA AGCTCAGCAC GATGCTTACA ACCCGTTCGA TACCGAGCAG CTGGTGATTT GCTCGCTGGA TTTTGCCCGT CGTAGCAAAC AGCGTCTGGA ACATCTCTGT GAAGCCGAGT GGGATCTACT GGTGGTCGAT GAAGCGCATC ACCTGGTGTG GAGCGAAGAT GCGCCGAGCC GTGAATATCA GGCCATTGAA CAACTGGCAG AGCACGTGCC GGGCGTTCTG CTGCTGACCG CAACCCCGGA ACAGCTGGGG ATGGAAAGCC ACTTCGCCCG TCTGCGTCTG CTGGACCCGA ACCGTTTCCA CGATTTTGCG CAATTCGTTG AAGAGCAGAA AAATTATCGT CCGGTTGCGG ACGCCGTTGC CATGCTGCTG GCAGGTAACA AACTGAGCAA TGATGAACTG AACATGCTCG GTGAGATGAT CGGCGAGCAG GATATCGAGC CGCTGTTGCA AGCAGCAAAC AGCGACAGCG AAGATGCACA GAGCGCCCGT CAGGAGCTGG TTTCGATGCT GATGGATCGC CACGGCACCA GCCGCGTGCT GTTCCGTAAC ACGCGTAACG GTGTGAAAGG CTTCCCGAAA CGCGAGCTGC ACACCATTAA GCTGCCGCTG CCGACACAGT ATCAGACGGC TATTAAAGTC TCCGGCATTA TGGGCGCACG CAAAAGTGCG GAAGACCGCG CTCGCGATAT GCTCTACCCG GAGCGTATTT ATCAGGAATT TGAAGGTGAT AACGCCACCT GGTGGAACTT CGATCCGCGC GTTGAGTGGC TGATGGGCTA CCTGACCAGC CATCGCTCTC AGAAAGTGCT GGTGATCTGT GCCAAAGCTG CCACTGCGCT GCAACTGGAG CAGGTACTGC GCGAACGTGA AGGTATTCGC GCCGCGGTAT TCCACGAAGG TATGTCGATT ATTGAACGTG ACCGCGCTGC CGCCTGGTTT GCTGAAGAAG ACACCGGCGC ACAGGTACTG CTGTGCTCGG AAATCGGTTC TGAAGGACGT AACTTCCAGT TCGCCAGCCA CATGGTGATG TTTGACCTGC CATTCAACCC GGATCTGCTG GAGCAGCGTA TTGGTCGTCT GGATCGTATC GGTCAGGCGC ACGATATTCA GATCCATGTG CCTTATCTGG AGAAAACCGC TCAGTCGGTG CTGGTGCGCT GGTATCACGA AGGTCTGGAT GCATTCGAGC ACACCTGCCC GACCGGACGC ACTATCTACG ATAGCGTATA CAACGATCTG ATTAACTATC TGGCTTCACC GGATGAGACC GAAGGCTTTG ACGATCTGAT CAAAAACTGC CGCGAGCAAC ATGAAGCGCT GAAAGCACAG CTGGAACAGG GCCGTGACCG CCTGCTGGAA ATCCACTCCA ACGGTGGCGA AAAAGCCCAG GCACTGGCAG AAAGCATTGA AGAGCAGGAT GACGATACCA ACCTGATCGC CTTCGCCATG AACCTGTTCG ATATTATCGG TATCAATCAG GACGATCGCG GCGACAACAT GATCGTACTG ACGCCGTCCG ATCATATGCT GGTGCCGGAC TTCCCTGGCT TGTCGGAAGA TGGCATCACC ATCACCTTTG ATCGTGAAGT GGCGCTGGCG CGTGAAGATG CGCAGTTTAT TACCTGGGAA CATCCGCTGA TCCGCAACGG TCTGGATCTG ATCCTTTCTG GCGATACCGG TAGCAGCACG ATTTCACTGT TGAAAAACAA AGCGTTGCCG GTAGGTACGC TGTTGGTGGA ACTGATTTAC GTGGTCGAAG CCCAGGCTCC GAAGCAGTTG CAGCTCAACC GCTTCCTGCC ACCGACGCCG GTACGTATGC TGCTGGATAA AAACGGCAAC AACCTGGCGG CGCAGGTGGA GTTTGAAACC TTTAACCGTC AGCTTAACGC GGTTAACCGT CACACCGGCA GCAAACTGGT TAACGCCGTG CAGCAGGATG TTCACGCGAT CCTTCAGCTG GGTGAAGCAC AGATCGAGAA ATCTGCCCGT GCACTGATTG ATGCCGCACG TAACGAAGCC GACGAAAAAC TGTCTGCCGA GCTGTCTCGT CTGGAAGCTC TGCGTGCAGT GAACCCGAAC ATTCGTGACG ACGAACTGAC CGCCATTGAG AGCAACCGTC AGCAGGTAAT GGAAAGCCTG GATCAGGCAG GCTGGCGTCT GGATGCCCTG CGTTTGATCG TTGTAACGCA TCAGTAA
|
Protein sequence | MPFTLGQRWI SDTESELGLG TVVAVDARTV TLLFPSTGEN RLYARSDSPV TRVMFNPGDT ITSHDGWQMQ VEEVKEENGL LTYIGTRLDT EESGVALREV FLDSKLVFSK PQDRLFAGQI DRMDRFALRY RARKYSSEQF RMPYSGLRGQ RTSLIPHQLN IAHDVGRRHA PRVLLADEVG LGKTIEAGMI LHQQLLSGAA ERVLIIVPET LQHQWLVEML RRFNLRFALF DDERYAEAQH DAYNPFDTEQ LVICSLDFAR RSKQRLEHLC EAEWDLLVVD EAHHLVWSED APSREYQAIE QLAEHVPGVL LLTATPEQLG MESHFARLRL LDPNRFHDFA QFVEEQKNYR PVADAVAMLL AGNKLSNDEL NMLGEMIGEQ DIEPLLQAAN SDSEDAQSAR QELVSMLMDR HGTSRVLFRN TRNGVKGFPK RELHTIKLPL PTQYQTAIKV SGIMGARKSA EDRARDMLYP ERIYQEFEGD NATWWNFDPR VEWLMGYLTS HRSQKVLVIC AKAATALQLE QVLREREGIR AAVFHEGMSI IERDRAAAWF AEEDTGAQVL LCSEIGSEGR NFQFASHMVM FDLPFNPDLL EQRIGRLDRI GQAHDIQIHV PYLEKTAQSV LVRWYHEGLD AFEHTCPTGR TIYDSVYNDL INYLASPDET EGFDDLIKNC REQHEALKAQ LEQGRDRLLE IHSNGGEKAQ ALAESIEEQD DDTNLIAFAM NLFDIIGINQ DDRGDNMIVL TPSDHMLVPD FPGLSEDGIT ITFDREVALA REDAQFITWE HPLIRNGLDL ILSGDTGSST ISLLKNKALP VGTLLVELIY VVEAQAPKQL QLNRFLPPTP VRMLLDKNGN NLAAQVEFET FNRQLNAVNR HTGSKLVNAV QQDVHAILQL GEAQIEKSAR ALIDAARNEA DEKLSAELSR LEALRAVNPN IRDDELTAIE SNRQQVMESL DQAGWRLDAL RLIVVTHQ
|
| |