Gene Information |
Locus tag | EcE24377A_0061 |
Symbol | rapA |
ID | 5590175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 62164 |
End bp | 65070 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640923792 |
Product | ATP-dependent helicase HepA |
Protein accession | YP_001461229 |
Protein GI | 157157613 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
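The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 62164
END_BP = 65070


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, stop_codons: int = 1) -> int:
    """Amino-acid count for a CDS, assuming it ends in `stop_codons` stop codons."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - stop_codons


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    return sum(b in "GC" for b in bases) / len(bases)


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (2907 bp) and Protein Length (968 aa) fields. `gc_fraction` can be applied to the Gene sequence field as printed, since it skips the spaces between the 10-base blocks.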
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000176621 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTTTA CACTTGGTCA ACGCTGGATC AGCGATACAG AAAGCGAATT GGGACTTGGA ACCGTTGTCG CGGTGGATGC GCGAACTGTC ACTTTACTTT TCCCATCTAC TGGTGAAAAC CGTCTGTACG CACGCAGTGA TTCCCCCGTG ACCCGCGTGA TGTTCAACCC TGGTGATACC ATTACCAGCC ATGACGGCTG GCAGATGCAA GTCGAAGAAG TAAAAGAAGA AAATGGCTTG CTGACCTATA TCGGTACTCG CCTGGATACT GAAGAGTCCG GCGTAGCCCT GCGTGAAGTT TTCCTTGATA GCAAACTGGT GTTCAGCAAA CCGCAGGACC GTCTGTTTGC CGGGCAGATT GACCGTATGG ACCGCTTTGC GCTGCGTTAT CGCGCGCGTA AGTATTCCAG CGAACAGTTC CGTATGCCGT ACAGCGGCCT GCGCGGTCAG CGTACCAGTC TGATTCCGCA TCAGCTCAAC ATCGCTCATG ATGTTGGTCG CCGCCACGCG CCGCGCGTCC TGCTGGCTGA CGAAGTGGGT TTAGGGAAAA CCATTGAAGC CGGGATGATC CTGCATCAGC AACTGCTCTC TGGCGCTGCT GAACGTGTGC TGATTATCGT CCCGGAAACC TTACAGCATC AGTGGCTGGT AGAAATGCTG CGCCGTTTCA ACCTGCGCTT TGCACTGTTT GATGATGAGC GTTATGCCGA AGCTCAGCAC GATGCTTACA ACCCGTTTGA CACCGAACAG CTGGTGATTT GCTCGCTGGA TTTTGCCCGT CGTAGCAAAC AGCGCCTGGA ACATCTCTGT GAAGCCGAAT GGGACCTGCT GGTGGTCGAT GAAGCGCATC ACCTGGTGTG GAGCGAAGAT GCGCCAAGCC GTGAATATCA GGCCATTGAA CAACTGGCAG AGCACGTGCC GGGCGTTCTG CTGCTGACCG CGACCCCGGA ACAGCTGGGG ATGGAAAGCC ACTTCGCCCG TCTGCGTCTG CTGGACCCGA ACCGTTTCCA CGATTTTGCG CAGTTCGTTG AAGAGCAGAA AAATTATCGT CCGGTTGCGG ACGCCGTTGC CATGCTGCTG GCAGGTAACA AACTGAGCAA TGACGAACTG AACATGCTTG GCGAGATGAT CGGCGAGCAG GATATCGAGC CGCTGTTGCA AGCAGCAAAC AGCGACAGCG AAGATGCCCA GAGCGCCCGT CAGGAACTGG TTTCGATGCT GATGGATCGC CACGGCACCA GCCGCGTGCT GTTCCGTAAC ACGCGTAACG GTGTGAAAGG ATTCCCGAAA CGCGAGCTGC ACACCATTAA ACTGCCGCTA CCGACGCAGT ATCAGACGGC TATTAAAGTC TCCGGCATTA TGGGCGCACG TAAAAGTGCG GAAGATCGTG CTCGCGATAT GCTCTACCCG GAGCGTATTT ATCAGGAATT TGAAGGTGAT AACGCCACCT GGTGGAACTT CGATCCGCGC GTTGAGTGGC TGATGGGCTA CCTGACCAGC CATCGCTCTC AGAAAGTGCT GGTGATCTGC GCCAAAGCTG CCACTGCGCT GCAACTGGAG CAGGTACTGC GCGAACGTGA AGGTATTCGC GCTGCGGTGT TCCACGAAGG TATGTCGATT ATCGAACGTG ACCGCGCTGC CGCCTGGTTT GCCGAAGAAG ACACCGGCGC ACAGGTACTG CTGTGCTCAG AAATCGGTTC TGAAGGACGT AACTTCCAGT TCGCCAGCCA CATGGTGATG TTTGACCTGC CATTCAACCC GGATCTACTG GAGCAGCGTA TTGGTCGTCT GGATCGTATC 
GGCCAGGCGC ACGATATTCA GATCCATGTG CCTTATCTGG AGAAAACCGC TCAGTCGGTG CTGGTGCGCT GGTATCACGA AGGTCTGGAT GCATTTGAGC ACACCTGCCC GACTGGACGC ACTATTTACG ATAGCGTATA CAACGATCTG ATTAACTATC TGGCTTCACC GGATCAAACC GAAGGCTTTG ACGATCTGAT CAAAAACTGC CGCGAGCAAC ATGAAGCGCT GAAAGCACAG CTGGAACAGG GTCGTGACCG CCTGCTGGAA ATCCACTCCA ACGGTGGCGA AAAAGCCCAG GCACTGGCAG AAAGCATTGA AGAGCAGGAT GACGATACCA ACCTGATCGC CTTCGCCATG AACCTGTTCG ATATTATCGG TATCAATCAG GACGATCGCG GCGACAACAT GATCGTGCTG ACGCCGTCCG ATCATATGCT GGTGCCGGAC TTCCCTGGCC TGTCGGAAGA TGGCATCACC ATCACCTTTG ATCGTGAAGT GGCACTGGCG CGTGAAGATG CACAGTTTAT TACCTGGGAG CATCCGCTGA TCCGCAACGG TCTGGATCTG ATCCTTTCTG GCGATACCGG TAGCAGCACG ATTTCACTGT TAAAAAACAA AGCGTTGCCG GTAGGTACGC TGTTGGTGGA ACTGATTTAT GTGGTTGAAG CCCAGGCGCC GAAGCAGTTG CAGCTCAACC GCTTCCTGCC ACCGACGCCG GTACGTATGC TGCTGGATAA AAACGGCAAC AACCTGGCGG CGCAGGTAGA GTTTGAAACC TTTAACCGCC AGCTTAACGC GGTTAACCGT CACACCGGCA GCAAACTGGT TAACGCCGTG CAGCAGGATG TTCACGCTAT CCTTCAACTG GGTGAAGCGC AGATCGAGAA ATCTGCCCGT GCATTGATTG ATGCAGCGCG TAACGAAGCC GACGAAAAAC TGTCTGCCGA GCTGTCTCGT CTGGAAGCTC TGCGTGCAGT GAACCCGAAC ATTCGTGACG ACGAACTGAC CGCCATTGAG AGCAACCGTC AGCAGATAAT GGAAAGCCTG GATCAGGCAG GTTGGCGTCT GGATGCCCTG CGTTTGATCG TTGTGACGCA TCAGTAA
|
Protein sequence | MPFTLGQRWI SDTESELGLG TVVAVDARTV TLLFPSTGEN RLYARSDSPV TRVMFNPGDT ITSHDGWQMQ VEEVKEENGL LTYIGTRLDT EESGVALREV FLDSKLVFSK PQDRLFAGQI DRMDRFALRY RARKYSSEQF RMPYSGLRGQ RTSLIPHQLN IAHDVGRRHA PRVLLADEVG LGKTIEAGMI LHQQLLSGAA ERVLIIVPET LQHQWLVEML RRFNLRFALF DDERYAEAQH DAYNPFDTEQ LVICSLDFAR RSKQRLEHLC EAEWDLLVVD EAHHLVWSED APSREYQAIE QLAEHVPGVL LLTATPEQLG MESHFARLRL LDPNRFHDFA QFVEEQKNYR PVADAVAMLL AGNKLSNDEL NMLGEMIGEQ DIEPLLQAAN SDSEDAQSAR QELVSMLMDR HGTSRVLFRN TRNGVKGFPK RELHTIKLPL PTQYQTAIKV SGIMGARKSA EDRARDMLYP ERIYQEFEGD NATWWNFDPR VEWLMGYLTS HRSQKVLVIC AKAATALQLE QVLREREGIR AAVFHEGMSI IERDRAAAWF AEEDTGAQVL LCSEIGSEGR NFQFASHMVM FDLPFNPDLL EQRIGRLDRI GQAHDIQIHV PYLEKTAQSV LVRWYHEGLD AFEHTCPTGR TIYDSVYNDL INYLASPDQT EGFDDLIKNC REQHEALKAQ LEQGRDRLLE IHSNGGEKAQ ALAESIEEQD DDTNLIAFAM NLFDIIGINQ DDRGDNMIVL TPSDHMLVPD FPGLSEDGIT ITFDREVALA REDAQFITWE HPLIRNGLDL ILSGDTGSST ISLLKNKALP VGTLLVELIY VVEAQAPKQL QLNRFLPPTP VRMLLDKNGN NLAAQVEFET FNRQLNAVNR HTGSKLVNAV QQDVHAILQL GEAQIEKSAR ALIDAARNEA DEKLSAELSR LEALRAVNPN IRDDELTAIE SNRQQIMESL DQAGWRLDAL RLIVVTHQ
|
| |