Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0051 |
Symbol | rapA |
ID | 6271807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 51508 |
End bp | 54414 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641724310 |
Product | ATP-dependent helicase HepA |
Protein accession | YP_001878870 |
Protein GI | 187731450 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00000808065 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTTTA CACTTGGTCA ACGCTGGATC AGCGATACAG AAAGCGAATT GGGACTTGGA ACCGTTGTCG CGGTGGATGC GCGAACTGTC ACTTTACTTT TCCCATCTAC TGGTGAAAAC CGTCTGTACG CACGCAGTGA TTCCCCCGTG ACCCGCGTGA TGTTCAACCC TGGTGATACC ATTACCAGCC ATGACGGCTG GCAGATGCAA GTCGAAGAAG TAAAAGAAGA AAATGGCTTG CTGACCTATA TCGGTACTCG CCTGGATACT GAAGAGTCCG GCGTAGCCCT GCGTGAAGTT TTCCTTGATA GCAAACTGGT GTTCAGCAAA CCGCAGGACC GCCTGTTTGC CGGGCAGATT GACCGTATGG ACCGCTTTGC GCTGCGTTAT CGCGCGCGTA AGTATTCCAG CGAACAGTTC CGTATGCCGT ACAGCGGCCT GCGCGGTCAG CGTACCAGCC TGATCCCGCA CCAGCTCAAC ATCGCTCATG ATGTTGGCCG CCGCCACGCG CCGCGCGTCC TGCTGGCTGA CGAAGTGGGT TTAGGGAAAA CCATTGAAGC CGGGATGATC CTGCATCAGC AACTGCTCTC TGGCGCTGCT GAACGTGTGC TGATTATCGT CCCGGAAACC TTACAGCATC AGTGGCTGGT AGAAATGCTG CGCCGTTTCA ACCTGCGCTT TGCTCTGTTT GATGATGAGC GTTATGCCGA AGCTCAGCAC GATGCTTACA ACCCGTTTGA CACCGAACAG CTGGTGATTT GCTCGCTGGA TTTTGCCCGT CGTAGCAAAC AGCGCCTGGA ACATCTCTGT GAAGCCGAAT GGGACCTGCT GGTGGTCGAT GAAGCGCATC ACCTGGTGTG GAGCGAAGAT GCGCCAAGCC GTGAATATCA GGCCATTGAA CAACTGGCAG AGCACGTGCC GGGCGTTCTG CTGCTGACCG CGACCCCGGA ACAGCTGGGG ATGGAAAGCC ACTTCGCCCG TCTGCGTCTG CTGGACCCGA ACCGTTTCCA CGATTTTGCG CAGTTCGTTG AAGAGCAGAA AAATTATCGT CCGGTTGCGG ACGCCGTTGC CATGCTGCTG GCAGGTAACA AACTGAGCAA TGACGAACTG AACATGCTCG GCGAGATGAT CGGCGAGCAG GATATCGAGC CGCTGTTGCA GGCAGCAAAC AGCGACAGCG AAGATGCCCA GAGCGCCCGT CAGGAGCTGG TTTCGATGCT GATGGATCGC CACGGCACCA GCCGCGTGCT GTTCCGTAAC ACGCGTAACG GTGTGAAAGG ATTCCCGAAA CGCGAGCTGC ACACCATTAA GCTGCCGCTA CCGACGCAGT ATCAGACGGC TATTAAAGTC TCCGGCATTA TGGGCGCACG TAAAAGTGCG GAAGATCGTG CTCGCGATAT GCTCTACCCG GAGCGTATTT ATCAGGAATT TGAAGGTGAT AACGCCACCT GGTGGAACTT CGATCCGCGC GTTGAGTGGC TGATGGGCTA CCTGACCAGC CATCGCTCTC AGAAAGTGCT GGTGATCTGC GCCAAAGCTG CCACTGCGCT GCAACTGGAG CAGGTACTGC GCGAACGTGA AGGTATTCGC GCTGCGGTGT TCCACGAAGG TATGTCGATT ATCGAACGTG ACCGCGCTGC CGCCTGGTTT GCCGAAGAAG ACACCGGCGC ACAGGTACTG CTGTGCTCAG AAATCGGTTC TGAAGGACGT AACTTCCAGT TCGCCAGCCA CATGGTGATG TTTGACCTGC CATTCAACCC GGATCTACTG GAGCAGCGTA TTGGTCGTCT GGATCGTATC GGCCAGGCGC ACGATATTCA GATCCATGTG CCTTATCTGG AGAAAACCGC TCAGTCGGTG CTGGTGCGCT GGTATCACGA AGGTCTGGAT GCATTTGAGC ACACCTGCCC GACCGGACGC ACTATTTACG ATAGCGTATA CAACGATCTG ATTAACTATC TGGCTTCACC GGATCAAACC GAAGGCTTTG ACGATCTGAT CAAAAACTGC CGCGAGCAAC ATGAAGCGCT GAAAGCACAG CTGGAACAGG GTCGTGACCG CCTGCTGGAA ATCCACTCCA ACGGTGGCGA AAAAGCCCAG GCACTGGCAG AAAGCATTGA AGAGCAGGAT GACGATACCA ACCTGATCGC CTTCGCCATG AACCTGCTCG ATATTATCGG TATCAATCAG GACGATCGCG GCGACAACAT GATCGTGCTG ACGCCGTCCG ATCATATGCT GGTGCCGGAC TTCCCTGGCC TGTCGGAAGA TGGCATCACC ATCACCTTTG ATCGTGAAGT GGCGCTGGCG CGTGAAGATG CACAGTTTAT TACCTGGGAG CATCCGCTGA TCCGCAACGG TCTGGATCTG ATCCTTTCTG GCGATACCGG TAGCAGCACG ATTTCACTGT TAAAAAACAA AGCGTTGCCG GTAGGTACGC TGTTGGTGGA ACTGATTTAT GTGGTTGAAG CCCAGGCTCC GAAGCAGTTG CAGCTCAACC GCTTCCTGCC ACCGACGCCG GTACGTATGC TGCTGGATAA AAACGGCAAC AACCTGGCGG CGCAGGTAGA GTTTGAAACC TTTAACCGCC AGCTTAACGC GGTTAACCGT CACACCGGCA GCAAACTGGT TAACGCCGTG CAGCAGGATG TTCACGCTAT CCTTCAACTG GGTGAAGCGC AGATCGAGAA ATCTGCCCGT GCATTGATTG ATGCAGCGCG TAACGAAGCC GACGAAAAAC TGTCTGCCGA GCTGTCTCGT CTGGAAGCTC TGCGTGCAGT GAACCCGAAC ATTCGTGACG ACGAACTGAC CGCCATTGAG AGCAACCGTC AGCAGGTAAT GGAAAGCCTG GATCAGGCAG GTTGGCGTCT GGATGCCCTG CGTTTGATCG TTGTAACGCA TCAGTAA
|
Protein sequence | MPFTLGQRWI SDTESELGLG TVVAVDARTV TLLFPSTGEN RLYARSDSPV TRVMFNPGDT ITSHDGWQMQ VEEVKEENGL LTYIGTRLDT EESGVALREV FLDSKLVFSK PQDRLFAGQI DRMDRFALRY RARKYSSEQF RMPYSGLRGQ RTSLIPHQLN IAHDVGRRHA PRVLLADEVG LGKTIEAGMI LHQQLLSGAA ERVLIIVPET LQHQWLVEML RRFNLRFALF DDERYAEAQH DAYNPFDTEQ LVICSLDFAR RSKQRLEHLC EAEWDLLVVD EAHHLVWSED APSREYQAIE QLAEHVPGVL LLTATPEQLG MESHFARLRL LDPNRFHDFA QFVEEQKNYR PVADAVAMLL AGNKLSNDEL NMLGEMIGEQ DIEPLLQAAN SDSEDAQSAR QELVSMLMDR HGTSRVLFRN TRNGVKGFPK RELHTIKLPL PTQYQTAIKV SGIMGARKSA EDRARDMLYP ERIYQEFEGD NATWWNFDPR VEWLMGYLTS HRSQKVLVIC AKAATALQLE QVLREREGIR AAVFHEGMSI IERDRAAAWF AEEDTGAQVL LCSEIGSEGR NFQFASHMVM FDLPFNPDLL EQRIGRLDRI GQAHDIQIHV PYLEKTAQSV LVRWYHEGLD AFEHTCPTGR TIYDSVYNDL INYLASPDQT EGFDDLIKNC REQHEALKAQ LEQGRDRLLE IHSNGGEKAQ ALAESIEEQD DDTNLIAFAM NLLDIIGINQ DDRGDNMIVL TPSDHMLVPD FPGLSEDGIT ITFDREVALA REDAQFITWE HPLIRNGLDL ILSGDTGSST ISLLKNKALP VGTLLVELIY VVEAQAPKQL QLNRFLPPTP VRMLLDKNGN NLAAQVEFET FNRQLNAVNR HTGSKLVNAV QQDVHAILQL GEAQIEKSAR ALIDAARNEA DEKLSAELSR LEALRAVNPN IRDDELTAIE SNRQQVMESL DQAGWRLDAL RLIVVTHQ
|
| |