Gene SbBS512_E0051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0051 
SymbolrapA 
ID6271807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp51508 
End bp54414 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content55% 
IMG OID641724310 
ProductATP-dependent helicase HepA 
Protein accessionYP_001878870 
Protein GI187731450 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000808065 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTTTA CACTTGGTCA ACGCTGGATC AGCGATACAG AAAGCGAATT GGGACTTGGA 
ACCGTTGTCG CGGTGGATGC GCGAACTGTC ACTTTACTTT TCCCATCTAC TGGTGAAAAC
CGTCTGTACG CACGCAGTGA TTCCCCCGTG ACCCGCGTGA TGTTCAACCC TGGTGATACC
ATTACCAGCC ATGACGGCTG GCAGATGCAA GTCGAAGAAG TAAAAGAAGA AAATGGCTTG
CTGACCTATA TCGGTACTCG CCTGGATACT GAAGAGTCCG GCGTAGCCCT GCGTGAAGTT
TTCCTTGATA GCAAACTGGT GTTCAGCAAA CCGCAGGACC GCCTGTTTGC CGGGCAGATT
GACCGTATGG ACCGCTTTGC GCTGCGTTAT CGCGCGCGTA AGTATTCCAG CGAACAGTTC
CGTATGCCGT ACAGCGGCCT GCGCGGTCAG CGTACCAGCC TGATCCCGCA CCAGCTCAAC
ATCGCTCATG ATGTTGGCCG CCGCCACGCG CCGCGCGTCC TGCTGGCTGA CGAAGTGGGT
TTAGGGAAAA CCATTGAAGC CGGGATGATC CTGCATCAGC AACTGCTCTC TGGCGCTGCT
GAACGTGTGC TGATTATCGT CCCGGAAACC TTACAGCATC AGTGGCTGGT AGAAATGCTG
CGCCGTTTCA ACCTGCGCTT TGCTCTGTTT GATGATGAGC GTTATGCCGA AGCTCAGCAC
GATGCTTACA ACCCGTTTGA CACCGAACAG CTGGTGATTT GCTCGCTGGA TTTTGCCCGT
CGTAGCAAAC AGCGCCTGGA ACATCTCTGT GAAGCCGAAT GGGACCTGCT GGTGGTCGAT
GAAGCGCATC ACCTGGTGTG GAGCGAAGAT GCGCCAAGCC GTGAATATCA GGCCATTGAA
CAACTGGCAG AGCACGTGCC GGGCGTTCTG CTGCTGACCG CGACCCCGGA ACAGCTGGGG
ATGGAAAGCC ACTTCGCCCG TCTGCGTCTG CTGGACCCGA ACCGTTTCCA CGATTTTGCG
CAGTTCGTTG AAGAGCAGAA AAATTATCGT CCGGTTGCGG ACGCCGTTGC CATGCTGCTG
GCAGGTAACA AACTGAGCAA TGACGAACTG AACATGCTCG GCGAGATGAT CGGCGAGCAG
GATATCGAGC CGCTGTTGCA GGCAGCAAAC AGCGACAGCG AAGATGCCCA GAGCGCCCGT
CAGGAGCTGG TTTCGATGCT GATGGATCGC CACGGCACCA GCCGCGTGCT GTTCCGTAAC
ACGCGTAACG GTGTGAAAGG ATTCCCGAAA CGCGAGCTGC ACACCATTAA GCTGCCGCTA
CCGACGCAGT ATCAGACGGC TATTAAAGTC TCCGGCATTA TGGGCGCACG TAAAAGTGCG
GAAGATCGTG CTCGCGATAT GCTCTACCCG GAGCGTATTT ATCAGGAATT TGAAGGTGAT
AACGCCACCT GGTGGAACTT CGATCCGCGC GTTGAGTGGC TGATGGGCTA CCTGACCAGC
CATCGCTCTC AGAAAGTGCT GGTGATCTGC GCCAAAGCTG CCACTGCGCT GCAACTGGAG
CAGGTACTGC GCGAACGTGA AGGTATTCGC GCTGCGGTGT TCCACGAAGG TATGTCGATT
ATCGAACGTG ACCGCGCTGC CGCCTGGTTT GCCGAAGAAG ACACCGGCGC ACAGGTACTG
CTGTGCTCAG AAATCGGTTC TGAAGGACGT AACTTCCAGT TCGCCAGCCA CATGGTGATG
TTTGACCTGC CATTCAACCC GGATCTACTG GAGCAGCGTA TTGGTCGTCT GGATCGTATC
GGCCAGGCGC ACGATATTCA GATCCATGTG CCTTATCTGG AGAAAACCGC TCAGTCGGTG
CTGGTGCGCT GGTATCACGA AGGTCTGGAT GCATTTGAGC ACACCTGCCC GACCGGACGC
ACTATTTACG ATAGCGTATA CAACGATCTG ATTAACTATC TGGCTTCACC GGATCAAACC
GAAGGCTTTG ACGATCTGAT CAAAAACTGC CGCGAGCAAC ATGAAGCGCT GAAAGCACAG
CTGGAACAGG GTCGTGACCG CCTGCTGGAA ATCCACTCCA ACGGTGGCGA AAAAGCCCAG
GCACTGGCAG AAAGCATTGA AGAGCAGGAT GACGATACCA ACCTGATCGC CTTCGCCATG
AACCTGCTCG ATATTATCGG TATCAATCAG GACGATCGCG GCGACAACAT GATCGTGCTG
ACGCCGTCCG ATCATATGCT GGTGCCGGAC TTCCCTGGCC TGTCGGAAGA TGGCATCACC
ATCACCTTTG ATCGTGAAGT GGCGCTGGCG CGTGAAGATG CACAGTTTAT TACCTGGGAG
CATCCGCTGA TCCGCAACGG TCTGGATCTG ATCCTTTCTG GCGATACCGG TAGCAGCACG
ATTTCACTGT TAAAAAACAA AGCGTTGCCG GTAGGTACGC TGTTGGTGGA ACTGATTTAT
GTGGTTGAAG CCCAGGCTCC GAAGCAGTTG CAGCTCAACC GCTTCCTGCC ACCGACGCCG
GTACGTATGC TGCTGGATAA AAACGGCAAC AACCTGGCGG CGCAGGTAGA GTTTGAAACC
TTTAACCGCC AGCTTAACGC GGTTAACCGT CACACCGGCA GCAAACTGGT TAACGCCGTG
CAGCAGGATG TTCACGCTAT CCTTCAACTG GGTGAAGCGC AGATCGAGAA ATCTGCCCGT
GCATTGATTG ATGCAGCGCG TAACGAAGCC GACGAAAAAC TGTCTGCCGA GCTGTCTCGT
CTGGAAGCTC TGCGTGCAGT GAACCCGAAC ATTCGTGACG ACGAACTGAC CGCCATTGAG
AGCAACCGTC AGCAGGTAAT GGAAAGCCTG GATCAGGCAG GTTGGCGTCT GGATGCCCTG
CGTTTGATCG TTGTAACGCA TCAGTAA
 
Protein sequence
MPFTLGQRWI SDTESELGLG TVVAVDARTV TLLFPSTGEN RLYARSDSPV TRVMFNPGDT 
ITSHDGWQMQ VEEVKEENGL LTYIGTRLDT EESGVALREV FLDSKLVFSK PQDRLFAGQI
DRMDRFALRY RARKYSSEQF RMPYSGLRGQ RTSLIPHQLN IAHDVGRRHA PRVLLADEVG
LGKTIEAGMI LHQQLLSGAA ERVLIIVPET LQHQWLVEML RRFNLRFALF DDERYAEAQH
DAYNPFDTEQ LVICSLDFAR RSKQRLEHLC EAEWDLLVVD EAHHLVWSED APSREYQAIE
QLAEHVPGVL LLTATPEQLG MESHFARLRL LDPNRFHDFA QFVEEQKNYR PVADAVAMLL
AGNKLSNDEL NMLGEMIGEQ DIEPLLQAAN SDSEDAQSAR QELVSMLMDR HGTSRVLFRN
TRNGVKGFPK RELHTIKLPL PTQYQTAIKV SGIMGARKSA EDRARDMLYP ERIYQEFEGD
NATWWNFDPR VEWLMGYLTS HRSQKVLVIC AKAATALQLE QVLREREGIR AAVFHEGMSI
IERDRAAAWF AEEDTGAQVL LCSEIGSEGR NFQFASHMVM FDLPFNPDLL EQRIGRLDRI
GQAHDIQIHV PYLEKTAQSV LVRWYHEGLD AFEHTCPTGR TIYDSVYNDL INYLASPDQT
EGFDDLIKNC REQHEALKAQ LEQGRDRLLE IHSNGGEKAQ ALAESIEEQD DDTNLIAFAM
NLLDIIGINQ DDRGDNMIVL TPSDHMLVPD FPGLSEDGIT ITFDREVALA REDAQFITWE
HPLIRNGLDL ILSGDTGSST ISLLKNKALP VGTLLVELIY VVEAQAPKQL QLNRFLPPTP
VRMLLDKNGN NLAAQVEFET FNRQLNAVNR HTGSKLVNAV QQDVHAILQL GEAQIEKSAR
ALIDAARNEA DEKLSAELSR LEALRAVNPN IRDDELTAIE SNRQQVMESL DQAGWRLDAL
RLIVVTHQ