Gene B21_00060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00060 
SymbolhepA 
ID8113879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp63163 
End bp66069 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content55% 
IMG OID644846354 
Producthypothetical protein 
Protein accessionYP_002997927 
Protein GI251783623 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.326785 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTTTA CACTTGGTCA ACGCTGGATC AGCGATACAG AAAGCGAATT GGGACTTGGA 
ACCGTTGTCG CGGTGGATGC GCGAACTGTC ACTTTACTTT TCCCATCTAC TGGTGAAAAC
CGTCTGTACG CACGCAGTGA TTCCCCCGTG ACCCGCGTGA TGTTCAACCC TGGTGATACC
ATTACCAGCC ATGACGGCTG GCAGATGCAA GTCGAAGAAG TAAAAGAAGA AAATGGCTTG
CTGACCTATA TCGGTACTCG CCTGGATACT GAAGAGTCCG GCGTAGCCCT GCGTGAAGTT
TTCCTTGATA GCAAACTGGT GTTCAGCAAA CCGCAGGACC GTCTGTTTGC CGGGCAGATT
GACCGTATGG ACCGCTTTGC GCTGCGTTAT CGCGCGCGTA AATATTCCAG CGAACAGTTC
CGTATGCCGT ACAGCGGCCT GCGCGGTCAG CGTACCAGCC TGATCCCGCA TCAGCTCAAC
ATCGCTCATG ATGTTGGTCG CCGCCACGCG CCGCGCGTCC TGCTGGCTGA CGAAGTGGGT
TTAGGGAAAA CCATTGAAGC CGGGATGATC CTGCATCAGC AACTGCTCTC TGGCGCTGCT
GAACGTGTGC TAATTATCGT CCCGGAAACC TTACAGCATC AGTGGCTGGT AGAAATGCTG
CGCCGTTTCA ACCTGCGCTT TGCGCTATTT GATGATGAGC GTTATGCCGA AGCTCAGCAC
GATGCTTACA ACCCGTTTGA CACCGAACAG CTGGTGATTT GCTCGCTGGA TTTTGCCCGT
CGTAGCAAAC AGCGCCTGGA ACATCTCTGT GAAGCCGAAT GGGACCTGCT GGTGGTCGAT
GAAGCGCATC ACCTGGTGTG GAGCGAAGAT GCGCCAAGCC GTGAATATCA GGCCATTGAA
CAACTGGCAG AGCACGTGCC GGGCGTTCTG CTGCTGACCG CGACCCCGGA ACAGCTGGGG
ATGGAAAGCC ACTTCGCCCG TCTGCGTCTG CTGGACCCGA ACCGTTTCCA CGATTTTGCG
CAGTTCGTTG AAGAGCAGAA AAATTATCGT CCGGTTGCGG ACGCCGTTGC CATGCTGCTG
GCAGGTAACA AACTGAGCAA TGACGAACTG AACATGCTCG GCGAGATGAT CGGCGAGCAG
GATATCGAGC CGCTGTTGCA GGCAGCAAAC AGCGACAGCG AAGATGCCCA GAGCGCCCGT
CAGGAGCTGG TTTCGATGCT GATGGATCGC CACGGCACCA GCCGCGTGCT GTTCCGTAAC
ACGCGTAACG GTGTGAAAGG ATTCCCGAAA CGCGAGCTGC ACACCATTAA GCTGCCGCTA
CCGACGCAGT ATCAGACGGC TATTAAAGTC TCCGGCATTA TGGGCGCACG TAAAAGTGCG
GAAGATCGTG CTCGCGATAT GCTCTACCCG GAGCGTATTT ATCAGGAATT TGAAGGTGAT
AACGCCACCT GGTGGAACTT CGATCCGCGC GTTGAGTGGC TGATGGGCTA CCTGACCAGC
CATCGCTCTC AGAAAGTGCT GGTGATCTGC GCCAAAGCTG CCACTGCGCT GCAACTGGAG
CAGGTACTGC GCGAACGTGA AGGTATTCGC GCTGCGGTGT TCCACGAAGG TATGTCGATT
ATCGAACGTG ACCGCGCTGC CGCCTGGTTT GCCGAAGAAG ACACCGGCGC ACAGGTACTG
CTGTGCTCAG AAATCGGTTC TGAAGGACGT AACTTCCAGT TCGCCAGCCA CATGGTGATG
TTTGACCTGC CATTCAACCC GGATCTACTG GAGCAGCGTA TTGGTCGTCT GGATCGTATC
GGCCAGGCGC ACGATATTCA GATCCATGTG CCTTATCTGG AGAAAACCGC TCAGTCGGTG
CTGGTGCGCT GGTATCACGA AGGTCTGGAT GCATTTGAGC ACACCTGCCC GACCGGACGC
ACTATTTACG ATAGCGTATA CAACGATCTG ATTAACTATC TGGCTTCACC GGATCAAACC
GAAGGCTTTG ACGATCTGAT CAAAAACTGC CGCGAGCAAC ATGAAGCGCT GAAAGCACAG
CTGGAACAGG GTCGTGACCG CCTGCTGGAA ATCCACTCCA ACGGTGGCGA AAAAGCCCAG
GCACTGGCAG AAAGCATTGA AGAGCAGGAT GACGATACCA ACCTGATCGC CTTCGCCATG
AACCTGTTCG ATATTATCGG TATCAATCAG GACGATCGCG GCGACAACAT GATCGTGCTG
ACGCCGTCCG ATCATATGCT GGTGCCGGAC TTCCCTGGTC TGTCGGAAGA TGGCATCACC
ATCACCTTTG ATCGTGAAGT GGCGCTGGCG CGTGAAGATG CACAGTTTAT TACCTGGGAG
CATCCGCTGA TCCGCAACGG TCTGGATCTG ATCCTTTCTG GCGATACCGG TAGCAGCACG
ATTTCACTGT TAAAAAACAA AGCGTTGCCG GTAGGTACGC TGTTGGTGGA ACTGATTTAT
GTGGTTGAAG CCCAGGCTCC GAAGCAGTTG CAGCTCAACC GCTTCCTGCC ACCGACGCCG
GTACGTATGC TGCTGGATAA AAACGGCAAC AACCTGGCGG CGCAGGTAGA GTTTGAAACC
TTTAACCGCC AGCTTAACGC GGTTAACCGT CACACCGGCA GCAAACTGGT TAACGCCGTG
CAGCAGGATG TTCACGCTAT CCTTCAACTG GGTGAAGCGC AGATCGAGAA ATCTGCCCGT
GCATTGATTG ATGCAGCGCG TAACGAAGCC GACGAAAAAC TGTCTGCCGA GCTGTCTCGT
CTGGAAGCTC TGCGTGCAGT GAACCCGAAC ATTCGTGACG ACGAACTGAC CGCCATTGAG
AGCAACCGTC AGCAGGTAAT GGAAAGCCTG GATCAGGCAG GTTGGCGTCT GGATGCCCTG
CGTTTGATCG TTGTAACGCA TCAGTAA
 
Protein sequence
MPFTLGQRWI SDTESELGLG TVVAVDARTV TLLFPSTGEN RLYARSDSPV TRVMFNPGDT 
ITSHDGWQMQ VEEVKEENGL LTYIGTRLDT EESGVALREV FLDSKLVFSK PQDRLFAGQI
DRMDRFALRY RARKYSSEQF RMPYSGLRGQ RTSLIPHQLN IAHDVGRRHA PRVLLADEVG
LGKTIEAGMI LHQQLLSGAA ERVLIIVPET LQHQWLVEML RRFNLRFALF DDERYAEAQH
DAYNPFDTEQ LVICSLDFAR RSKQRLEHLC EAEWDLLVVD EAHHLVWSED APSREYQAIE
QLAEHVPGVL LLTATPEQLG MESHFARLRL LDPNRFHDFA QFVEEQKNYR PVADAVAMLL
AGNKLSNDEL NMLGEMIGEQ DIEPLLQAAN SDSEDAQSAR QELVSMLMDR HGTSRVLFRN
TRNGVKGFPK RELHTIKLPL PTQYQTAIKV SGIMGARKSA EDRARDMLYP ERIYQEFEGD
NATWWNFDPR VEWLMGYLTS HRSQKVLVIC AKAATALQLE QVLREREGIR AAVFHEGMSI
IERDRAAAWF AEEDTGAQVL LCSEIGSEGR NFQFASHMVM FDLPFNPDLL EQRIGRLDRI
GQAHDIQIHV PYLEKTAQSV LVRWYHEGLD AFEHTCPTGR TIYDSVYNDL INYLASPDQT
EGFDDLIKNC REQHEALKAQ LEQGRDRLLE IHSNGGEKAQ ALAESIEEQD DDTNLIAFAM
NLFDIIGINQ DDRGDNMIVL TPSDHMLVPD FPGLSEDGIT ITFDREVALA REDAQFITWE
HPLIRNGLDL ILSGDTGSST ISLLKNKALP VGTLLVELIY VVEAQAPKQL QLNRFLPPTP
VRMLLDKNGN NLAAQVEFET FNRQLNAVNR HTGSKLVNAV QQDVHAILQL GEAQIEKSAR
ALIDAARNEA DEKLSAELSR LEALRAVNPN IRDDELTAIE SNRQQVMESL DQAGWRLDAL
RLIVVTHQ