Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_00060 |
Symbol | hepA |
ID | 8113879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 63163 |
End bp | 66069 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644846354 |
Product | hypothetical protein |
Protein accession | YP_002997927 |
Protein GI | 251783623 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.326785 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTTTA CACTTGGTCA ACGCTGGATC AGCGATACAG AAAGCGAATT GGGACTTGGA ACCGTTGTCG CGGTGGATGC GCGAACTGTC ACTTTACTTT TCCCATCTAC TGGTGAAAAC CGTCTGTACG CACGCAGTGA TTCCCCCGTG ACCCGCGTGA TGTTCAACCC TGGTGATACC ATTACCAGCC ATGACGGCTG GCAGATGCAA GTCGAAGAAG TAAAAGAAGA AAATGGCTTG CTGACCTATA TCGGTACTCG CCTGGATACT GAAGAGTCCG GCGTAGCCCT GCGTGAAGTT TTCCTTGATA GCAAACTGGT GTTCAGCAAA CCGCAGGACC GTCTGTTTGC CGGGCAGATT GACCGTATGG ACCGCTTTGC GCTGCGTTAT CGCGCGCGTA AATATTCCAG CGAACAGTTC CGTATGCCGT ACAGCGGCCT GCGCGGTCAG CGTACCAGCC TGATCCCGCA TCAGCTCAAC ATCGCTCATG ATGTTGGTCG CCGCCACGCG CCGCGCGTCC TGCTGGCTGA CGAAGTGGGT TTAGGGAAAA CCATTGAAGC CGGGATGATC CTGCATCAGC AACTGCTCTC TGGCGCTGCT GAACGTGTGC TAATTATCGT CCCGGAAACC TTACAGCATC AGTGGCTGGT AGAAATGCTG CGCCGTTTCA ACCTGCGCTT TGCGCTATTT GATGATGAGC GTTATGCCGA AGCTCAGCAC GATGCTTACA ACCCGTTTGA CACCGAACAG CTGGTGATTT GCTCGCTGGA TTTTGCCCGT CGTAGCAAAC AGCGCCTGGA ACATCTCTGT GAAGCCGAAT GGGACCTGCT GGTGGTCGAT GAAGCGCATC ACCTGGTGTG GAGCGAAGAT GCGCCAAGCC GTGAATATCA GGCCATTGAA CAACTGGCAG AGCACGTGCC GGGCGTTCTG CTGCTGACCG CGACCCCGGA ACAGCTGGGG ATGGAAAGCC ACTTCGCCCG TCTGCGTCTG CTGGACCCGA ACCGTTTCCA CGATTTTGCG CAGTTCGTTG AAGAGCAGAA AAATTATCGT CCGGTTGCGG ACGCCGTTGC CATGCTGCTG GCAGGTAACA AACTGAGCAA TGACGAACTG AACATGCTCG GCGAGATGAT CGGCGAGCAG GATATCGAGC CGCTGTTGCA GGCAGCAAAC AGCGACAGCG AAGATGCCCA GAGCGCCCGT CAGGAGCTGG TTTCGATGCT GATGGATCGC CACGGCACCA GCCGCGTGCT GTTCCGTAAC ACGCGTAACG GTGTGAAAGG ATTCCCGAAA CGCGAGCTGC ACACCATTAA GCTGCCGCTA CCGACGCAGT ATCAGACGGC TATTAAAGTC TCCGGCATTA TGGGCGCACG TAAAAGTGCG GAAGATCGTG CTCGCGATAT GCTCTACCCG GAGCGTATTT ATCAGGAATT TGAAGGTGAT AACGCCACCT GGTGGAACTT CGATCCGCGC GTTGAGTGGC TGATGGGCTA CCTGACCAGC CATCGCTCTC AGAAAGTGCT GGTGATCTGC GCCAAAGCTG CCACTGCGCT GCAACTGGAG CAGGTACTGC GCGAACGTGA AGGTATTCGC GCTGCGGTGT TCCACGAAGG TATGTCGATT ATCGAACGTG ACCGCGCTGC CGCCTGGTTT GCCGAAGAAG ACACCGGCGC ACAGGTACTG CTGTGCTCAG AAATCGGTTC TGAAGGACGT AACTTCCAGT TCGCCAGCCA CATGGTGATG TTTGACCTGC CATTCAACCC GGATCTACTG GAGCAGCGTA TTGGTCGTCT GGATCGTATC GGCCAGGCGC ACGATATTCA GATCCATGTG CCTTATCTGG AGAAAACCGC TCAGTCGGTG CTGGTGCGCT GGTATCACGA AGGTCTGGAT GCATTTGAGC ACACCTGCCC GACCGGACGC ACTATTTACG ATAGCGTATA CAACGATCTG ATTAACTATC TGGCTTCACC GGATCAAACC GAAGGCTTTG ACGATCTGAT CAAAAACTGC CGCGAGCAAC ATGAAGCGCT GAAAGCACAG CTGGAACAGG GTCGTGACCG CCTGCTGGAA ATCCACTCCA ACGGTGGCGA AAAAGCCCAG GCACTGGCAG AAAGCATTGA AGAGCAGGAT GACGATACCA ACCTGATCGC CTTCGCCATG AACCTGTTCG ATATTATCGG TATCAATCAG GACGATCGCG GCGACAACAT GATCGTGCTG ACGCCGTCCG ATCATATGCT GGTGCCGGAC TTCCCTGGTC TGTCGGAAGA TGGCATCACC ATCACCTTTG ATCGTGAAGT GGCGCTGGCG CGTGAAGATG CACAGTTTAT TACCTGGGAG CATCCGCTGA TCCGCAACGG TCTGGATCTG ATCCTTTCTG GCGATACCGG TAGCAGCACG ATTTCACTGT TAAAAAACAA AGCGTTGCCG GTAGGTACGC TGTTGGTGGA ACTGATTTAT GTGGTTGAAG CCCAGGCTCC GAAGCAGTTG CAGCTCAACC GCTTCCTGCC ACCGACGCCG GTACGTATGC TGCTGGATAA AAACGGCAAC AACCTGGCGG CGCAGGTAGA GTTTGAAACC TTTAACCGCC AGCTTAACGC GGTTAACCGT CACACCGGCA GCAAACTGGT TAACGCCGTG CAGCAGGATG TTCACGCTAT CCTTCAACTG GGTGAAGCGC AGATCGAGAA ATCTGCCCGT GCATTGATTG ATGCAGCGCG TAACGAAGCC GACGAAAAAC TGTCTGCCGA GCTGTCTCGT CTGGAAGCTC TGCGTGCAGT GAACCCGAAC ATTCGTGACG ACGAACTGAC CGCCATTGAG AGCAACCGTC AGCAGGTAAT GGAAAGCCTG GATCAGGCAG GTTGGCGTCT GGATGCCCTG CGTTTGATCG TTGTAACGCA TCAGTAA
|
Protein sequence | MPFTLGQRWI SDTESELGLG TVVAVDARTV TLLFPSTGEN RLYARSDSPV TRVMFNPGDT ITSHDGWQMQ VEEVKEENGL LTYIGTRLDT EESGVALREV FLDSKLVFSK PQDRLFAGQI DRMDRFALRY RARKYSSEQF RMPYSGLRGQ RTSLIPHQLN IAHDVGRRHA PRVLLADEVG LGKTIEAGMI LHQQLLSGAA ERVLIIVPET LQHQWLVEML RRFNLRFALF DDERYAEAQH DAYNPFDTEQ LVICSLDFAR RSKQRLEHLC EAEWDLLVVD EAHHLVWSED APSREYQAIE QLAEHVPGVL LLTATPEQLG MESHFARLRL LDPNRFHDFA QFVEEQKNYR PVADAVAMLL AGNKLSNDEL NMLGEMIGEQ DIEPLLQAAN SDSEDAQSAR QELVSMLMDR HGTSRVLFRN TRNGVKGFPK RELHTIKLPL PTQYQTAIKV SGIMGARKSA EDRARDMLYP ERIYQEFEGD NATWWNFDPR VEWLMGYLTS HRSQKVLVIC AKAATALQLE QVLREREGIR AAVFHEGMSI IERDRAAAWF AEEDTGAQVL LCSEIGSEGR NFQFASHMVM FDLPFNPDLL EQRIGRLDRI GQAHDIQIHV PYLEKTAQSV LVRWYHEGLD AFEHTCPTGR TIYDSVYNDL INYLASPDQT EGFDDLIKNC REQHEALKAQ LEQGRDRLLE IHSNGGEKAQ ALAESIEEQD DDTNLIAFAM NLFDIIGINQ DDRGDNMIVL TPSDHMLVPD FPGLSEDGIT ITFDREVALA REDAQFITWE HPLIRNGLDL ILSGDTGSST ISLLKNKALP VGTLLVELIY VVEAQAPKQL QLNRFLPPTP VRMLLDKNGN NLAAQVEFET FNRQLNAVNR HTGSKLVNAV QQDVHAILQL GEAQIEKSAR ALIDAARNEA DEKLSAELSR LEALRAVNPN IRDDELTAIE SNRQQVMESL DQAGWRLDAL RLIVVTHQ
|
| |