Gene B21_02394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02394 
SymbolhcaE 
ID8115730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2534047 
End bp2535408 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content53% 
IMG OID644848596 
Producthypothetical protein 
Protein accessionYP_003000169 
Protein GI251785865 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACAC CCTCAGATTT GAACATTTAC CAACTGATTG ACACCCAAAA CGGTCGGGTC 
ACTCCGCGTA TTTATACCGA CCCGGACATT TACCAACTGG AGCTTGAGCG TATTTTCGGT
CGTTGCTGGC TATTTCTCGC CCACGAAAGC CAGATCCCAA AACCCGGTGA TTTCTTTAAC
ACCTACATGG GAGAAGATGC GGTTGTCGTA GTGCGTCAGA AAGACGGCAG CATCAAGGCG
TTTCTCAACC AATGCCGCCA CCGGGCCATG CGTGTGAGTT ATGCAGATTG CGGCAACACT
CGCGCCTTTA CCTGCCCGTA TCACGGCTGG TCTTATGGCA TTAACGGCGA GTTGATCGAT
GTACCGCTGG AACCTCGCGC CTATCCACAA GGGTTGTGTA AATCCCACTG GGGGCTAAAC
GAAGTTCCTT GTGTGGAGAG TTATAAAGGG CTGATTTTTG GCAACTGGGA TACCAGCGCA
CCGGGCCTGC GTGATTACCT GGGTGACATT GCCTGGTATC TGGATGGCAT GCTGGATCGT
CGCGAAGGCG GCACCGAAAT TGTCGGCGGC GTACAAAAGT GGGTGATCAA CTGCAACTGG
AAATTCCCGG CAGAGCAGTT CGCCAGTGAC CAGTATCATG CTCTGTTCAG CCATGCTTCT
GCCGTTCAGG TATTAGGGGC GAAAGATGAT GGCAGCGATA AGCGCCTCGG TGATGGACAA
ACCGCCCGCC CGGTGTGGGA AACCGCCAAA GATGCGCTGC AATTTGGTCA GGACGGTCAC
GGTAGCGGTT TCTTCTTTAC TGAAAAACCG GATGCTAATG TCTGGGTCGA TGGCGCAGTT
TCAAGCTATT ACCGCGAAAC CTATGCCGAA GCAGAACAAC GTTTAGGTGA AGTTCGCGCC
CTGCGCCTGG CGGGTCATAA CAATATTTTC CCCACGCTTT CATGGCTCAA CGGCACTGCC
ACGCTCCGCG TCTGGCATCC GCGCGGCCCT GATCAAGTTG AAGTGTGGGC GTTCTGTATT
ACTGACAAAG CCGCCTCCGA TGAAGTTAAA GCCGCTTTTG AAAACAGCGC CACTCGTGCT
TTTGGTCCTG CTGGTTTTCT CGAGCAGGAT GACTCGGAGA ACTGGTGTGA AATCCAGAAA
TTGCTTAAAG GCCACCGCGC CCGCAACAGC AAACTGTGTC TGGAAATGGG GCTTGGTCAG
GAAAAGCGTC GCGACGACGG CATTCCTGGC ATTACTAACT ATATTTTCTC AGAAACTGCC
GCTCGCGGAA TGTACCAACG TTGGGCCGAT CTCCTGAGTA GCGAAAGCTG GCAAGAAGTG
CTCGATAAAA CCGCCGCTTA CCAGCAGGAG GTGATGAAAT GA
 
Protein sequence
MTTPSDLNIY QLIDTQNGRV TPRIYTDPDI YQLELERIFG RCWLFLAHES QIPKPGDFFN 
TYMGEDAVVV VRQKDGSIKA FLNQCRHRAM RVSYADCGNT RAFTCPYHGW SYGINGELID
VPLEPRAYPQ GLCKSHWGLN EVPCVESYKG LIFGNWDTSA PGLRDYLGDI AWYLDGMLDR
REGGTEIVGG VQKWVINCNW KFPAEQFASD QYHALFSHAS AVQVLGAKDD GSDKRLGDGQ
TARPVWETAK DALQFGQDGH GSGFFFTEKP DANVWVDGAV SSYYRETYAE AEQRLGEVRA
LRLAGHNNIF PTLSWLNGTA TLRVWHPRGP DQVEVWAFCI TDKAASDEVK AAFENSATRA
FGPAGFLEQD DSENWCEIQK LLKGHRARNS KLCLEMGLGQ EKRRDDGIPG ITNYIFSETA
ARGMYQRWAD LLSSESWQEV LDKTAAYQQE VMK