Gene B21_03394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03394 
SymbolaldB 
ID8112595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3619438 
End bp3620976 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content55% 
IMG OID644849567 
Producthypothetical protein 
Protein accessionYP_003001140 
Protein GI251786836 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAATA ATCCCCCTTC AGCACAGATT AAGCCCGGCG AGTATGGTTT CCCCCTTAAG 
TTAAAAACCC GCTATGACAA CTTTATTGGC GGCGAATGGG TAGCCCCTGC CGACGGCGAG
TATTACCAGA ATCTGACGCC GGTGACCGGG CAGCTGCTGT GCGAAGTGGC GTCTTCGGGC
AAACGAGACA TCGATCTGGC GCTGGATGCT GCGCACAAAG TGAAAGATAA ATGGGCGCAC
ACCTCGGTGC AGGATCGCGC GGCGATTCTG TTTAAGATTG CCGATCGAAT GGAACAAAAC
CTCGAGCTGT TAGCGACAGC TGAAACCTGG GATAACGGCA AACCCATTCG CGAAACCAGT
GCTGCGGATG TACCGCTGGC GATTGACCAT TTCCGCTATT TCGCCTCGTG TATTCGGGCG
CAGGAAGGTG GGATCAGTGA AGTTGATAGC GAAACCGTGG CCTATCATTT CCATGAACCG
TTAGGCGTGG TGGGGCAGAT TATCCCGTGG AACTTCCCGC TGCTGATGGC GAGCTGGAAA
ATGGCTCCCG CGCTGGCGGC GGGCAACTGT GTGGTGCTGA AACCCGCACG TCTTACCCCG
CTTTCTGTAC TGCTGCTAAT GGAAATTGTC GGTGATTTAC TGCCGCCGGG CGTGGTGAAC
GTGGTCAATG GCGCAGGTGG GGTAATTGGC GAATATCTGG CGACCTCGAA ACGCATCGCC
AAAGTGGCGT TTACCGGCTC AACGGAAGTG GGCCAACAAA TTATGCAATA CGCAACGCAA
AACATTATTC CGGTGACGCT GGAGTTGGGC GGTAAGTCGC CAAATATCTT CTTTGCTGAT
GTGATGGATG AAGAAGATGC CTTTTTCGAT AAAGCGCTGG AAGGCTTTGC ACTGTTTGCC
TTTAACCAGG GCGAAGTTTG CACCTGTCCG AGTCGTGCTT TAGTGCAGGA ATCTATCTAC
GAACGCTTTA TGGAACGCGC CATCCGCCGT GTCGAAAGCA TTCGTAGCGG TAACCCGCTC
GACAGCGTGA CGCAAATGGG CGCGCAGGTT TCTCACGGGC AACTGGAAAC CATCCTCAAC
TACATTGATA TCGGTAAAAA AGAGGGCGCT GACGTGCTCA CAGGCGGGCG GCGCAAGCTG
CTGGAAGGTG AACTGAAAGA CGGCTACTAC CTCGAACCGA CGATTCTGTT TGGTCAGAAC
AATATGCGGG TGTTCCAGGA GGAGATTTTT GGCCCGGTGC TGGCGGTGAC CACCTTCAAA
ACGATGGAAG AAGCGCTGGA GCTGGCGAAC GATACGCAAT ATGGCCTGGG CGCGGGCGTC
TGGAGCCGCA ACGGTAATCT GGCCTATAAG ATGGGGCGCG GCATACAGGC TGGGCGCGTG
TGGACCAACT GTTATCACGC TTACCCGGCA CATGCGGCGT TTGGTGGCTA CAAACAATCA
GGTATCGGTC GCGAAACCCA CAAGATGATG CTGGAGCATT ACCAGCAAAC CAAGTGCCTG
CTGGTGAGCT ACTCGGATAA ACCGTTGGGG CTGTTCTGA
 
Protein sequence
MTNNPPSAQI KPGEYGFPLK LKTRYDNFIG GEWVAPADGE YYQNLTPVTG QLLCEVASSG 
KRDIDLALDA AHKVKDKWAH TSVQDRAAIL FKIADRMEQN LELLATAETW DNGKPIRETS
AADVPLAIDH FRYFASCIRA QEGGISEVDS ETVAYHFHEP LGVVGQIIPW NFPLLMASWK
MAPALAAGNC VVLKPARLTP LSVLLLMEIV GDLLPPGVVN VVNGAGGVIG EYLATSKRIA
KVAFTGSTEV GQQIMQYATQ NIIPVTLELG GKSPNIFFAD VMDEEDAFFD KALEGFALFA
FNQGEVCTCP SRALVQESIY ERFMERAIRR VESIRSGNPL DSVTQMGAQV SHGQLETILN
YIDIGKKEGA DVLTGGRRKL LEGELKDGYY LEPTILFGQN NMRVFQEEIF GPVLAVTTFK
TMEEALELAN DTQYGLGAGV WSRNGNLAYK MGRGIQAGRV WTNCYHAYPA HAAFGGYKQS
GIGRETHKMM LEHYQQTKCL LVSYSDKPLG LF