Gene SbBS512_E4003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4003 
SymbolaldB 
ID6271822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3731636 
End bp3733174 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content55% 
IMG OID641727849 
Productaldehyde dehydrogenase B 
Protein accessionYP_001882281 
Protein GI187732227 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAATA ATCCCCCTTC AGCACAGATT AAGCCCGGCG AGTATGGTTT CCCCCTCAAG 
TTAAAAGCCC GCTATGACAA CTTTATTGGC GGCGAATGGG TAGCCCCTGC CGATGGTGAG
TATTACCAGA ACCTGACGCC GGTGACTGGG CAGCTGCTGT GCGAAGTGGC GTCTTCGGGC
AAACGAGACA TCGATCTGGC GCTAGATGCT GCGCACAAAG TGAAAGATAA ATGGGCGCAC
ACCTCGGTGC AGGATCGCGC GGCGATTCTG TTTAAGATTG CCGATCGAAT GGAACAAAAC
CTCGAGCTGT TAGCGACAGC TGAAACCTGG GATAACGGCA AACCCATTCG CGAAACCAGT
GCTGCCGACG TGCCGCTGGC GATTGATCAT TTTCGCTATT TCGCCTCGTG TATTCGGGCG
CAGGAAGGTG GGATCAGTGA AGTTGATAGC GAAACCGTGG CCTATCATTT CCATGAACCG
TTAGGCGTGG TGGGGCAGAT TATCCCGTGG AACTTCCCGC TGCTGATGGC GAGCTGGAAA
ATGGCTCCCG CGCTGGCGGC GGGCAACTGT GTGGTGCTGA AACCCGCACG TCTTACCCCG
CTTTCTGTAC TGCTGCTAAT GGAAATTGTC GGTGATTTAC TGCCGCCGGG CGTGGTGAAC
GTGGTCAATG GCGCAGGTGG GGAAATTGGC GAATATCTGG CGACCTCGAA ACGCATCGCC
AAAGTGGCGT TTACCGGCTC AACGGAAGTG GGCCAACAAA TTATGCAATA CGCAACGCAA
AACATTATTC CGGTGACGCT GGAGTTGGGC GGTAAGTCGC CAAATATCTT CTTTGCTGAT
GTGATGGATG AAGAAGATGC CTTTTTCGAT AAAGCGCTGG AAGGCTTTGC ACTGTTTGCC
TTTAACCAGG GCGAAGTTTG CACCTGTCCG AGTCGTGCTT TAGTGCAGGA ATCTATCTAC
GAACGCTTTA TGGAACGCGC CATCCGCCGT GTCGAAAGCA TTCGTAGCGG TAACCCGCTC
GACAGCGTGA CGCAAATGGG CGCGCAGGTT TCTCACGGGC AACTGGAAAC CATCCTCAAC
TACATTGATA TCGGTAAAAA AGAGGGCGCT GACGTGCTCA CAGGCGGGCG GCGCAAGCTG
CTGGAAGGTG AACTGAAAGA CGGCTACTAC CTCGAACCGA CGATTCTGTT TGGTCAGAAC
AATATGCGGG TGTTCCAGGA GGAGATTTTT GGCCCGGTGC TGGCGGTGAC CACCTTCAAA
ACGATGGAAG AAGCGCTGGA GCTGGCGAAC GATACGCAAT ATGGCCTAGG CGCGGGCGTC
TGGAGCCGCA ACGGTAATCT GGCCTATAAG ATGGGGCGCG GCATACAGGC TGGGCGCGTG
TGGACCAACT GTTATCACGC TTACCCGGCA CATGCGGCGT TTGGTGGCTA CAAACAATCA
GGTATCGGTC GCGAAACCCA CAAGATGATG CTGGAGCATT ACCAGCAAAC CAAGTGCCTG
CTGGTGAGCT ACTCGGATAA ACCGTTGGGG CTGTTCTGA
 
Protein sequence
MTNNPPSAQI KPGEYGFPLK LKARYDNFIG GEWVAPADGE YYQNLTPVTG QLLCEVASSG 
KRDIDLALDA AHKVKDKWAH TSVQDRAAIL FKIADRMEQN LELLATAETW DNGKPIRETS
AADVPLAIDH FRYFASCIRA QEGGISEVDS ETVAYHFHEP LGVVGQIIPW NFPLLMASWK
MAPALAAGNC VVLKPARLTP LSVLLLMEIV GDLLPPGVVN VVNGAGGEIG EYLATSKRIA
KVAFTGSTEV GQQIMQYATQ NIIPVTLELG GKSPNIFFAD VMDEEDAFFD KALEGFALFA
FNQGEVCTCP SRALVQESIY ERFMERAIRR VESIRSGNPL DSVTQMGAQV SHGQLETILN
YIDIGKKEGA DVLTGGRRKL LEGELKDGYY LEPTILFGQN NMRVFQEEIF GPVLAVTTFK
TMEEALELAN DTQYGLGAGV WSRNGNLAYK MGRGIQAGRV WTNCYHAYPA HAAFGGYKQS
GIGRETHKMM LEHYQQTKCL LVSYSDKPLG LF