Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4003 |
Symbol | aldB |
ID | 6271822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3731636 |
End bp | 3733174 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641727849 |
Product | aldehyde dehydrogenase B |
Protein accession | YP_001882281 |
Protein GI | 187732227 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAATA ATCCCCCTTC AGCACAGATT AAGCCCGGCG AGTATGGTTT CCCCCTCAAG TTAAAAGCCC GCTATGACAA CTTTATTGGC GGCGAATGGG TAGCCCCTGC CGATGGTGAG TATTACCAGA ACCTGACGCC GGTGACTGGG CAGCTGCTGT GCGAAGTGGC GTCTTCGGGC AAACGAGACA TCGATCTGGC GCTAGATGCT GCGCACAAAG TGAAAGATAA ATGGGCGCAC ACCTCGGTGC AGGATCGCGC GGCGATTCTG TTTAAGATTG CCGATCGAAT GGAACAAAAC CTCGAGCTGT TAGCGACAGC TGAAACCTGG GATAACGGCA AACCCATTCG CGAAACCAGT GCTGCCGACG TGCCGCTGGC GATTGATCAT TTTCGCTATT TCGCCTCGTG TATTCGGGCG CAGGAAGGTG GGATCAGTGA AGTTGATAGC GAAACCGTGG CCTATCATTT CCATGAACCG TTAGGCGTGG TGGGGCAGAT TATCCCGTGG AACTTCCCGC TGCTGATGGC GAGCTGGAAA ATGGCTCCCG CGCTGGCGGC GGGCAACTGT GTGGTGCTGA AACCCGCACG TCTTACCCCG CTTTCTGTAC TGCTGCTAAT GGAAATTGTC GGTGATTTAC TGCCGCCGGG CGTGGTGAAC GTGGTCAATG GCGCAGGTGG GGAAATTGGC GAATATCTGG CGACCTCGAA ACGCATCGCC AAAGTGGCGT TTACCGGCTC AACGGAAGTG GGCCAACAAA TTATGCAATA CGCAACGCAA AACATTATTC CGGTGACGCT GGAGTTGGGC GGTAAGTCGC CAAATATCTT CTTTGCTGAT GTGATGGATG AAGAAGATGC CTTTTTCGAT AAAGCGCTGG AAGGCTTTGC ACTGTTTGCC TTTAACCAGG GCGAAGTTTG CACCTGTCCG AGTCGTGCTT TAGTGCAGGA ATCTATCTAC GAACGCTTTA TGGAACGCGC CATCCGCCGT GTCGAAAGCA TTCGTAGCGG TAACCCGCTC GACAGCGTGA CGCAAATGGG CGCGCAGGTT TCTCACGGGC AACTGGAAAC CATCCTCAAC TACATTGATA TCGGTAAAAA AGAGGGCGCT GACGTGCTCA CAGGCGGGCG GCGCAAGCTG CTGGAAGGTG AACTGAAAGA CGGCTACTAC CTCGAACCGA CGATTCTGTT TGGTCAGAAC AATATGCGGG TGTTCCAGGA GGAGATTTTT GGCCCGGTGC TGGCGGTGAC CACCTTCAAA ACGATGGAAG AAGCGCTGGA GCTGGCGAAC GATACGCAAT ATGGCCTAGG CGCGGGCGTC TGGAGCCGCA ACGGTAATCT GGCCTATAAG ATGGGGCGCG GCATACAGGC TGGGCGCGTG TGGACCAACT GTTATCACGC TTACCCGGCA CATGCGGCGT TTGGTGGCTA CAAACAATCA GGTATCGGTC GCGAAACCCA CAAGATGATG CTGGAGCATT ACCAGCAAAC CAAGTGCCTG CTGGTGAGCT ACTCGGATAA ACCGTTGGGG CTGTTCTGA
|
Protein sequence | MTNNPPSAQI KPGEYGFPLK LKARYDNFIG GEWVAPADGE YYQNLTPVTG QLLCEVASSG KRDIDLALDA AHKVKDKWAH TSVQDRAAIL FKIADRMEQN LELLATAETW DNGKPIRETS AADVPLAIDH FRYFASCIRA QEGGISEVDS ETVAYHFHEP LGVVGQIIPW NFPLLMASWK MAPALAAGNC VVLKPARLTP LSVLLLMEIV GDLLPPGVVN VVNGAGGEIG EYLATSKRIA KVAFTGSTEV GQQIMQYATQ NIIPVTLELG GKSPNIFFAD VMDEEDAFFD KALEGFALFA FNQGEVCTCP SRALVQESIY ERFMERAIRR VESIRSGNPL DSVTQMGAQV SHGQLETILN YIDIGKKEGA DVLTGGRRKL LEGELKDGYY LEPTILFGQN NMRVFQEEIF GPVLAVTTFK TMEEALELAN DTQYGLGAGV WSRNGNLAYK MGRGIQAGRV WTNCYHAYPA HAAFGGYKQS GIGRETHKMM LEHYQQTKCL LVSYSDKPLG LF
|
| |