Gene Information |
Locus tag | Hneap_2202 |
Symbol | |
ID | 8535366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 2368512 |
End bp | 2369882 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 646384583 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_003264065 |
Protein GI | 261856782 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
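The coordinate and length fields in this record are related by simple arithmetic. Assuming 1-based inclusive coordinates (an assumption that the values listed here satisfy), the gene length is End bp minus Start bp plus one, and for an unspliced CDS the protein length is the codon count minus the terminal stop codon. The following is a minimal Python sketch of that consistency check; the function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return cds_length_bp // 3 - 1


length = gene_length(2368512, 2369882)
print(length)                  # 1371
print(protein_length(length))  # 456
```

Both results agree with the Gene Length and Protein Length fields listed in the record.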
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGTTTTC AAGTAATCAA CCCAACCACC GGACAACCGC TGCATTCATT CCCGTTCTTG AACGCCACCG AGCGTGATGC CGCCATATCC GCCTCAGCCG AGGCGTACAC ACACTGGCGC AAAACCAGCA TGGCAACGCG TGCCGACTTG CTGCGCCGCG TCGCTCAAAT CATGCGCGAA GAGGTCGAAT CGCTGGCCCT GCCCATGGTC GAAGAAATGG GCAAACCACT CCGCGAGGCC CGAGGTGAAG TGCTTAAATC GGCCTGGTGT GCCGAGCATT ACGCCAGCCA CGCCGAAGGC TATCTGGCGC CCGAGATGGT GCCTTCAGAC GCGCTGATCA GCTACGTGCA ATACTTGCCG ATTGGCCCCG TGCTGGGCAT TTTGCCGTGG AATGCACCGT TCTGGCTGGC CTTCCGGTTT GCCGCGCCCG CCTTGATGGC AGGCAACACC TGCCTGATGA AACACGACGG CCATGTGCCT GCCTGCGCCG CTGCCATCGC GGATGTTTTC ACGCGTGCCG GTGCGCCCGA TGGTGTGTTT CAAAACCTGC CGCTCGATAC GCCGGATATC GCCGCCGCCA TCAATAATGA ACAGGTGCGC GCGGTCTCGT TCACGGGTTC CGACCGGGCC GGTTCCATCG TCGCAGCCAC CGCCGCCGCG CAGATCAAAC CCGCCGTGCT CGAACTCGGT GGCTCTGATC CTGTGATCGT ATTGGCCGAT GCCGACCTCG ACAAAGCCGC CGACACCATC GTGCTCTCGC GCATCATCAA CGCCGGGCAA TCCTGCATTG CCGCCAAGCG GATCATCATC GAGCAATCGG TGTACGAATC GTTTCTGGAC AAACTCAAAA CCCGTTTTGA GCGGTTAAAA CTCGGCGATC CGCATCTGGA AACCACTGAT GTCGGCCCCA TTGCCCGCAC CGATTTACGA GATAACCTGC ACCGCCAAGT CACCGCATCC ATTGAAGCGG GCGCGCGATG TTTGCTGGGC GGCACACTAC CCGAAGGCGA GGGTTTCTTT TATCCGGTCA CCCTGCTGGC CGATGTGGGC CCGGATATGG TGGTCAGCTG CGAGGAAACC TTCGGCCCGG TTGCCGTCGC CATGGCCGCA AAAGATGCCG ATGAGGCTTT GCGCATCGCC AACGACACGC CCTATGGCCT GGGCGCGGCC ATCTGGACAG CCAACACAGC GGCGGCAATT GCCATGGCCG CCGACATCGA ATCGGGCCAA GTTGCGATCA ACGGCATCGT TAAAACAGAC CCTCGGCTGC CGAGCGGCGG CACCAAACGT TCCGGCTATG GCCGCGAGCT CGGCGCGCAC GGCATCAAGA TGTTCGTGAA TGCACAGCAA GTTTGGGTGG GGCGGAGTTA A
Protein sequence | MSFQVINPTT GQPLHSFPFL NATERDAAIS ASAEAYTHWR KTSMATRADL LRRVAQIMRE EVESLALPMV EEMGKPLREA RGEVLKSAWC AEHYASHAEG YLAPEMVPSD ALISYVQYLP IGPVLGILPW NAPFWLAFRF AAPALMAGNT CLMKHDGHVP ACAAAIADVF TRAGAPDGVF QNLPLDTPDI AAAINNEQVR AVSFTGSDRA GSIVAATAAA QIKPAVLELG GSDPVIVLAD ADLDKAADTI VLSRIINAGQ SCIAAKRIII EQSVYESFLD KLKTRFERLK LGDPHLETTD VGPIARTDLR DNLHRQVTAS IEAGARCLLG GTLPEGEGFF YPVTLLADVG PDMVVSCEET FGPVAVAMAA KDADEALRIA NDTPYGLGAA IWTANTAAAI AMAADIESGQ VAINGIVKTD PRLPSGGTKR SGYGRELGAH GIKMFVNAQQ VWVGRS
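The sequences in this record are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any length or composition check. The sketch below shows one way to do that in Python and to compute a GC fraction that can be compared with the GC content field. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence rather than the full record.

```python
def join_blocks(displayed: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, copied as displayed.
first_blocks = "ATGAGTTTTC AAGTAATCAA CCCAACCACC"
seq = join_blocks(first_blocks)
print(len(seq))          # 30
print(gc_fraction(seq))  # 0.4
```

Applying the same two helpers to the complete gene sequence gives values that can be checked against the Gene Length and GC content fields.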