Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_0333 |
Symbol | |
ID | 7316472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 374119 |
End bp | 375477 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643615217 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002512418 |
Protein GI | 220933519 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.909996 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGTCCG TGAATCCGGT CAATGGTGCC GTCCTGGGCC GTTTCGACAC CTGGGAGTGG CCCCGTGTGG ACAAGGCCCT GGGCCTGGCG GCCAGTGCCG CACCGGAATG GGCGCAGACC CCCCTGCCGG ACCGGGCTGA ACGGCTGCGC CGGGCGGCGC AGATCCTGCG CCAGCGCCGG GATCAATACG CCGGCATCAT CACCCAGGAG ATGGGCAAGC TTTCCAGCGA GGCCCGGGCC GAGATCGACA AGTGCGCCGC GGTCTGTGAT TTCTACGCCG GGGAAGGGCC GCGCTTCCTG GCCGACGCGA TGCTGGCATC CGACGCCAGC CGCTCCCTGA TCGCCTGGCA GCCCCTGGGC ACGGTGCTGG CGGTGATGCC CTGGAATTTC CCCTTCTGGC AGGTGTTCCG CTTCGCCGCG CCCGCGCTGA TGGCCGGCAA CACCGCCCTG CTCAAGCACG CCTCCAACGT GCCGCGCTGC GCGCTGGCCA TCGAGGAGGT GTTCAGGGAG GCGGGTTTCC CGGAGGGGGC GTTCCAGACC CTGATGATCC CGGCCTCCGC AGTGGAGGCG GTGATCCGGG ATCGTCGGGT GCACGCGGTG ACCCTGACCG GCAGCGAGCC CGCGGGCCGG GCCGTGGCCG CGGCCGCCGG TGCCGTGCTC AAGAAGGCGG TGCTGGAACT GGGCGGTTCC GATCCCTTCG TGGTGCTGGA GGATGCGGAC CTGGAACTGA CCGTGCAGCA GGCGGTGGCC TCCCGGTTCA TGAACGCGGG TCAGAGCTGC ATCGCCGCCA AGCGTTTCGT GGTGCTGGAG GGGGTTGCCG AGGCGTTCCT GTCGCGTTTT CGCGAGGCGG TTCGGGCGCT CAGGCCGGGA GACCCGAGCC ACGAGGCCAC CACCCTGGCA CCCATGGCAC GTACCGATCT GCGCGATGAA CTGCATCGGC AGGTGCAGCA GTCCATCGCC GCCGGTGCCG TGCCCCTGGA GGGCTGCGAG CCCGTGCCGG GCAAGGGGGC CTGGTACCGG CCCTCCGTCC TGGACCAGGT CGGCCCTGGC ATGCCCGCCT ACGACGAGGA ACTGTTCGGG CCGGTGGCCG CGATCCTCCG CGCCCGGGAC GAGGAAGATG CCCTGCGCAT CGCCAACGAC ACCCGTTTCG GCCTGGGCGG CAGCGTCTGG ACCCGGGATG TGGCGCGGGG CGAGGCGCTG GCACGCCGGC TGGCCTGCGG CTGTGCCTTC GTCAACGGAC TGGTGAAGAG CGACCCCAGG CTGCCCTTCG GCGGCATCAA GGATTCCGGC TATGGCCGTG AACTCTCCGC CCTGGGCATG CGCGAATTCC TGAACGCCAA GACGGTCTGG GTCCGCTAG
|
Protein sequence | MQSVNPVNGA VLGRFDTWEW PRVDKALGLA ASAAPEWAQT PLPDRAERLR RAAQILRQRR DQYAGIITQE MGKLSSEARA EIDKCAAVCD FYAGEGPRFL ADAMLASDAS RSLIAWQPLG TVLAVMPWNF PFWQVFRFAA PALMAGNTAL LKHASNVPRC ALAIEEVFRE AGFPEGAFQT LMIPASAVEA VIRDRRVHAV TLTGSEPAGR AVAAAAGAVL KKAVLELGGS DPFVVLEDAD LELTVQQAVA SRFMNAGQSC IAAKRFVVLE GVAEAFLSRF REAVRALRPG DPSHEATTLA PMARTDLRDE LHRQVQQSIA AGAVPLEGCE PVPGKGAWYR PSVLDQVGPG MPAYDEELFG PVAAILRARD EEDALRIAND TRFGLGGSVW TRDVARGEAL ARRLACGCAF VNGLVKSDPR LPFGGIKDSG YGRELSALGM REFLNAKTVW VR
|
| |