Gene Information |
Locus tag | TM1040_3325 |
Symbol | |
ID | 4075730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 334135 |
End bp | 335655 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638004833 |
Product | aldehyde dehydrogenase |
Protein accession | YP_611559 |
Protein GI | 99078301 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
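The coordinates and lengths in this record are mutually consistent, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 334135
end_bp = 335655
reported_gene_length = 1521     # bp
reported_protein_length = 506   # aa

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A complete CDS is a whole number of codons.
codon_count, leftover = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, leftover, protein_length)
```

Under these assumptions the computed gene and protein lengths can be compared directly with the reported 1521 bp and 506 aa.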
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.476606 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCACC CTCACGGAAA CCACCTGATC GCCGGGCAAT GGGTCACAGG AAGCGACCAC TTTGCCTCAA GCCCCGCACA TGGCGAAAGC TACGCCTTCT CCGTCGGCCG CGTATCTGAC GTGGACGCCG CTGTCGAAGC CGCAGAAGAG GCGTTTGCGG CCTATAGCGC AACCTCGCGT GCCGAACGGG CGGCGTTTCT CAATGCCATC GCCGACGAGA TCGAGGCGCG CGCGGATGAC ATCACCGCAA TCGGCACGCA GGAAACTGGC TTGCCCGAAG CGCGCCTCCA AGGCGAGCGT GGACGCACCA CGGGCCAATT GCGCCTCTTT GCCGACCATA TTCTTGCGGG TGACTATCTC GATCGCCGCC ACGACGAAGC GCTTCCGGAC CGCGCGCCCC TGCCGCGCCC CGACCTGCGC ATGGTGCAAC GCCCGATCGG CCCGGTGGCT GTCTTTGGCG CATCCAACTT CCCCCTCGCC TTCTCGACCG CTGGCGGCGA CACAGCCGCC GCGCTTGCGG CGGGTTGTCC GGTGGTCGTG AAAGGCCATA GCGCCCACCC CGGCACCGGC GAGATCATCG CAGAAGCCGT TCTGGCCGCG ATCCTGCGCT GCAAGATGCC TTCTGGCGTT TTCAGTTTGA TCCAAGGCGG CAACCGCCAA GTCGGCGCGG CGCTTGTCCA GCACCCGCTG ATCAAAGCCG TCGGCTTCAC CGGGTCTCTT CGCGGCGGTC GGGCCCTGTT CGATCTTTGC GCGCAACGCC CTGAGCCTAT TCCCTTCTTC GGGGAACTGG GCTCCGTCAA CCCGATGTTT ATCTTTGATG CCGCACTGAA CGCGCGCGGC GAAGCCTTGG CCGAAGGCTG GGCCGGATCG CTGACGATGG GCGCGGGACA GTTCTGCACC AATCCCGGCA TCGCGGTTCT TCGCGCTGGC GCAGATGCAG ATCGCTTTGT CGATGCAGCC GCAGCAGCGC TTGCACAAAC CGCTGCGCAA ACCATGCTGA CCGACGGCAT CGCTCATGCC TACCGCGATG GTCAGCGTCG TATGGCAGGT GTCGAAGGCG TCCGCGAGGT GCTCGCCACC GAAAGCGACG CGCGCAACGC GACCCCATTT CTCTACATGA CGGACGCGCA AAACTGGCTA CAAAACGAGT CTCTCTCCGA AGAGGTCTTT GGCCCGCTTG GCCTTGTTGT CACCGTCGCG GACATGGACG AGATGCGCGC GGTTGCACGT TCGCTGCAGG GACAGTTGAC CTGCACCCTG CATCTGGACG ACGGCGATAG CGATACAGCC GCGACCTTTG TACCGATCCT CGAGCGCAAG GCCGGGCGGG TGTTGGCCAA TGGCTTCCCG ACTGGAGTTG AAGTCGCTGA TACAATGGTA CACGGCGGCC CCTACCCGGC TTCGACGAAC TTTGGCGCCA CCTCGGTTGG CACCTTGTCG ATCCGCCGCT TCCTGCGTCC GGTCTGTTAC CAGAACATCC CCGAAGCCCT GCTGCCTGCA GATTTGCGCG ACGCAGGGTA A
|
Protein sequence | MFHPHGNHLI AGQWVTGSDH FASSPAHGES YAFSVGRVSD VDAAVEAAEE AFAAYSATSR AERAAFLNAI ADEIEARADD ITAIGTQETG LPEARLQGER GRTTGQLRLF ADHILAGDYL DRRHDEALPD RAPLPRPDLR MVQRPIGPVA VFGASNFPLA FSTAGGDTAA ALAAGCPVVV KGHSAHPGTG EIIAEAVLAA ILRCKMPSGV FSLIQGGNRQ VGAALVQHPL IKAVGFTGSL RGGRALFDLC AQRPEPIPFF GELGSVNPMF IFDAALNARG EALAEGWAGS LTMGAGQFCT NPGIAVLRAG ADADRFVDAA AAALAQTAAQ TMLTDGIAHA YRDGQRRMAG VEGVREVLAT ESDARNATPF LYMTDAQNWL QNESLSEEVF GPLGLVVTVA DMDEMRAVAR SLQGQLTCTL HLDDGDSDTA ATFVPILERK AGRVLANGFP TGVEVADTMV HGGPYPASTN FGATSVGTLS IRRFLRPVCY QNIPEALLPA DLRDAG
|
| |
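The gene and protein sequences above can be related with a minimal translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it builds the codon table from the NCBI ordering of bases T, C, A, G. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and introduced only for illustration. Only the first and last blocks of the gene sequence are used here, to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, third fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces that separate 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()

def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on codons with ambiguous bases such as N.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First three and last three blocks of the gene sequence listed above.
first_blocks = "ATGTTCCACC CTCACGGAAA CCACCTGATC"
last_blocks = "GATTTGCGCG ACGCAGGGTA A"

n_terminus = translate(first_blocks)
c_terminus = translate(last_blocks)
print(n_terminus, c_terminus)
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison against the listed protein sequence and the reported GC content.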