Gene Information |
Locus tag | TM1040_0074 |
Symbol | |
ID | 4075971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 76421 |
End bp | 77863 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638005361 |
Product | aldehyde dehydrogenase |
Protein accession | YP_612069 |
Protein GI | 99079915 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
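The coordinate and length fields above can be checked against each other with a little arithmetic. The following is a minimal sketch, assuming the Start/End coordinates are 1-based and inclusive (the record does not state the convention) and that the gene length includes the terminal stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 76421
end_bp = 77863
gene_length_bp = 1443
protein_length_aa = 480

def span_length(start, end):
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1

def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS, minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1

coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # True True
```

Under these assumptions, 77863 - 76421 + 1 = 1443 bp, and 1443 / 3 = 481 codons, of which one is the stop codon, leaving 480 residues.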
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.68258 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGAGA AACGCGAATT CTACATCAAT GGCCAGTGGG TTGAACCCTC GGCGGCCAAT GACTGCGCCG TTATCGACCC CTCCACCGAA GAGCCCTGCA CGGTGATCTC GCTCGGCAGT CAGGCCGACA CCGATGCTGC CGTTGCCGCC GCCAAAGCCG CGCTGCCCGG CTGGATGGCG ACGCCACCAG CAGAGCGCAT CGCACTGGTG GAGAAGCTCG TCGAGATCTA CAACAGTCGC GCAGAAGATC TGGCGCAGGC CATGTCCAGC GAGATGGGCG CCCCCATCGA CATGTCGCGC TCCAGTCAGG TAGGCGCGGG CAGCTGGCAC CTTAACGGTT TTATCGAGGC GGCAAAGAAT TTCCAGTTCG AGCGCCCACT CGGCGATCAT GCCCCCAACG ATCGCATCAT CTATGAGGCC GTAGGCGTTG CCGCGCTGAT CACCCCGTGG AACTGGCCGA TGAACCAGAT CACGCTGAAG TTCGGCGCCG CTGCGATTGC GGGCTGCACC ATGGTGCTGA AACCCTCCGA GCAGAGCCCG CTCAATGCGA TGATCTTTGC CGAACTGGTG CACGAAGCCG GCTTCCCGCC CGGTGTTTTC AACCTCGTGA ACGGCGATGG CGCGGGCGTG GGCACGCAAC TGTCGTCGCA TCCGGATGTG GACATGGTAT CCTTTACCGG CTCGACCCGC GCGGGTACGG CGATCTCCAA GGCTGCGGCA GATACCCTGA AAAAGGTGCA TCTGGAGCTG GGTGGCAAAG GCGCCAACCT CGTCTTTGAA GACGCCGATG AAAAGGCCGT GAAACGCGGC GTGCTGCATA TGATGCAGAA CACCGGTCAG AGCTGCAACG CACCGTCGCG GATGCTGGTT CAAAAGAGTA TCTACGACCG CGTGGTTGAA GAGGCCGCTG CGGTTGCCAA CAAGGTCGAG GTGGGCCCCG CCTCGCAAGA AGGCCGCCAT ATCGGCCCCG TCGTCAACGA ACTGCAGTGG ACCAAGATCC AGGATCTGAT CCAGAAGGGC ATCGACGAGG GCGCGCGCCT TGTGGCCGGG GGCACCGGTC GCCCGGACGG TCTGAACAAG GGCTACTATG TGAAGCCCAC GGTGTTTGCA GATGTAAACA ACCAGATGAC CATCGCGCGC GAGGAAATCT TTGGCCCAGT GATGGCAATC ATCCCCTTCG AGACCGAAGA AGAAGCTGTC GAGATCGCCA ATGACACCCC CTATGGCCTG ACCAACTACG TGCAGACACA GGATGGTGCG CGCGCCAACC GTCTGGCGCG GGTGCTGCGC TCGGGCATGG TGGAAATGAA CGGTAAATCC CGCAGCGCCG GGTCGCCGTT TGGCGGCATG AAACAGTCCG GCAACGGCCG TGAAGGCGGC GTCTGGGGGC TTGAGGACTT TATGGAAGTC AAAGCCGTAG GGGGCTGGAC GCCCGACGCC TAA
|
Protein sequence | MLEKREFYIN GQWVEPSAAN DCAVIDPSTE EPCTVISLGS QADTDAAVAA AKAALPGWMA TPPAERIALV EKLVEIYNSR AEDLAQAMSS EMGAPIDMSR SSQVGAGSWH LNGFIEAAKN FQFERPLGDH APNDRIIYEA VGVAALITPW NWPMNQITLK FGAAAIAGCT MVLKPSEQSP LNAMIFAELV HEAGFPPGVF NLVNGDGAGV GTQLSSHPDV DMVSFTGSTR AGTAISKAAA DTLKKVHLEL GGKGANLVFE DADEKAVKRG VLHMMQNTGQ SCNAPSRMLV QKSIYDRVVE EAAAVANKVE VGPASQEGRH IGPVVNELQW TKIQDLIQKG IDEGARLVAG GTGRPDGLNK GYYVKPTVFA DVNNQMTIAR EEIFGPVMAI IPFETEEEAV EIANDTPYGL TNYVQTQDGA RANRLARVLR SGMVEMNGKS RSAGSPFGGM KQSGNGREGG VWGLEDFMEV KAVGGWTPDA
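The Gene sequence and Protein sequence fields can be cross-checked by translating one into the other. The sketch below is an illustration, not the database's own pipeline. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). Only the first 30 bases of the gene sequence are used, to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces that split the record's sequences into blocks of 10."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate coding-strand DNA, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the Gene sequence field above.
gene_prefix = "ATGCTTGAGA AACGCGAATT CTACATCAAT"
print(translate(gene_prefix))  # MLEKREFYIN
```

Applied to the full Gene sequence field, `translate` would be expected to reproduce the 480-residue Protein sequence, and `gc_fraction` can be compared with the reported GC content of 62%.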