Gene Information |
Locus tag | TM1040_3621 |
Symbol | |
ID | 4075048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 677733 |
End bp | 679253 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638005140 |
Product | aldehyde dehydrogenase |
Protein accession | YP_611850 |
Protein GI | 99078592 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
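Note: the coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and that the CDS length includes the terminal stop codon; the record does not state either convention. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length includes the stop codon unless told otherwise.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(677733, 679253)
print(length)                     # 1521, matching "Gene Length"
print(protein_length_aa(length))  # 506, matching "Protein Length"
```

The strand field does not affect this check, since the length is the same on either strand.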
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.096261 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGAAA TGACGCAGGT TGCGGGCGAA TATACGTCGC CCTTCAAGGC GCGCTATGAC AATTTCATTG GCGGTAAATT CGTCGCCCCG GTCAAAGGGC AGTATTTCGA CAACATTACC CCGATCACTG GCGCCAAGGT CTGTGAGGTC GCCCGCTCCA GCGCCGAGGA CGTGGAGCTT GCGCTCGATG CCGCTCACGC GGCCAAGGAC AGCTGGGGCA AGACCTCTCC TGCGGAACGC GCGAATACCT TGTTGAAAAT CGCGGACCGC ATCGAGGAAA ACCTCGAGCT GATCGCCACG GCGGAAACAT GGGACAATGG CAAACCGATC CGAGAGACCA TGGCAGCCGA TATCCCCCTG TCTGTCGATC ACTTTCGCTA TTTCGCGGGT GTTCTGCGTG CCCAAGAGGG CAGCATGTCC GAAATCGATC ACGACACGGT CGCCTACCAC TTCCACGAGC CGTTGGGCGT AGTGGGGCAG ATCATCCCGT GGAACTTCTC GGTTCTGATG GCCGCCTGGA AGCTGGCCCC CGCGCTGGCG GCCGGCAACT GCATCGTACT GAAGCCCGCC GAGCAGACCC CGGCGGCGAT CCATGTGCTG ATCGAAGTGA TCGCGGATCT TTTGCCTGCG GGAGTTCTGA ACATCGTGAA TGGCATGGGG CCAGAGGTGG GCGGCCCGCT TGCCGAGTCC GATCGGATCG CCAAGATCGC CTTTACCGGC TCGACCGAGA CCGGGCGGAT CATCATGAAG GCAGCGACCA AGAACCTGAT CCCGGTCACG CTCGAACTTG GGGGAAAATC GCCAAATATC TTCTTTTCGG ATGTGATGGC GCAGGATGAC GCGTTTCTCG ACAAGGCGGT CGAGGGCTTT GTTCTATTTG CCTTCAACCA GGGCGAGGTC TGCACCTGCC CCTCGCGAGC GCTGATCCAG GAAGACATCT ACGAAGAATT CATCGAACGC TGCATCGCGC GCGTCAAAGC GATCAAGCAA GGCGATCCGC GCGACATTGA AACCATGGTC GGGGCGCAGG CGAGCCAGGA ACAACAGGAA AAAATCATGT CCTACCTGAC CATCGGCGTC GAAGAAGGCG CTGAGGTACT TGTCGGTGGC GATGCCGCCC GGTTCAACGG GGACATTGCC AACGGGTTCT ATGTTCAGCC AACCATCCTA AAGGGCCACA ACAAGATGCG CGTGTTCCAA GAGGAAATCT TTGGCCCTGT CGTGTCGGTG ACCACCTTCA AAGACGAAGA AGAGGCCCTC GCGATTGCAA ATGACACCAT GTATGGCCTC GGCGCCGGCG TCTGGACCCG TGACGGAACC CGCGCGTATC GCTTTGGGCG CAATATCGAG GCAGGCCGCG TCTGGGTGAA CAACTACCAC GCATATCCGG CGCACGCGGC CTTTGGCGGG TATAAACAAT CCGGGATTGG GCGCGAAACC CACAAGATGA TGCTCGACCA TTATCAGCAG ACCAAGAACA TGCTGGTGAG CTACAACCCC AACAAACTCG GGTTCTTCTG A
|
Protein sequence | MNEMTQVAGE YTSPFKARYD NFIGGKFVAP VKGQYFDNIT PITGAKVCEV ARSSAEDVEL ALDAAHAAKD SWGKTSPAER ANTLLKIADR IEENLELIAT AETWDNGKPI RETMAADIPL SVDHFRYFAG VLRAQEGSMS EIDHDTVAYH FHEPLGVVGQ IIPWNFSVLM AAWKLAPALA AGNCIVLKPA EQTPAAIHVL IEVIADLLPA GVLNIVNGMG PEVGGPLAES DRIAKIAFTG STETGRIIMK AATKNLIPVT LELGGKSPNI FFSDVMAQDD AFLDKAVEGF VLFAFNQGEV CTCPSRALIQ EDIYEEFIER CIARVKAIKQ GDPRDIETMV GAQASQEQQE KIMSYLTIGV EEGAEVLVGG DAARFNGDIA NGFYVQPTIL KGHNKMRVFQ EEIFGPVVSV TTFKDEEEAL AIANDTMYGL GAGVWTRDGT RAYRFGRNIE AGRVWVNNYH AYPAHAAFGG YKQSGIGRET HKMMLDHYQQ TKNMLVSYNP NKLGFF
|
| |
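Note: the sequences above are shown in space-separated blocks of ten characters. A minimal sketch for working with them follows: it strips the spacing, computes GC content, and checks that a CDS begins with a start codon and ends with a stop codon. The helper names are hypothetical. The stop codons are those of translation table 11; the start codons listed are only a commonly used subset for that table, which is an assumption of this sketch.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def has_start_and_stop(
    seq: str,
    starts=("ATG", "GTG", "TTG"),  # assumption: common subset of table 11 starts
    stops=("TAA", "TAG", "TGA"),   # stop codons of translation table 11
) -> bool:
    """True if the sequence is a whole number of codons with start and stop codons."""
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# The first two blocks of the gene sequence from this record, as an example.
prefix = clean_sequence("ATGAATGAAA TGACGCAGGT")
print(len(prefix))  # 20
print(prefix[:3])   # ATG
```

Pasting the full "Gene sequence" field into `clean_sequence` and passing the result to `gc_percent` and `has_start_and_stop` gives values that can be compared with the "GC content" and "Gene Length" fields of the record.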