Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0330 |
Symbol | |
ID | 3748050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 368830 |
End bp | 370203 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637772857 |
Product | aldehyde dehydrogenase family protein |
Protein accession | YP_378646 |
Protein GI | 78188308 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0316513 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTACAA CAATTAATCC CGCAACCGAA GAGCGGCTTG CAACCTATCC AACCATGAAT GCCGAAGAGC TGGCAGGCGT GCTTGAAGCT ACTCAGCGTG CCGCTGCCGT GTGGCGCAAA CTTTCATTTG AAGAGCGCAC TGCTCCAATG CGCAAAGTAG CCACACTTAT GCGCGAACAA AAAGAGCGCC ATGCAACCTT GATGAGCCTT GAAATGGGCA AACCTTTTTC GCAAGCCTTA GTCGAAGTTG AAAAATCAGC ATGGGTGTGC GACTTCTACG CCGACCATGC AGCCAACTAT CTTACCGCCG AAGAGCACGA TTTGGGCGAT GGCGTGCGTG GTATGGTGAA GTTTGAGCCG CTTGGGGTAA TTTTTGGAGT GATGCCATGG AACTTTCCCT TTTGGCAAGT CTTCCGTTTT GCGGCACCAA CCTTAATGGC TGGCAATGGC GTTGTGGTAA AACATTCACC AAACGTAACG GGATCAGCAA TTGCGATTGA AGAGCTTTTT CGTGAAGCAG GTTTTCCCAC AAATCTTTAT CGGACGGTGC ATATTGCGCT TGACGAAGTT GATGCGTTGA GCGGATTTAT TATTGAGCAT CCCGCAATTC AAGGCATCTC CATTACAGGC AGCACAGGAG CAGGGCGTGC GGTAGCGGCA AAAGCAGGCA AAGCCATAAA GCCAAGCGTT TTAGAGCTTG GTGGCAGCGA TCCCTACCTT GTGCTTGACG ATGCCGATAT TAACCGTGCA GTAAACTTGT GTGCAGCAGC ACGATTGCTC AATAGTGGGC AAACTTGTAT TGCCGCAAAA CGCTTTCTTG TACACCACTC GGTTATGGCG CAATTTCGCG AACTCTTTTT GCAACGCCTC CAAAACGCCG TGATGGGCGA TCCTTTTGAC CAAACGGTAG AAATTGGACC AATGGCACGG CTCGACCTTC GCGACCAATT GCACGATCAA GTGATGCGCT CGATTGCGGC TGGCGCCGAG TTACTTTGCG GTGGCGTTAT TCCCGATCGC GCAGGCTTTT TCTATCCCCC CACGTTGCTT GCAGGCGTTA CCAAAGATAT GCCAGCTTAT AGCGAAGAGT TTTTTGGTCC CGTTGTTACG CTTATTGAAG TTGCCGACGA TGCAGAGGCA ATCCATATTG CCAACGATAC CTCATTTGGG CTTGGTGCTG CCGTTTTTTC ACAAAACATT GAACGAGCAC TTCGTATAGC CGACCAGCTT GAAACAGGTA ACTGCTTTAT TAACAGTGGT GTAAAGTCCG ATCCACGAAT GCCATTTGGA GGCATTAAAG AGTCGGGTTA TGGCAGAGAG CTTGCAGCCT ACGGCATCCG TGCATTTGTG AACATTAAGA CAATTTGCGT GTAG
|
Protein sequence | MITTINPATE ERLATYPTMN AEELAGVLEA TQRAAAVWRK LSFEERTAPM RKVATLMREQ KERHATLMSL EMGKPFSQAL VEVEKSAWVC DFYADHAANY LTAEEHDLGD GVRGMVKFEP LGVIFGVMPW NFPFWQVFRF AAPTLMAGNG VVVKHSPNVT GSAIAIEELF REAGFPTNLY RTVHIALDEV DALSGFIIEH PAIQGISITG STGAGRAVAA KAGKAIKPSV LELGGSDPYL VLDDADINRA VNLCAAARLL NSGQTCIAAK RFLVHHSVMA QFRELFLQRL QNAVMGDPFD QTVEIGPMAR LDLRDQLHDQ VMRSIAAGAE LLCGGVIPDR AGFFYPPTLL AGVTKDMPAY SEEFFGPVVT LIEVADDAEA IHIANDTSFG LGAAVFSQNI ERALRIADQL ETGNCFINSG VKSDPRMPFG GIKESGYGRE LAAYGIRAFV NIKTICV
|
| |