Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_2056 |
Symbol | |
ID | 6375749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 2219728 |
End bp | 2221101 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642684547 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_001960447 |
Protein GI | 189500977 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0119311 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGTTA CCCTGAACCC CGCTAACGAA GAAGTGCTTG CCGAGTATCC GGTCATGACT TCTGCAGAGA TTGACAGGAT TCTTGAAGCT TCGGAGAACG CCGCCCTTAT CTGGAAAAAA ATCCCGATCG ATGAGCGAAA AATCGCGATG CATCGTCTTG CCGATCTGCT GAGGGAGCAA AAAGAGATGC ACGGGGCGAT GATCAGCCGT GAGATGGGCA AACGCTATGC CGAGTCGGTC GCCGAGGTTG AAAAATGCGC ATGGGTCTGT GATTATTACG CCGAACACGC GGAGGCCTTC CTGCAGCCTG AAAAGGTTGA TATGGATGGC GGAGCCGGAC TTGTGACCTT TGTCCCGCTT GGTGTCGTTC TCGGGGTCAT GCCCTGGAAT TTTCCTTTCT GGCAGGTGAT TCGTTTTGCG GCTGCGGTTA TGATGGCAGG GAACGGTGTT GTCATCAAGC ACGCTCCCAA CGTGACCGGA TCGGCGATCG CGCTGGAAAA CCTTTTTCGT GAAGCCGGTT TTCCCGTGAA CCTGTACAGG ACTTTGCATA TAGATCTTGA AGATGTTGAT CGCATGGTCG GCCACATCAT CGCTCATCCG GTGATCAAGG CTGTTTCGGT TACAGGCAGT ACCGGTGCGG GAGTTGCCGT GGCGTCTAAA GCAGGCAGTG CGCTCAAGAG AAGTGTTCTT GAACTGGGAG GTAATGATCC CTATCTGGTG CTCGATGATG CTGATCTTGA TGAGGCCGTA GGGTTCTGTA TCGCGTCCCG GCTTTTGAAC GCGGGTCAGA GTTGTATCGC CGCCAAGCGT TTTGTCGTTC ACCGTTCTGT TACATCACGT TTCGAGCAAA AGCTGCTCGA TACAATGAGC AAAAAGAAAG TCGGGGATCC TTTTGATCCC GGCATACACA TAGGGCCGAT AGCGAGGAAA GATCTCAGAG ACGCTCTCCA CCTTCAGGTG GAGCAGAGCA GAGGGCTCGG GGCAAAGGTC CTTTGCGGAG GCGAGATTCC TGACAGAAAA GGCTTTTTTT ATCCGCCGAC GATTGTTACG GATGTCTCTG CGGATATGGC GGTCTATAGT GAAGAGACAT TCGGGCCGGT CGCGACGATT CTTGAAGCAC GGGATGACGA TGATGCCGTC AGGATCGCCA ATGACAGCCC TTTCGGTCTT GGATCAGCGG TTTTTTCCGG TGATCCTGAC CGCGCCAGAC GGGTCGCCGC CAGGCTGGAT GCCGGAAACT GCTGTATCAA TTCGATGGTA AAGTCAGACC CTCGTCTGCC TTTTGGCGGG ATTAAACAGT CAGGTTACGG CCGTGAACTT TCCAGCTACG GTATTCGGGA GTTCGTCAAT ATCAAATCGA TCTATATCGC TTAG
|
Protein sequence | MIVTLNPANE EVLAEYPVMT SAEIDRILEA SENAALIWKK IPIDERKIAM HRLADLLREQ KEMHGAMISR EMGKRYAESV AEVEKCAWVC DYYAEHAEAF LQPEKVDMDG GAGLVTFVPL GVVLGVMPWN FPFWQVIRFA AAVMMAGNGV VIKHAPNVTG SAIALENLFR EAGFPVNLYR TLHIDLEDVD RMVGHIIAHP VIKAVSVTGS TGAGVAVASK AGSALKRSVL ELGGNDPYLV LDDADLDEAV GFCIASRLLN AGQSCIAAKR FVVHRSVTSR FEQKLLDTMS KKKVGDPFDP GIHIGPIARK DLRDALHLQV EQSRGLGAKV LCGGEIPDRK GFFYPPTIVT DVSADMAVYS EETFGPVATI LEARDDDDAV RIANDSPFGL GSAVFSGDPD RARRVAARLD AGNCCINSMV KSDPRLPFGG IKQSGYGREL SSYGIREFVN IKSIYIA
|
| |