Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_2121 |
Symbol | |
ID | 6375815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 2296475 |
End bp | 2297884 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642684612 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_001960511 |
Protein GI | 189501041 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.604526 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00230582 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGATGGTGC AAAACAAACA AAATATAACT CTGAATCCTT CGGAGCTTTG TGCTGATCTC AGAACATATT TCGAGAGCGG TATCACACGC TCTTATTCCT GGAGAAAAAA ACAGCTGCTC ATGCTGAGGC GTTTTCTTCT GGAACAGGAA GAAGAGATTT ATGAAGCGCT GCAGAGTGAT TTCCGGAAGT CACAGGCGGA GACTTATTTC ACGGAAATTC ACTATCTCAT AACGGAAATT GATGCTGCTC TGAAACACCT CAAATCCTGG ATGAAAAAGA GGAGGGTGCC TACGCCATTA CGATTTCAGC CGGGCCGAAG TTTTTATACC AGGGAGCCCT ATGGCGTTGT GCTTGTCGTG GCGGCATGGA ATTATCCTCT GCAGCTTGCA GTCGCGCCTG CGATCGGCGC GATTGCGGCC GGAAACTGTG TGATGATAAA ACCATCCGAG CAGTCTCCGG CAACATCGGA GCTTCTTGTG GAAGGCGTGG GTCGCTACCT TGATGGAAAG GCGATAAAGG TTTTTCCGGG GGGAGTTGAA GAAAGCCGGC AGCTGCTCGA AGAATCTTTT GATTACATAT TCTTTACCGG CGGGATCAAA ACCGGAAAAC AGGTCATGCG CAAGGCGGCG GAACATCTTA CGCCGGTCTC TCTCGAACTC GGAGGAAAAA ATCCCTGTAT TGTCGATCAG AACACCAACA TACCTGTAGC GGCGAGAAGA ATTGTCTGGG CGAAGTTTCT CAATGCCGGG CAGACCTGTA TAGCGCCGGA CTACGTTCTT GTAGACCGTA ATGCTGAAGA AGAGCTTCTC AGGTGCATGA AGGATGCCGT TGAGGCGTTT TACGGAACCG AACCTGAAGA GAGCCTTGAT TATCCGGCAG TCATTACTGA AGCTCGTCTT GAGAAGCTTG CCGGCTATCT CGACGAGGGG ATCGTGGTTA CAGGAGGATC GACTGATGCA GGACGTCGTT ATCTCTCCCC GACTATTCTT CGGGAGGTTT CACCTGATGC ACGTGTGATG ACCGATGAGA TTTTCGGTCC CGTGTTACCT GTACTGAGTT ATGAGACGCC GGAAGACGCA CTGAGAGTAG TTCGTATGGC CCGCCACCCT CTTGCGCTCT ATGTTTTTTC CGGAGACCGT TCATTTCAAG AGTATATGAT ACAGAACACG CAGTCCGGCG GGGTCTGTAT CAACGATCTC ATGTTCCAGG CCGCAATTCC CGCTCTTCCT TTCGGCGGCG CCGGAGAAAG CGGAATGGGC GTATACCATG GGGCAGCGGG TTTTGAAACC TTTTCCCGAC CCAGAAGTGT GCATGTGAAA AGAACGTTTC CTGAAAACGT TCTCCGTTAT CCTCCGTTCA GTCAAAAAAA GTTCAACCGT CTCCGGAGGC TGTTTCGTCT GTTTTCATAA
|
Protein sequence | MMVQNKQNIT LNPSELCADL RTYFESGITR SYSWRKKQLL MLRRFLLEQE EEIYEALQSD FRKSQAETYF TEIHYLITEI DAALKHLKSW MKKRRVPTPL RFQPGRSFYT REPYGVVLVV AAWNYPLQLA VAPAIGAIAA GNCVMIKPSE QSPATSELLV EGVGRYLDGK AIKVFPGGVE ESRQLLEESF DYIFFTGGIK TGKQVMRKAA EHLTPVSLEL GGKNPCIVDQ NTNIPVAARR IVWAKFLNAG QTCIAPDYVL VDRNAEEELL RCMKDAVEAF YGTEPEESLD YPAVITEARL EKLAGYLDEG IVVTGGSTDA GRRYLSPTIL REVSPDARVM TDEIFGPVLP VLSYETPEDA LRVVRMARHP LALYVFSGDR SFQEYMIQNT QSGGVCINDL MFQAAIPALP FGGAGESGMG VYHGAAGFET FSRPRSVHVK RTFPENVLRY PPFSQKKFNR LRRLFRLFS
|
| |