Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_1972 |
Symbol | |
ID | 6355476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | - |
Start bp | 2188655 |
End bp | 2190772 |
Gene Length | 2118 bp |
Protein Length | 705 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642669570 |
Product | short chain dehydrogenase |
Protein accession | YP_001943983 |
Protein GI | 189347454 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only [S] Function unknown |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) [COG3347] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000000839125 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAATC TTTGGAACGA CACGGATCTT CGGTGCTCGG TGAACGGGCA GTGCAGTGCG GACGATATTC CTGCGGAACT TGCCGAACTG GTTTATGCCT CCCGTCTGCT TGGAAGGGAG AGCAGTCTGG TGATGCATGG CGGCGGCAAT ACCTCGGTGA AAAGCGAGCT GCACGACATT ATCGGTAACC GGGTTAACGT TATTTTCATC AAAGGAAGCG GTGTCGATCT GGCCGCTCTG GACGCACACG ATTTCACACC GGTCAGGATC GAACCCCTGC AGAAGCTGCA GCATCTCTAT GCTACCGGAG AGCGTCGCAG CGAGGAAGAT ATGCAGCGGT TTTCGACGAG GGAGTTCAAG AACTTTCTCT ATCTGAATCT TTTCTATCTG ACCGATCACA TGGTGAACAA CTCGCTTTCA CCTTCGATCG AAACTCTGCT GCATGCGTTT CTTCCCCACC GGTTTATTTT TCATACCCAT TCGACAGCGC TGTTGACGCT CAGCAATCAG CCGAACGGCG CCGAACTCTG CAGGGAGGTG CTCGGAGAGG AGTTCGGTCT TGTGCCCTAT ATCAAGCCGG GTCTCGGTCT TGCGCGTTCT GCTGCAGAGG CATACGGAAA CGCTCCGGAC ATCAGGGGTC TTGTCCTTCA GAAACACGGT CTTGTAACGC TGGCCGACAG TGCGGCTGCC GCCTATGACT GCATGATAGA ATGCGTTTCG AAACTTGAAG AGCGCATAGC CAGGGCCGGA AGAAACGTTT TCCCCTCGAT CTCCCTGCCG GAGACAGTCG CTCTGCTCGA AGATGTCGCA CCCGTTATCA GAGGCGCGGT TGTCGAAGAA AAATCTCCCG GCACGTTCGA ATACAACCAG TTTGTGCTTG ATTTCCGTTC ATCCCCGGAT ATACTGCAAT ACGTGAATGG TATCGAGCTT GAAGAGGTAA GCGGCAGAGG CGCCATGACG CCGGATTTCA TTATCCGGAC AAAAAACCGG CCGCTCGTCG TTCCGGCTCC GGACGCTCTT GATCCCGAGG GGTTCAAGAG TGCCGTTCAT GAGGCCGTCG AGCGGTATAA AGCGGAATAT CTCGCTTATT TTCAGCGTCA GCAGCAGGCT TCGGGCATGC AGGTGACGAT GCTCGATCCG CTTCCGAGGG TGGTTCTTGT CCCCGGGCTC GGACTGTTCG GTCTCGGAAG GACGGCCCAT GCGGCCTCGG TCAATGCCGA TATCGCCGAA AGCACCGCCT CGGCGATTCT CGATGCGCAA TCGGTCGGAA CCTTCGAGTC GATTACCGAG AGGGATGTTT TCAATATCGA GTACTGGGAG ATGGAGCAGG CGAAAATGAA GAAAGTCCGT CACGATGTAT TTGCCGGCAA GGTAGCTCTC GTTACCGGCG CGGCAAGCGG TATCGGGCTC GCTACGGCCA AGGCGTTCCG GCAGAGGGGG GCGGAGCTGG TAATCGTCGA TCTGAACCCG GAGGCACTTG AACGTGCCTC CGCAGAGCTT GGCGGAGGGG TACTCTCCAT AGCGTGCGAC GTTACCGACC GCAATGCGGT AAAGCGGGCC TTTGACGCTG TCTGCCGCCG ATTCGGAGGT CTCGACATTC TTGTTTCGAA TGTAGGAGTC GCACTGCAGG GAAGGATCGG CGACGTTGCG GACGAAGTGC TTCGCCGGAG TTTCGAATTG AATTTCTTTT CGCATCAGTC CATTGCGCAG CAGGCGGTCA GAATCATGAA ACTGCAGGGC ACCGGTGGCG TTCTGCTTTT CAACGTATCC AAGCAGGCGG TAAACCCCGG CCCCGATTTC GGCCCTTACG GGTTGCCCAA AGCGGCAACG ATGTTTCTCG TGCGCCAGTA TGCGCTTGAT CACGGACGCG ACGGCATTCG GGCCAACGGG ATCAATGCCG ACCGTATCCG TACCGGTCTT CTGACCGATG AAATGATCAA AACCCGCTCG AAAGCCCGAG GGCTGAGCGA ACGGGAGTAC ATGGCCGGCA ACCTGCTTCA GGTTGAGGTC ACGGCCGAAG ACGTTGCCGA GGCATTCGTG CACCAGGCGC TTGAAACAAA AACGACCGGC TCTATCGTCA CGGTTGACGG TGGCAACATT GCTGCCGCCC TTCGTTGA
|
Protein sequence | MQNLWNDTDL RCSVNGQCSA DDIPAELAEL VYASRLLGRE SSLVMHGGGN TSVKSELHDI IGNRVNVIFI KGSGVDLAAL DAHDFTPVRI EPLQKLQHLY ATGERRSEED MQRFSTREFK NFLYLNLFYL TDHMVNNSLS PSIETLLHAF LPHRFIFHTH STALLTLSNQ PNGAELCREV LGEEFGLVPY IKPGLGLARS AAEAYGNAPD IRGLVLQKHG LVTLADSAAA AYDCMIECVS KLEERIARAG RNVFPSISLP ETVALLEDVA PVIRGAVVEE KSPGTFEYNQ FVLDFRSSPD ILQYVNGIEL EEVSGRGAMT PDFIIRTKNR PLVVPAPDAL DPEGFKSAVH EAVERYKAEY LAYFQRQQQA SGMQVTMLDP LPRVVLVPGL GLFGLGRTAH AASVNADIAE STASAILDAQ SVGTFESITE RDVFNIEYWE MEQAKMKKVR HDVFAGKVAL VTGAASGIGL ATAKAFRQRG AELVIVDLNP EALERASAEL GGGVLSIACD VTDRNAVKRA FDAVCRRFGG LDILVSNVGV ALQGRIGDVA DEVLRRSFEL NFFSHQSIAQ QAVRIMKLQG TGGVLLFNVS KQAVNPGPDF GPYGLPKAAT MFLVRQYALD HGRDGIRANG INADRIRTGL LTDEMIKTRS KARGLSEREY MAGNLLQVEV TAEDVAEAFV HQALETKTTG SIVTVDGGNI AAALR
|
| |