Gene Information |
Locus tag | Clim_0035 |
Symbol | |
ID | 6355558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | + |
Start bp | 40764 |
End bp | 43625 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642667660 |
Product | molybdopterin dinucleotide-binding region |
Protein accession | YP_001942122 |
Protein GI | 189345593 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
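The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the stop codon is counted in the gene length but not in the protein length. The record does not state either convention; they are assumptions that happen to reproduce the listed values.

```python
# Values copied from the Gene Information table.
start_bp = 40764
end_bp = 43625
protein_length_aa = 953

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the final codon is the stop codon and encodes no residue.
expected_protein_aa = codons - 1

print(gene_length_bp, remainder, expected_protein_aa)  # 2862 0 953
```

Under these assumptions, the computed values agree with the listed Gene Length (2862 bp) and Protein Length (953 aa).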
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.813982 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTATA CCCATAAACC AACTGTAATA GAAAGCCTTG CTGAAAAGCT GCACCTTATT CCTGATCTGC ACGCAGAGAG CGGCGGGGCG AAAGCGCTTC GTGCGGCTGA AGAGGGCTCC GAGGTCAGCT CCCCCCCGCC CGAGCAATGG GATAACTGGG TCGAGTATGA CGCCAAAAGC TGGCCCGAGC GCAAGACGAA AGAGTACATG CTGGTTCCGA CCGCATGTTT CAACTGCGAA GCGGGCTGCG GACTTCTTGC CTATGTCGAT AAAGAGACCA TGGAGATCCG CAAGCTGACC GGGAATCCCT ATCATCCGGC AAGCCGCGGC AGAAACTGCG CGAAGGGCCC GGCAACGCTC AACCAGATTC AGGACACCGA CCGCGTGCTC TATCCCATGA AGCGCGCCGG AAAGCGGGGC GACGGTCAAT GGGAAAGGGT CAGCTGGGAC AGCGTGCTTG ACGATATAGC AGGACGGATG CGCAAGGCCA TCATGGAAGG TCGGAACAAC GAAATTTCTT ACCATGTCGG GCGTCCCGGC CATGACGGTT TCATGGAATG GATTCTTCGT GCATGGAATG TTGACGGCCA CAACAGTCAC ACCAACGTCT GTTCCTCGGG AGCGCGGTTC GGTTACGCCA TATGGGAAGG TTTCGACCGT CCCTCCCCCG ATCATGCCAA CGCTAAATTC ATTCTTCTGG TCAGCGCCCA TCTTGAATCG GGTCACTATT TCAATCCGCA TTCGCAGCGC ATTATCGAGG CTCGCATGAA AGGGGCCAAG CTTGCCGTGC TCGATCCCCG GCTTTCCAAC ACCGCAAGCA TGTCGGACTA CTGGATGCCG AGCTATCCCG GCACCGAACC GGCCGTGCTG CTTGCCATGG CGAAGGTGAT TATCGATGAA GGTCTCTACA ACAGAACCTA CCTTGAAAAC TGGGTGAACT GGCAGGAGTA CCTCCAGGCT GAATATCCAG GAACTCCTGT GACCTTCGAG AACTTCATAG AAGGGCTGAA AAAGGAGTAT GCGCACTATA CCCCCGAATA CGCTTCGAAA GAGAGCGGCG TCGATGCCGC CATGATCGTC GAGATCGCCC GAAAGATAGG AGAGGCCGGC TCTCAGTTTT CTACCCATGT CTGGCGAAGC GCCAGCAGCG GCAATCTCGG CGGCTGGGCA GTGTCGCGTA CGCTGCACTT CCTGAACGTA CTGACAGGCA GCGTCGGTAC GCCGGGCGGC ACCTCTCCCA GCGCATGGAA CAAGTTCAAG CCGCAAGTGC ATGCGGAACC CAAACCGCAG ACCTTCTGGA ACCCGCTCCA TCTGCCAAAC GACTATCCGC TCGCGCATTT CGAGATGAGC TTCCTGCTTC CCCATTTCCT CAAGGAAGGG CGAGGAAAAC TCGACGTCTA TTTCACCAGG GTGTTCAATC CGGTCTGGAC CTATCCCGAC GGCTTTTCAT GGATAGAGGC GCTCGAGGAC GAATCGAAGA TCGGACTGCA CGCCGCGCTT ACTCCAACCT GGAGCGAGAC CGCTTACTTT GCCGATTACG TCCTGCCGAT GGGCCACTCG GCCGAACGGC ACGACCTGCT CAGCTACGAA ACCCACGCAG GGAAATGGAT CGCTTTCCGC CAGCCTGTGC TGCGCACCGC CCTGAGAAGA ATGGGCAAGC CTGTCAGATA TACCTGGGAG GCCAATCCCG GCGAAGTCTG GGAAGAGGAT GAATTCTGGA TCGAACTGAC CTGGCGCATC GATCCTGACG GAAGCCTCGG CATCCGCCAG TACTGCATGT CGCCCTACAG GCCCGGCGAG 
AAAATAACCA TTGACGAGTA CTACCGATAC ATCTTCGAGC ATACGGCAGG TCTGCCTGAA AAGGCAGCAG AAGAGGGGCT TTCAGCGTTC GACTACATGC AGAAATACGG AGCGTTCGAG GTTGAAAGCA ACGTTTACAA TGTGCATGAA AAAGCGGTGC CGCCCTCCGA TCTCGACGGC GCTTCGGTGC AGCCTCAGAA CGGACTGATC GTGAAAAACG GCAAGGCGGT CGGCGTGGAA GTCGCAGGCC GCTCCTGCGC CGGTTTTCCG ACCCCTTCGA AGAAGCAGGA GTTCTATTCA GGCACCATGA TCGACTGGAA GTGGCCGGAG TATCGCCTGC CCGGCTACAT CAAGAGCCAT ATCCATGAGG AGACCATGAA CCATAAGAAC GGAGAGTTTG TGCTGGTGCC GACCTTCCGT CTTCCGGTGC TGATTCACTC CCGTTCCGGC AACGCCAAGT GGCTTGCTGA AATCGCTCAC CGCAATCCGG TCTGGATCAA CGTTGACGAC GGCGCGGCTC TCGGCATAGC CAACGGCGAC CTTATCAGGG TCAACACCGA TATAGGCTTT TTCGTGAACC GCGCATGGGT GACTGAAGGT ATCCGCCCGG GCGTGGTTGC CTGTTCGCAT CACATCGGAC GGTGGCGTCG CGAGCAGGAT CCAGAAGCCA ACCGCTGGGC GGCCAACAGG GTCAATATTT CCAAAGAAGG GAAAGGCAAG TGGAAAATGC GCGTTGAAGA GAACATCCAG CCCTACGAGA GCAGCGATGC CGACTCCTCG AGGATTTTCT GGTCGGATGG CGGCGTTCAT CAGAACATCA CCTTTCCGGT ACATCCCGAT CCGATAAGCG GCATGCACTG CTGGCACCAG AAGGTTCGGA TCGAGAAGGC GCACGAGGGA GACCAGTATG GCGATGTTTT TGTCGATACC GACCGTTCTT TCCGGATTTA CAAGGAGTGG CTCGCCATGA CGCGTCCCGC GCCCGGACCA GGAGGCCTTC GCCGTCCGCT CTGGCTTAAC CGTCCGTTCA GACCCGATGA GAAGACCTAC TATCTGAAAT AG
|
Protein sequence | MSYTHKPTVI ESLAEKLHLI PDLHAESGGA KALRAAEEGS EVSSPPPEQW DNWVEYDAKS WPERKTKEYM LVPTACFNCE AGCGLLAYVD KETMEIRKLT GNPYHPASRG RNCAKGPATL NQIQDTDRVL YPMKRAGKRG DGQWERVSWD SVLDDIAGRM RKAIMEGRNN EISYHVGRPG HDGFMEWILR AWNVDGHNSH TNVCSSGARF GYAIWEGFDR PSPDHANAKF ILLVSAHLES GHYFNPHSQR IIEARMKGAK LAVLDPRLSN TASMSDYWMP SYPGTEPAVL LAMAKVIIDE GLYNRTYLEN WVNWQEYLQA EYPGTPVTFE NFIEGLKKEY AHYTPEYASK ESGVDAAMIV EIARKIGEAG SQFSTHVWRS ASSGNLGGWA VSRTLHFLNV LTGSVGTPGG TSPSAWNKFK PQVHAEPKPQ TFWNPLHLPN DYPLAHFEMS FLLPHFLKEG RGKLDVYFTR VFNPVWTYPD GFSWIEALED ESKIGLHAAL TPTWSETAYF ADYVLPMGHS AERHDLLSYE THAGKWIAFR QPVLRTALRR MGKPVRYTWE ANPGEVWEED EFWIELTWRI DPDGSLGIRQ YCMSPYRPGE KITIDEYYRY IFEHTAGLPE KAAEEGLSAF DYMQKYGAFE VESNVYNVHE KAVPPSDLDG ASVQPQNGLI VKNGKAVGVE VAGRSCAGFP TPSKKQEFYS GTMIDWKWPE YRLPGYIKSH IHEETMNHKN GEFVLVPTFR LPVLIHSRSG NAKWLAEIAH RNPVWINVDD GAALGIANGD LIRVNTDIGF FVNRAWVTEG IRPGVVACSH HIGRWRREQD PEANRWAANR VNISKEGKGK WKMRVEENIQ PYESSDADSS RIFWSDGGVH QNITFPVHPD PISGMHCWHQ KVRIEKAHEG DQYGDVFVDT DRSFRIYKEW LAMTRPAPGP GGLRRPLWLN RPFRPDEKTY YLK
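The relationship between the gene and protein sequences can be spot-checked by translating the ends of the gene sequence. The sketch below is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which start codons it permits. The helper name `translate` and the variables are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate in frame 1, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 12 bases copied from the Gene sequence field.
# Assumption: the last 12 bases are in frame, i.e. the gene length is a multiple of 3.
first_30 = "ATGAGCTATA CCCATAAACC AACTGTAATA"
last_12 = "TATCTGAAAT AG"

print(translate(first_30))  # MSYTHKPTVI
print(translate(last_12))   # YLK
```

Both fragments match the start (MSYTHKPTVI) and end (YLK) of the listed protein sequence. The final TAG is the stop codon.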
|
| |