Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0120 |
Symbol | |
ID | 4569026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 140647 |
End bp | 143508 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639764722 |
Product | formate dehydrogenase |
Protein accession | YP_910614 |
Protein GI | 119355970 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTATA CCCATAAACC AACCGTAATA GAAAGCATTG CAGAAAAACT GCACCTTATT CCCGATCTCC ATAAGGAAAA CGTCCGGGAT GGAGCCCGGC GTGCAGCCGA AGAGGGTTCG GAAATAAGCT GCCCTCCTCC ATCGCAGTGG GACAACTGGG TTGAGTACGA TTCGAAAAGC TGGCCTGAGC GCAAGGCTAC CGAGTATATG CTGGTGCCGA CAGCCTGTTT CAATTGCGAG GCCGGTTGCG GTCTTCTTGC CTATGTTGAC AAGGAGAATA TGAAGATCCG TAAGTTGGTG GGCAATCCGT ATCATCCGGC GAGCAGAGGA CGGAACTGCG CCAAAGGGCC CGCAACGCTC AACCAGATTG AGGATTCCGA CAGGGTGCTT TACCCGATGA AGCGGACCGG TAAACGGGGC GAAGGAAAAT GGGCCAGGGT TACCTGGGAC AGCGTTCTTG ACGATATTGC CGGAAGAATG CGCAAGGCTA TTCTTGAGGG GCGCAATAAC GAAATATCCT ATCATGTCGG AAGACCTGGC CATGACGGGT TTATGGAGTG GATTCTCAGG GCGTGGAACG TTGACGGTCA TAACAGTCAT ACCAATGTCT GCTCTTCCGG CGCCCGATTC GGATATGCTA TCTGGGAAGG GTTCGATCGC CCCTCTCCCG ACCATGCCAA TGCGAAATTC ATTCTGCTGG TCAGCGCGCA TCTTGAATCG GGGCACTACT TCAACCCCCA TTCCCAGCGT ATTATCGAGG CCCGAATGAA GGGGGCAAAG CTTGCCGTGC TTGATCCGCG TCTTTCGAAT ACGGCCAGCA TGTCCGATTA CTGGATGCCG AGCTATCCGG GAAGCGAGCC GGCCATACTG CTCGCTATGG CAAAAATCAT AATTGACGAA GGGATTTACA ATCGCGACTA TCTGGAGAAC TGGGTGAACT GGCAGGCTTA TCTGCAGACT GAGTATCCAG GTACGCCGGT TACCTTTGAA AACTTTATCG ATGCCCTGAA AAAAGAGTAC AGCGAATACA CTCCCGAGTA TGCTTCAAAG GAAAGCGGGG TTGACGCAGC GGCCATTGTT GAAGTTGCCC GAAAAATCGG CGAAGCCGGT ACGCAGTTTT CAACCCATGT CTGGCGCAGC GCAAGCAGCG GCAATCTTGG CGGCTGGGCC GTATCGCGCA CCCTGCATTT TCTCAATGTG TTAACCGGCA GCGTCGGAAC CCCCGGAGGC ACCTCTCCAA GCGCATGGAA CAAGTTCAAG CCTACGGTGC ATGCCGAACC CAAACCGCAG ACCTACTGGA ATACCCTGCA GTTGCCTGAT GAGTATCCCC TTGCTCATTT CGAGATGAGT TTTCTTCTTC CTCATTTTCT GAAGGAGGGT CGGGGCAAAC TTGATGTCTA TTTTACAAGG GTTTTCAATC CGGTATGGAC CTATCCCGAC GGCTTTTCAT GGATTGAGGC GCTTGAAGAC GAATCGAAAA TCGGTCTGCA TGCCGCGCTG ACCCCGACAT GGAGCGAAAC GGCCTATTTT GCCGATTATG TGCTTCCGAT GGGCCACTCA GCAGAACGTC ACGATCTTCT GAGCTATGAA ACGCATGCCG GAAAATGGAT CGCATATCGT CAGCCGGTTT TGAGAACGGC TCTCAAGAGA ATGGGCAAGC CGGTCAAGTA TACCTGGGAG GCAAATCCCG GCGAGGTATG GGAGGAGGAT GAATTCTGGA TTGAACTGAC ATGGCGCATC GACCCTGACG GTACCATGGG AATCCGTCAG TACTGCATGT CTCCTTACCG TCCCGGCGAG AAAATCACGA TTGAAGAGTA CTATCGGTAT GTTTTTGAGC ATACGCACGG CTTGCCTGAA AAAGCAGCCG AAGAGGGTCT TACTGCGTAC GATTATATGC AGAAATATGG AGCATTCGAA GTCGAGAGCA ATGTGTACAG TCTGAACGAA AAGCCTGTGG CTCCGGCCGA TCTTCAAGGC TCGGAGGTTC ATCAGCAGAG CGGTCTGATC ACGAAAAACG GCAAGGCTGT GGGCGTTGAG GTGAATGGCC GTTCCTGTAC CGGTTTTCCC ACCCCGTCTC GCAAGCAGGA GTTCTTTTCG CAAACCATGG TGGACTGGAA GTGGCCCGAA TATCGCGTGC CTGGCTACAT TAAAAGCCAT ATTCATCAGG AGATCATGAA CCGGAGCAAG GGCGAGTTCG TTCTTGTGCC CACATTTCGT CTCCCCGTGC TGATTCACTC TCGTTCAGGA AATGCCAAAT GGCTTGCTGA AATCGCTCAT CGCAACCCGG TATGGATCAA CGCCGCAGAC GGCGCGGCTC TGCATATTGA AAATGGCGAT CTGATTCGGG TGAATACCGA TATCGGCTTT TTTGTGAACA GGGCGTGGGT GACTGAAGGG ATACGTCCGG GAGTCGTTGC CTGTTCCCAC CATATCGGTC GCTGGCGCAG GGATCAGGAT CCTGAGGCGA ACCGCTGGGC GACGAACAGG GTGCAGATTT CAAAAGAGGG AAAAGGAAAG TGGAAGATGC GTGTCGAGGA GAGCATTCAG CCTTACGAGA GCAACGATCC CGACTCGTCG AGAATTTTCT GGTCTGACGG CGGAGTGCAT CAGAATATCA CCTTCCCTGT TCATCCCGAT CCGATCAGCG GGATGCATTG CTGGCATCAG AAAGTCAGGA TCGAGAAAGC TCAAGACGGA GATTGTTATG GTGATGTTTT TGTCGATACC GAGCGTTCTT TTGCCATATA CAAGGAGTGG CTTGCCATGA CGCGGCCTGC GCCGGGCCCC GGCGGGCTTC GCCGCCCGCT CTGGCTGAAC CGCCCGTTCA GGCCGGATGA AAAGACCTAC TATCTGCAGT GA
|
Protein sequence | MSYTHKPTVI ESIAEKLHLI PDLHKENVRD GARRAAEEGS EISCPPPSQW DNWVEYDSKS WPERKATEYM LVPTACFNCE AGCGLLAYVD KENMKIRKLV GNPYHPASRG RNCAKGPATL NQIEDSDRVL YPMKRTGKRG EGKWARVTWD SVLDDIAGRM RKAILEGRNN EISYHVGRPG HDGFMEWILR AWNVDGHNSH TNVCSSGARF GYAIWEGFDR PSPDHANAKF ILLVSAHLES GHYFNPHSQR IIEARMKGAK LAVLDPRLSN TASMSDYWMP SYPGSEPAIL LAMAKIIIDE GIYNRDYLEN WVNWQAYLQT EYPGTPVTFE NFIDALKKEY SEYTPEYASK ESGVDAAAIV EVARKIGEAG TQFSTHVWRS ASSGNLGGWA VSRTLHFLNV LTGSVGTPGG TSPSAWNKFK PTVHAEPKPQ TYWNTLQLPD EYPLAHFEMS FLLPHFLKEG RGKLDVYFTR VFNPVWTYPD GFSWIEALED ESKIGLHAAL TPTWSETAYF ADYVLPMGHS AERHDLLSYE THAGKWIAYR QPVLRTALKR MGKPVKYTWE ANPGEVWEED EFWIELTWRI DPDGTMGIRQ YCMSPYRPGE KITIEEYYRY VFEHTHGLPE KAAEEGLTAY DYMQKYGAFE VESNVYSLNE KPVAPADLQG SEVHQQSGLI TKNGKAVGVE VNGRSCTGFP TPSRKQEFFS QTMVDWKWPE YRVPGYIKSH IHQEIMNRSK GEFVLVPTFR LPVLIHSRSG NAKWLAEIAH RNPVWINAAD GAALHIENGD LIRVNTDIGF FVNRAWVTEG IRPGVVACSH HIGRWRRDQD PEANRWATNR VQISKEGKGK WKMRVEESIQ PYESNDPDSS RIFWSDGGVH QNITFPVHPD PISGMHCWHQ KVRIEKAQDG DCYGDVFVDT ERSFAIYKEW LAMTRPAPGP GGLRRPLWLN RPFRPDEKTY YLQ
|
| |