Gene Cpha266_0120 details

Gene Information

Locus tag: Cpha266_0120
Symbol:
ID: 4569026
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 140647
End bp: 143508
Gene Length: 2862 bp
Protein Length: 953 aa
Translation table: 11
GC content: 53%
IMG OID: 639764722
Product: formate dehydrogenase
Protein accession: YP_910614
Protein GI: 119355970
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID:
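
The coordinates and lengths listed above are mutually consistent, and a short sketch can check this. It assumes inclusive, 1-based coordinates (the record does not state its convention) and that the last codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is assumed to be a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(140647, 143508)
print(length)                  # 2862
print(protein_length(length))  # 953
```

Under these assumptions, 2862 bp corresponds to 954 codons, that is, 953 residues plus one stop codon, matching the values in the record.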


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTATA CCCATAAACC AACCGTAATA GAAAGCATTG CAGAAAAACT GCACCTTATT 
CCCGATCTCC ATAAGGAAAA CGTCCGGGAT GGAGCCCGGC GTGCAGCCGA AGAGGGTTCG
GAAATAAGCT GCCCTCCTCC ATCGCAGTGG GACAACTGGG TTGAGTACGA TTCGAAAAGC
TGGCCTGAGC GCAAGGCTAC CGAGTATATG CTGGTGCCGA CAGCCTGTTT CAATTGCGAG
GCCGGTTGCG GTCTTCTTGC CTATGTTGAC AAGGAGAATA TGAAGATCCG TAAGTTGGTG
GGCAATCCGT ATCATCCGGC GAGCAGAGGA CGGAACTGCG CCAAAGGGCC CGCAACGCTC
AACCAGATTG AGGATTCCGA CAGGGTGCTT TACCCGATGA AGCGGACCGG TAAACGGGGC
GAAGGAAAAT GGGCCAGGGT TACCTGGGAC AGCGTTCTTG ACGATATTGC CGGAAGAATG
CGCAAGGCTA TTCTTGAGGG GCGCAATAAC GAAATATCCT ATCATGTCGG AAGACCTGGC
CATGACGGGT TTATGGAGTG GATTCTCAGG GCGTGGAACG TTGACGGTCA TAACAGTCAT
ACCAATGTCT GCTCTTCCGG CGCCCGATTC GGATATGCTA TCTGGGAAGG GTTCGATCGC
CCCTCTCCCG ACCATGCCAA TGCGAAATTC ATTCTGCTGG TCAGCGCGCA TCTTGAATCG
GGGCACTACT TCAACCCCCA TTCCCAGCGT ATTATCGAGG CCCGAATGAA GGGGGCAAAG
CTTGCCGTGC TTGATCCGCG TCTTTCGAAT ACGGCCAGCA TGTCCGATTA CTGGATGCCG
AGCTATCCGG GAAGCGAGCC GGCCATACTG CTCGCTATGG CAAAAATCAT AATTGACGAA
GGGATTTACA ATCGCGACTA TCTGGAGAAC TGGGTGAACT GGCAGGCTTA TCTGCAGACT
GAGTATCCAG GTACGCCGGT TACCTTTGAA AACTTTATCG ATGCCCTGAA AAAAGAGTAC
AGCGAATACA CTCCCGAGTA TGCTTCAAAG GAAAGCGGGG TTGACGCAGC GGCCATTGTT
GAAGTTGCCC GAAAAATCGG CGAAGCCGGT ACGCAGTTTT CAACCCATGT CTGGCGCAGC
GCAAGCAGCG GCAATCTTGG CGGCTGGGCC GTATCGCGCA CCCTGCATTT TCTCAATGTG
TTAACCGGCA GCGTCGGAAC CCCCGGAGGC ACCTCTCCAA GCGCATGGAA CAAGTTCAAG
CCTACGGTGC ATGCCGAACC CAAACCGCAG ACCTACTGGA ATACCCTGCA GTTGCCTGAT
GAGTATCCCC TTGCTCATTT CGAGATGAGT TTTCTTCTTC CTCATTTTCT GAAGGAGGGT
CGGGGCAAAC TTGATGTCTA TTTTACAAGG GTTTTCAATC CGGTATGGAC CTATCCCGAC
GGCTTTTCAT GGATTGAGGC GCTTGAAGAC GAATCGAAAA TCGGTCTGCA TGCCGCGCTG
ACCCCGACAT GGAGCGAAAC GGCCTATTTT GCCGATTATG TGCTTCCGAT GGGCCACTCA
GCAGAACGTC ACGATCTTCT GAGCTATGAA ACGCATGCCG GAAAATGGAT CGCATATCGT
CAGCCGGTTT TGAGAACGGC TCTCAAGAGA ATGGGCAAGC CGGTCAAGTA TACCTGGGAG
GCAAATCCCG GCGAGGTATG GGAGGAGGAT GAATTCTGGA TTGAACTGAC ATGGCGCATC
GACCCTGACG GTACCATGGG AATCCGTCAG TACTGCATGT CTCCTTACCG TCCCGGCGAG
AAAATCACGA TTGAAGAGTA CTATCGGTAT GTTTTTGAGC ATACGCACGG CTTGCCTGAA
AAAGCAGCCG AAGAGGGTCT TACTGCGTAC GATTATATGC AGAAATATGG AGCATTCGAA
GTCGAGAGCA ATGTGTACAG TCTGAACGAA AAGCCTGTGG CTCCGGCCGA TCTTCAAGGC
TCGGAGGTTC ATCAGCAGAG CGGTCTGATC ACGAAAAACG GCAAGGCTGT GGGCGTTGAG
GTGAATGGCC GTTCCTGTAC CGGTTTTCCC ACCCCGTCTC GCAAGCAGGA GTTCTTTTCG
CAAACCATGG TGGACTGGAA GTGGCCCGAA TATCGCGTGC CTGGCTACAT TAAAAGCCAT
ATTCATCAGG AGATCATGAA CCGGAGCAAG GGCGAGTTCG TTCTTGTGCC CACATTTCGT
CTCCCCGTGC TGATTCACTC TCGTTCAGGA AATGCCAAAT GGCTTGCTGA AATCGCTCAT
CGCAACCCGG TATGGATCAA CGCCGCAGAC GGCGCGGCTC TGCATATTGA AAATGGCGAT
CTGATTCGGG TGAATACCGA TATCGGCTTT TTTGTGAACA GGGCGTGGGT GACTGAAGGG
ATACGTCCGG GAGTCGTTGC CTGTTCCCAC CATATCGGTC GCTGGCGCAG GGATCAGGAT
CCTGAGGCGA ACCGCTGGGC GACGAACAGG GTGCAGATTT CAAAAGAGGG AAAAGGAAAG
TGGAAGATGC GTGTCGAGGA GAGCATTCAG CCTTACGAGA GCAACGATCC CGACTCGTCG
AGAATTTTCT GGTCTGACGG CGGAGTGCAT CAGAATATCA CCTTCCCTGT TCATCCCGAT
CCGATCAGCG GGATGCATTG CTGGCATCAG AAAGTCAGGA TCGAGAAAGC TCAAGACGGA
GATTGTTATG GTGATGTTTT TGTCGATACC GAGCGTTCTT TTGCCATATA CAAGGAGTGG
CTTGCCATGA CGCGGCCTGC GCCGGGCCCC GGCGGGCTTC GCCGCCCGCT CTGGCTGAAC
CGCCCGTTCA GGCCGGATGA AAAGACCTAC TATCTGCAGT GA
 
Protein sequence
MSYTHKPTVI ESIAEKLHLI PDLHKENVRD GARRAAEEGS EISCPPPSQW DNWVEYDSKS 
WPERKATEYM LVPTACFNCE AGCGLLAYVD KENMKIRKLV GNPYHPASRG RNCAKGPATL
NQIEDSDRVL YPMKRTGKRG EGKWARVTWD SVLDDIAGRM RKAILEGRNN EISYHVGRPG
HDGFMEWILR AWNVDGHNSH TNVCSSGARF GYAIWEGFDR PSPDHANAKF ILLVSAHLES
GHYFNPHSQR IIEARMKGAK LAVLDPRLSN TASMSDYWMP SYPGSEPAIL LAMAKIIIDE
GIYNRDYLEN WVNWQAYLQT EYPGTPVTFE NFIDALKKEY SEYTPEYASK ESGVDAAAIV
EVARKIGEAG TQFSTHVWRS ASSGNLGGWA VSRTLHFLNV LTGSVGTPGG TSPSAWNKFK
PTVHAEPKPQ TYWNTLQLPD EYPLAHFEMS FLLPHFLKEG RGKLDVYFTR VFNPVWTYPD
GFSWIEALED ESKIGLHAAL TPTWSETAYF ADYVLPMGHS AERHDLLSYE THAGKWIAYR
QPVLRTALKR MGKPVKYTWE ANPGEVWEED EFWIELTWRI DPDGTMGIRQ YCMSPYRPGE
KITIEEYYRY VFEHTHGLPE KAAEEGLTAY DYMQKYGAFE VESNVYSLNE KPVAPADLQG
SEVHQQSGLI TKNGKAVGVE VNGRSCTGFP TPSRKQEFFS QTMVDWKWPE YRVPGYIKSH
IHQEIMNRSK GEFVLVPTFR LPVLIHSRSG NAKWLAEIAH RNPVWINAAD GAALHIENGD
LIRVNTDIGF FVNRAWVTEG IRPGVVACSH HIGRWRRDQD PEANRWATNR VQISKEGKGK
WKMRVEESIQ PYESNDPDSS RIFWSDGGVH QNITFPVHPD PISGMHCWHQ KVRIEKAQDG
DCYGDVFVDT ERSFAIYKEW LAMTRPAPGP GGLRRPLWLN RPFRPDEKTY YLQ
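
The sequences above are printed in blocks of ten characters separated by spaces. The following is a minimal sketch of how such a block-formatted nucleotide sequence could be joined into one string and its GC content computed, for comparison with the 53% listed in the record. The helper names are hypothetical and not part of the source database. Only the first line of the gene sequence is used here, so its GC percentage will differ from the whole-gene value.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGAGTTATA CCCATAAACC AACCGTAATA GAAAGCATTG CAGAAAAACT GCACCTTATT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence (all lines joined) to `clean_sequence` and then `gc_percent` would give the figure to compare against the record.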