Gene Cpha266_2003 details

Gene Information

Locus tag: Cpha266_2003
Symbol:
ID: 4569530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 2317575
End bp: 2319692
Gene Length: 2118 bp
Protein Length: 705 aa
Translation table: 11
GC content: 50%
IMG OID: 639766585
Product: short chain dehydrogenase
Protein accession: YP_912440
Protein GI: 119357796
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
              [R] General function prediction only
              [S] Function unknown
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
        [COG3347] Uncharacterized conserved protein
TIGRFAM ID:
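
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and contributes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2317575, 2319692)
print(length)                     # 2118
print(protein_length_aa(length))  # 705
```

Both values match the Gene Length and Protein Length fields of the record.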


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGAACC TTTGGAACGA CAATGATTTT CAGCGTTTTG TGCAGGCGCA GAAAAACGTG 
AACGACACGA CTTCAGAACT TGCGGAACTG GTCTACGCTT CCCGGCTGCT TGGGAGGGAG
AGCTCTCTGG TGATGCATGG CGGCGGTAAT ACTTCGGTAA AAAATGACCT GCACGATATT
ATCGGCAATA CTGTCAATGT TATTTACATC AAAGGCAGCG GCATTGATCT TGCTGATGTC
GAGGCTTATG ACTTTACGGC GGTCCGTCTG GAGCCGTTGC GGAAGCTTCA GCATCTCTAT
GCAACGGGGG AGAGGCGGAG CGACGAGGAT ATTCGCAGGT TTTCGACAAG CGAATTCAAG
CATTTTCTCT ATCTGAATCT CTTTCAGCTC ACGGATCACA TTGTGAGCAA TTCGCTTTCG
CCCTCAATTG AGACGCTGTT GCATGCGTTT CTTCCGCACC GCTATATTTT TCATACGCAC
TCGTTGGCGT TGCTTACTCT GAGCAATCAG GCGGATGGCG GGGCGCGGGT TCGCGATGAG
TTTGGCGATG AGTTTGGTTA TCTCCCCTAT ATTCAACCCG GTCTTGGTCT CGCTCGTTCG
GCCGCTGATG CATACAGGGT GAATCCTGAA ATTTGCGGTC TTGTTCTGCA AAAGCATGGT
CTTGTTACGT TTGGCGCCAG TGCCAAAGAG GCATATATTC GTATGATCGA GCATGTTACC
AGACTTGAGG ATTGTATCGC GAGAGCTGCA AGGAAAACGC GTATTCCGGT AACTCTTCCT
GAAGAGATTG CTCTGCCTGA GGATGTTGCT CCCGTTATCA GGGGGGCTGT GGTAGAGGAA
ACTGCGCCGG GTATACGTCA GTACCATCAG TTTATTCTTG ATTTCAGAAC ATCTCCGGCT
ATTCTTGATT ATGTCAATAG TCCCGATCTT TCGGCGATGA GCAGTAAAGG TGCTATGACT
CCGGATTTTA TCATTCGCAC AAAAAATCAG CCGCTTGTGC TTCAGGCTCC CGATGCACGG
GATATTGAGG GCTTCAAGCT CAGTGTTCAA GAAGCCGTAC AGCGCTATCA GGCGAATTAT
TCGGCTTATT TCGATCGCCA GAAAGCTGCC AGTGCAATGA ACGTATCCAT GCTTGATTCG
ATGCCAAGGG TCGTTCTCGT GCCTGGATTG GGTCTTTTCG GTCTTGGCAA GACAGCAAAA
GCGGCCAGGA TCAATGCTGA TATTGCCGCA AGTACCGCTG TGGCTATTCT TGACGCCGAG
TCAATCGGAG AGTTTGAGTC CATTTCTGAA TCCGAGGCAT TTGCCATCGA GTATTGGGAT
ATGGAGCAGT CCAAAGTTAA CAAGGTTCGT CACGATGTAT TCGCAGGTAA AGTTGTGCTT
GTTACCGGAG CTGCCAGCGG GATCGGGCTT GCTACGGCTA AGGCTTTCCG CCAGAAAGGC
GCTGAACTGG TTATTCTCGA TCTGAGTTAT GATGCGCTTG AAAAGGCCAG GGAGGCTCTT
GGAGAGAGTA CGCTTGCAAT AACTTGCGAT GTTACCAACC GTGCGGCGGT GAAAAGCGCA
TTTGATCTTG CCTGTCGTAC GTTTGGGGGT GTTGATATCA TGGTTTCCAA TGTCGGAGTC
GCTTTACAGG GGAGAATCGG CGAGGTTTCT GACGAACTGC TTCGAAAGAG TTTTGAACTT
AATTTTTTTT CGCATCAGTC CATTGCCCAG GAGGCGGTGA GAGTAATGAA ATTGCAGGGA
ACCGGCGGGG TTCTTCTTTT CAATGTATCC AAACAGGCCG TCAACCCGGG AGCTGATTTT
GGTCCTTATG GTCTTCCGAA AGCAGCGACG CTTTTTCTTG TTCGCCAGTA TGCGCTTGAC
CATGGACGCG ATGGTATTCG CTCAAACGGC GTGAATGCAG ACCGTATTCG AAGTGGTCTT
TTGACGGAGG AGATGATCAA GACCCGTTCA CAGGCTCGCG GTCTGAGTGA AAAGGAGTAT
ATGGCGGGAA ATCTGCTTCA GCTTGAAGTA ACAGCCGAGG ATGTTGCCGA TGCTTTCGTT
CATCTGGCGC TTGAAATCAG GACAACAGGG TCGATAACGA CAGTGGACGG TGGAAATATT
GCAGCCGCAC TTCGATAA
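
A GC content figure like the one in the record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text in the 10-base blocks shown above. `gc_percent` is a hypothetical helper name, and how the database rounds its own figure is not known here.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first line of the gene sequence only; pass the full
# sequence text to obtain the value for the whole gene.
first_line = "ATGCTGAACC TTTGGAACGA CAATGATTTT CAGCGTTTTG TGCAGGCGCA GAAAAACGTG"
print(round(gc_percent(first_line), 1))
```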
 
Protein sequence
MLNLWNDNDF QRFVQAQKNV NDTTSELAEL VYASRLLGRE SSLVMHGGGN TSVKNDLHDI 
IGNTVNVIYI KGSGIDLADV EAYDFTAVRL EPLRKLQHLY ATGERRSDED IRRFSTSEFK
HFLYLNLFQL TDHIVSNSLS PSIETLLHAF LPHRYIFHTH SLALLTLSNQ ADGGARVRDE
FGDEFGYLPY IQPGLGLARS AADAYRVNPE ICGLVLQKHG LVTFGASAKE AYIRMIEHVT
RLEDCIARAA RKTRIPVTLP EEIALPEDVA PVIRGAVVEE TAPGIRQYHQ FILDFRTSPA
ILDYVNSPDL SAMSSKGAMT PDFIIRTKNQ PLVLQAPDAR DIEGFKLSVQ EAVQRYQANY
SAYFDRQKAA SAMNVSMLDS MPRVVLVPGL GLFGLGKTAK AARINADIAA STAVAILDAE
SIGEFESISE SEAFAIEYWD MEQSKVNKVR HDVFAGKVVL VTGAASGIGL ATAKAFRQKG
AELVILDLSY DALEKAREAL GESTLAITCD VTNRAAVKSA FDLACRTFGG VDIMVSNVGV
ALQGRIGEVS DELLRKSFEL NFFSHQSIAQ EAVRVMKLQG TGGVLLFNVS KQAVNPGADF
GPYGLPKAAT LFLVRQYALD HGRDGIRSNG VNADRIRSGL LTEEMIKTRS QARGLSEKEY
MAGNLLQLEV TAEDVADAFV HLALEIRTTG SITTVDGGNI AAALR
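
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G). Translation table 11 assigns the same amino acids as the standard table and differs only in its set of permitted start codons, which does not matter here because this gene starts with ATG. The `translate` helper is illustrative; it stops at the first stop codon and does not handle ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (NCBI ordering).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGCTGAACC TTTGGAACGA CAATGATTTT"))  # MLNLWNDNDF
print(translate("GCAGCCGCAC TTCGATAA"))               # AAALR
```

The two outputs correspond to the first ten and last five residues of the protein sequence listed above. The second input is the final line of the gene sequence, which stays in frame because the 34 preceding lines of 60 bases each add up to a multiple of three.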