Gene Cphamn1_1889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1889 
Symbol 
ID6375581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2052887 
End bp2053924 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content53% 
IMG OID642684386 
ProductdTDP-glucose 4,6-dehydratase 
Protein accessionYP_001960287 
Protein GI189500817 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0868281 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00930049 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCACATTC TCATCACCGG GGGGGCGGGG TTTATCGGAT CGCATGTGGT GCGGTATTTT 
GTCGATACCT ACCCGGAATA CAGCATCACC AATCTTGACA AATTGACCTA CGCGGGTAAT
CTGGAGAACC TGCGGGATGT GGAGGATCGT TCGAATTACC GCTTTGTCAG GGGTGATATT
ACCGATGGCG CAGTCATGCT GGAGCTTTTT GAAAAAGAGC TTTTTGACGG CGTGATCCAT
CTTGCCGCCG AGTCGCACGT CGATCGTTCG ATTGCCAATC CGACCGGGTT TGTCATGACC
AATGTGCTGG GTACGGTGAA TCTGTTGAAC GCGGCAAGAG CACTCTGGCA AGACGACTAC
AGCGGAAAAC TTTTCTACCA TGTCTCGACC GATGAGGTCT ATGGTGCCTT GGGTGGCGAG
GGCATGTTTA CCGAAGAGAC CTCTTACGAT CCGCACAGCC CGTATTCAGC GTCCAAAGCC
TCTTCGGATC ATTTTGTCCG TGCGTATCAC GATACCTACG GACTGCCTGT GGTGATCAGC
AATTGCTCGA ACAACTACGG ACCGTGTCAG TTTCCGGAAA AGCTGATCCC GCTGTTCATC
AATAACATCA GGAACAACAA GCCGCTTCCT GTTTACGGAA AAGGAGAGAA CGTCCGGGAC
TGGTTGTGGG TCATCGATCA CGCCCGGGCG ATCGACATGA TTTATCACAA GGGAAAGCAG
GGGGAGACCT ACAATATCGG GGGAAACAAC GAGTGGACGA ACATCGCGCT GATCAGACTG
CTCTGCAGTA TCATGGACCG GAAGCTCGGT CGCCCGGAAG GAGAATCCGG GAAATTGATC
ACCTGTGTGA CCGATCGGGC AGGCCACGAT TTCCGCTACG CGATCGACTC CTCAAAACTT
CAGCGGGAAC TGGGATGGAC ACCTTCGCTT CAGTTTGAAG AGGGACTGGA GAAGACGGTG
GACTGGTATC TGGAGAACAG CACCTGGCTC GATCATATCG CCTCGGGGGA GTATTTGAAG
AGAGTGACTA ATGAGTAA
 
Protein sequence
MHILITGGAG FIGSHVVRYF VDTYPEYSIT NLDKLTYAGN LENLRDVEDR SNYRFVRGDI 
TDGAVMLELF EKELFDGVIH LAAESHVDRS IANPTGFVMT NVLGTVNLLN AARALWQDDY
SGKLFYHVST DEVYGALGGE GMFTEETSYD PHSPYSASKA SSDHFVRAYH DTYGLPVVIS
NCSNNYGPCQ FPEKLIPLFI NNIRNNKPLP VYGKGENVRD WLWVIDHARA IDMIYHKGKQ
GETYNIGGNN EWTNIALIRL LCSIMDRKLG RPEGESGKLI TCVTDRAGHD FRYAIDSSKL
QRELGWTPSL QFEEGLEKTV DWYLENSTWL DHIASGEYLK RVTNE