Gene Csal_0781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0781 
Symbol 
ID4026084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp874886 
End bp876307 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content66% 
IMG OID637965947 
ProductL-sorbosone dehydrogenase 
Protein accessionYP_572837 
Protein GI92112909 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCTAA CATCCAAGCC CACTGGCATG CGTTACGTCG CGCCCCTTTC GCTTCTGCTG 
CTCGCCGGCG GCGCCCAGGC CGCCGACTCG GGCCAGGAGT ACGGTCCTGA CCCGGAACTG
CCCGAACCGC AGCGAGGCCT TCTGCCCAAC ATGACGGTGC CCGAGCAGGC CCCCTGGGGC
GACGCGAAAC CCACGGTACC CGATGGCTAC ACGATCACCG CCATCGCCAC CGACCTGAAG
GTCCCACGTC AGACGTTGGT GCTTCCGAAC GGCGATATCC TGGTGGCGGA AGGCAAAGGC
GGCGGCAAGG CGCCGAAATC GCGGCCTAAG GATTTCATCG CCGGGCTGAT TCAGTCGCAA
GGCACCACCT CGGTCAAGGG CGGCGACCGG CTGACCTTGC TGCGCGACGG CGACGATGAT
GGCAAATACG AGGAACGAAC GGTCTTCGCC GAGAACCTCA ATGCGCCCTA CGGCCTCGCC
CTGGTGGATG ACGATCTCTA CGTGGCCAAT CAAGATTCAC TCGTTCGCTT CGACTACGAA
ACGGGCCAGA CCGAAGCCAG CGGCCCACCG GAACTGGTCA CGCCGCTGCC GTCGGAGATC
AATCATCACT GGACCAAGGC CCTGACCGCC AGCGCGGACG GCGACTATCT CTACGTCGGC
ATCGGTTCCA ACAGCAACAT CACCGAACGC GGTATGGCGG CGGAAGTGAA CCGTGCGGAA
ATCTGGGAGA TCGACCCGGA AACCGGGGCA CATCGCGCCT ATGCCACCGG CGTGCGCAAC
CCCACCGCGC TGACCATTCA GCCGGAGACC GACCGGCTCT GGGCCGTCGC CAACGAGCGT
GACGAACTCG GCCCCAATCT GGTGCCCGAC TATCTCACCT CGATCCAGGA GGGCGGTTTC
TACGGCTGGC CGTACAGCTA CTGGGGGCAG CATGTCGACC CGCGCGTTCG TCCGCAGAAC
CCCGAGAAGG TGGAAGCGGC GATCGCCCCC GACTACAGCC TCGGCTCGCA CCACGCGCCG
CTCGGCGTCG ACTTCTCCAA CCCGGCCGTG GGCGGAGAAT TCGCCAATGG CGTCTTCGTC
GGCGAGCACG GCAGTTGGAA CCGTGCCGAT CCCGTGGGGT ACAAGGTGGT CTTCATCCCG
TTCGAGAACG GCCGCCCCGC CGGCGACCCG GTCGACTTCG TCTCCGGCTT CCTGACCGAC
GACGGCAAGA CCCGCGGCCG CCCCGTCGGC GTCACCGTCG CGCCGGATGG CTCGGTGATC
GTCGCCGACG ACATGACCAA TGCGATCTGG CGGGTGACGC GAGATGACGA CCAGGCACCG
TCAGCGGAGT CCGCCACCGA GACATCGGGC TCCAGTGAGG AAACGGCATC TTCTGAAGGC
GAGTCCGAGA TGCCCGAGGA CGGATACGCC GGGGATGGAT GA
 
Protein sequence
MKLTSKPTGM RYVAPLSLLL LAGGAQAADS GQEYGPDPEL PEPQRGLLPN MTVPEQAPWG 
DAKPTVPDGY TITAIATDLK VPRQTLVLPN GDILVAEGKG GGKAPKSRPK DFIAGLIQSQ
GTTSVKGGDR LTLLRDGDDD GKYEERTVFA ENLNAPYGLA LVDDDLYVAN QDSLVRFDYE
TGQTEASGPP ELVTPLPSEI NHHWTKALTA SADGDYLYVG IGSNSNITER GMAAEVNRAE
IWEIDPETGA HRAYATGVRN PTALTIQPET DRLWAVANER DELGPNLVPD YLTSIQEGGF
YGWPYSYWGQ HVDPRVRPQN PEKVEAAIAP DYSLGSHHAP LGVDFSNPAV GGEFANGVFV
GEHGSWNRAD PVGYKVVFIP FENGRPAGDP VDFVSGFLTD DGKTRGRPVG VTVAPDGSVI
VADDMTNAIW RVTRDDDQAP SAESATETSG SSEETASSEG ESEMPEDGYA GDG