Gene Csal_1761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1761 
Symbol 
ID4028288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2006066 
End bp2007406 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content65% 
IMG OID637966949 
ProductUDP-glucose 6-dehydrogenase 
Protein accessionYP_573812 
Protein GI92113884 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1004] Predicted UDP-glucose 6-dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.385474 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATCA GTATCTTCGG TACGGGATAC GTGGGCCTGG TCACCGGGAC TTGCCTGGCG 
GACGTGGGGC ACGAGGTCAT GTGCATGGAC GTCGACGCTG ACAAGATCGC CCGCCTCGAG
CGCGGCGAGA TCCCCATCTA CGAGCCGGGG CTGGACGCCA TGGTGCGTCA GAACGTGAGC
GAGGGCCGGC TGCGGTTCAC CACCGATGCC GCCACCGCCG TGGGCTTCGC GCCGTTGCAG
TTCATCGCCG TGGGCACGCC CCCGGACGAG GATGGCAGTG CCGATTTGCA GTACGTGCTG
GCCGTGGCGC GCAGCATCGG CCAGCACATG CAGGACGACA AGGTCGTCGT CGACAAGTCG
ACCGTGCCGG TGGGCACTGC GGACAAGGTG CGCGCCGCCG TTCAGCAGGA GCTCGATGCC
CGCGACGCTG CGCTCACCGT GGATGTCTGC TCCAATCCCG AGTTCTTGAA GGAAGGCGCC
GCCATCGAGG ACTTCACGCA TGGCGCACGG ATCATCGTGG GCACCGATGC CGAGCGGGTG
CGCGAGATCA TGCGTGAATG CTACGGTCCC TATAACCGGC ACCATGAGAA GTTGATGTTC
ATGGATATCC GCAGCGCCGA GCTGACCAAA TACGCGGCCA ACGCCATGCT CGCCACCAAG
ATCAGCTTCA TGAACGAGAT CGCCAATCTC GCGGAGCGTC TCGGGGCGGA CATCGAACAG
GTGCGGCGCG GCATCGGCTC GGACCCGCGC ATCGGGTATC ACTTCATCTA TCCGGGATGC
GGCTACGGCG GCTCCTGCTT TCCCAAGGAC GTGCAGGCAC TGGCGCGCAC CGCCGGCGAG
ATCGGCTACC ATGCCGAGCT ACTCGAAGCG GTCGAAGGCG TCAACCAGCG TCAGAAGGCC
ACCCTGTTCG CCAAGCTCTC GCAAGCCTTC GACGGCGATC TGGCCGGCAA GACCATCGCG
CTCTGGGGGC TGGCCTTCAA GCCCAACACC GACGACATGC GCGAGGCCCC GAGTCGCGCC
TTGATGGAAG CGCTGTGGGA ATGCGGCGCC CGGGTGCAGG CCTTCGACCC GGAGGCCATG
GACGAATGCC GGCGGATCTA CGGCAAGCGC GACGACCTGG CACTGGTCGA TAATCGCGAG
CAGGCCATCG AGGGGGCCGA CGCCCTGGTG ATCTGTACCG AATGGAAGGC GTTCTGCAGC
GTGGATTTCG CCTGGCTCAA GCAGTCGCTG AGCACACCGG TGGTCGTCGA CGGCCGCAAC
CTGTTCGATC CGCAGGCGGT CAAGCGCGCG GGATTGTTGT ACTTCGCGGT GGGGCGCGGG
GATTCCTTGC GTACGCCATG A
 
Protein sequence
MKISIFGTGY VGLVTGTCLA DVGHEVMCMD VDADKIARLE RGEIPIYEPG LDAMVRQNVS 
EGRLRFTTDA ATAVGFAPLQ FIAVGTPPDE DGSADLQYVL AVARSIGQHM QDDKVVVDKS
TVPVGTADKV RAAVQQELDA RDAALTVDVC SNPEFLKEGA AIEDFTHGAR IIVGTDAERV
REIMRECYGP YNRHHEKLMF MDIRSAELTK YAANAMLATK ISFMNEIANL AERLGADIEQ
VRRGIGSDPR IGYHFIYPGC GYGGSCFPKD VQALARTAGE IGYHAELLEA VEGVNQRQKA
TLFAKLSQAF DGDLAGKTIA LWGLAFKPNT DDMREAPSRA LMEALWECGA RVQAFDPEAM
DECRRIYGKR DDLALVDNRE QAIEGADALV ICTEWKAFCS VDFAWLKQSL STPVVVDGRN
LFDPQAVKRA GLLYFAVGRG DSLRTP