Gene Csal_1722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1722 
Symbol 
ID4028830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1960938 
End bp1962059 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content66% 
IMG OID637966910 
ProductUDP-galactose 4-epimerase 
Protein accessionYP_573773 
Protein GI92113845 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.202169 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATTT TAGTGACGGG CGGGGCCGGG TACATCGGCT CGCATATGGT GCTGCGGCTC 
ATCGAAGCCG GCCACGAGGT GGTCGTGATC GACAACCTCT GCAATGCCTC GCGAGAGTCG
CTGGAGCGCG TCTCGCAGTT GACCGGCAAG GAGGTGACCT TCATCGAGGG CGACATTCGC
GATCGTTCGC TGCTGGATTA CGTGTTCGCG GACTTCGAGA TCAGCGATGT GCTGCATTTC
GCGGGTCTCA AGTCGGTGGG CGAGAGCGTC AGCGAGCCAC TGGCGTATTT CGAGAACAAC
GTGGCGGGCA CCATCACGCT GTGCCAGGCG ATGACGGCGG CGGGCGTGTA CCGCCTGGTG
TTCAGTTCCT CGGCGACGGT GTATGGCGAC GCCACGCGCA TGCCGTTGAG CGAAAACGCG
CCTACCGGGC AACCGACCAA CGCCTACGGG CATTCCAAGC TGATGGTCGA GGAGGTGCTG
CGCAAGCTGG CGCGGTCCGA CCCACGCTGG GCGATCGCCT TGTTGCGCTA CTTCAACCCG
GTGGGGGCGC ACCCCAGTGG CATGATCGGC GAGGACCCGT CGGGCACGCC CAACAATCTG
CTGCCGTTCA TCTCGCAGGT GGCGATCGGT CGGCTACCGG CGCTTTCGGT CTTCGGCGAC
GACTATCCGA CGCCCGATGG CACCGGGGTG CGCGATTACA TCCATGTGAT GGATCTGGTC
GAGGGACACC TGGCGGCAAT GCGCGTGCTG GCGGATCGTG CGGGCGTGAA CGTCTGGAAC
CTGGGCACGG GGCAGGGCTA CTCGGTACTG GAGATGGTGC GCGCCTTCGA GCATGTCGCC
CGGCGCGACG TGCCGTATCG CATCGTGCCG CGTCGCGACG GCGATATCGC CGCATGCTGG
GCCGACGCCT CGCTGGCCGA GCGTGAGCTG GGCTGGCGGG CGCAACGCGG CCTGATGGAC
ATGATCGCCG ATACCTGGCG CTGGCAGTCG CGCAACCCCG AAGGCTACCC GCGCAAGCGC
ATGATCCGGC GCGAGACCGT CGGGGCCGCG CGTGCCGTGG GCGCCGGCTT GCCCCGTATC
TATCTGATAG ACACCGCGCG GGCCAACTCC GTGGCGTCGT AG
 
Protein sequence
MTILVTGGAG YIGSHMVLRL IEAGHEVVVI DNLCNASRES LERVSQLTGK EVTFIEGDIR 
DRSLLDYVFA DFEISDVLHF AGLKSVGESV SEPLAYFENN VAGTITLCQA MTAAGVYRLV
FSSSATVYGD ATRMPLSENA PTGQPTNAYG HSKLMVEEVL RKLARSDPRW AIALLRYFNP
VGAHPSGMIG EDPSGTPNNL LPFISQVAIG RLPALSVFGD DYPTPDGTGV RDYIHVMDLV
EGHLAAMRVL ADRAGVNVWN LGTGQGYSVL EMVRAFEHVA RRDVPYRIVP RRDGDIAACW
ADASLAEREL GWRAQRGLMD MIADTWRWQS RNPEGYPRKR MIRRETVGAA RAVGAGLPRI
YLIDTARANS VAS