Gene Csal_2067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2067 
Symbol 
ID4026529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2337075 
End bp2338097 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content67% 
IMG OID637967266 
Productglycoside hydrolase family protein 
Protein accessionYP_574117 
Protein GI92114189 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.268597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCAAC CCCTCGGCAC GGTCATGCTG GACATCGAAG GAACGCAGCT CGGCGATGAG 
GAACGTCGTC TGCTGGAACG CCCCGAGGTG GGTGGCGTGA TTCTGTTTGC ACGCAATACG
CGTGATGCCG AGCAGGTACG CCGCCTGACG CGCGAGATTC GCGAACTGCG CCCCGACATG
CTGCTGGCGA TCGACCAGGA AGGTGGTCGA GTGCAACGCT TGCGCGAGGG AGTGACGCGC
TTGCCGAGCA TGGCCGCGCT GGCCGCCGGT TATGCCGACG CGCCCGATGA GGTGCGTTCG
CGGGTACACG AGGCGGGGTG GCTGTTGGGT ATGGAAATGG CCGCCTGCGG CTTCGATGTC
ACTTTCGCGC CGGTGCTCGA CGTGGACGAT CAGCGTTCGC CGGCGATCGG GGACCGCAGT
TTTTCCGCCG ATCCGACAGT CGTGGCGGCT CTCGGCGAGG CCTTCATCGA AGGATTACAC
GAGGCCGGCA TGGTGGCCGT GGGCAAGCAC TTTCCCGGCC ACGGCGGCGT CACCCTCGAC
TCGCACCATG CCTTGCCCGA GGACAATCGG CCGTTGTCGG TTCTGCGCGA GCATGACCTG
GTGCCGTTCA AGGCCCTCTC CGGCAAGCTG GATGCCATGA TGCCGGCGCA TGTCGTCTAT
ACCGCGTTCG ATACACGTCC CGCGGGCTTC TCACCCTCCT GGCTGGGCAT GCTGCGCGAG
GAAATGGCCT TCAAGGGCGT GGTGTTTTCC GATGATCTGA GCATGGCGGG GGCGCATGTG
GCGGGCACCC CCGCGGCGCG TGCCGAGGCT GCTTGGTCGG CGGGGTGCGA CATGGTGCTG
GTGTGCAACG ACCGCGCGGC GGCGCTCGAG ATCGTGGACG CCGCGGCCGG CCGGACCTCG
AAGCGCCTGG GCAAGCTGCG CTACGGCCGC GCCCGTCCGG AACTGGAGAC GCTGCCGGCG
CTGGCACGCT GGCGCCGTGC CCATGCACGC CTGGAAGCGC TCTCGGAAAC ACCGGCGAGT
TGA
 
Protein sequence
MTQPLGTVML DIEGTQLGDE ERRLLERPEV GGVILFARNT RDAEQVRRLT REIRELRPDM 
LLAIDQEGGR VQRLREGVTR LPSMAALAAG YADAPDEVRS RVHEAGWLLG MEMAACGFDV
TFAPVLDVDD QRSPAIGDRS FSADPTVVAA LGEAFIEGLH EAGMVAVGKH FPGHGGVTLD
SHHALPEDNR PLSVLREHDL VPFKALSGKL DAMMPAHVVY TAFDTRPAGF SPSWLGMLRE
EMAFKGVVFS DDLSMAGAHV AGTPAARAEA AWSAGCDMVL VCNDRAAALE IVDAAAGRTS
KRLGKLRYGR ARPELETLPA LARWRRAHAR LEALSETPAS