Gene Csal_2221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2221 
Symbol 
ID4026413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2493962 
End bp2494942 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content68% 
IMG OID637967426 
ProductKpsF/GutQ family protein 
Protein accessionYP_574271 
Protein GI92114343 
COG category[K] Transcription
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation
[COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACCG TCACCGATCA TGATTACCGC GCCAGCGCCC GCCGTACGCT GACCCTCGAA 
TCGCATGCCG TGGCCGCCTT GATCGAACGC CTCGACGAAG CGTTCGATCA CGCCTGCCAG
CTATTTCTCG CCTGCGAGGG ACGCATCATC GTCACCGGCA TGGGCAAGTC CGGGCATATC
GCCCGCAAGA TTGCCGCGAC CCTGGCCAGC ACGGGCACGC CGGCGTTTTA CGTTCACCCC
GGCGAGGCCA GCCATGGCGA CATGGGCATG ATCACCGCGC GCGACGTGGT CCTGGCGCTG
TCCAATTCCG GCGAGACCGC CGAGGTCACG GCGCTGCTGC CGCTTCTCAA GCGCATGGGC
ACCCCTCTGG TCAGCATGAC CGGGCGCCCG GGCTCGAGCC TGGCGCGGCA TGCCGAAGCT
CACCTGGACA CCGCGGTGGA TCGCGAGGCG TGCCCGCTCG ACCTGGCCCC GACCGCGTCG
ACCACTGCCG CCCTGGCCAT GGGCGATGCC CTGGCGGTGG CCTTGCTCGA GGCACGCGGC
TTCACCGCCG AGGATTTTGC CCTGTCGCAT CCCGGCGGTA GCCTGGGCCG GCGATTGCTG
CTCAAGGTCG AAGACCTCAT GCATCAGGGC GATCGCCTGC CCCGGGTGGC GCTGGGCAGC
CCACTGCGCG ATGCCTTGCT GGAGATCACG CGTCAGGGCC TGGGATTCAC CTGCGTGCTC
GACGAGGACG GCCGCCTCGC CGGGGTCTAC ACCGACGGCG ACCTGCGTCG CACCCTCGAC
CATCATGACG ATCTGCGCCA GCTGCGCGTG GACGACGTCA TGACCCACGG CGGCAAGACG
ATTCGTCCTC AATTGCTGGC TGCCGAGGCG GTCAAGATCA TGGAAGACAA TCGCATCACA
GCCCTGGCCG TGGTCGACGA CCAGGGCCAT CCGGTCGGCG TCCTGCACAT GCACGACCTG
CTGGCCAGCG GCGTCATCTG A
 
Protein sequence
MNTVTDHDYR ASARRTLTLE SHAVAALIER LDEAFDHACQ LFLACEGRII VTGMGKSGHI 
ARKIAATLAS TGTPAFYVHP GEASHGDMGM ITARDVVLAL SNSGETAEVT ALLPLLKRMG
TPLVSMTGRP GSSLARHAEA HLDTAVDREA CPLDLAPTAS TTAALAMGDA LAVALLEARG
FTAEDFALSH PGGSLGRRLL LKVEDLMHQG DRLPRVALGS PLRDALLEIT RQGLGFTCVL
DEDGRLAGVY TDGDLRRTLD HHDDLRQLRV DDVMTHGGKT IRPQLLAAEA VKIMEDNRIT
ALAVVDDQGH PVGVLHMHDL LASGVI