Gene Csal_0226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0226 
Symbol 
ID4027309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp256873 
End bp258120 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content62% 
IMG OID637965377 
ProductHipA-like protein 
Protein accessionYP_572289 
Protein GI92112361 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAGT TCGAAGTACA TATCGACCTC GGTGGGAGCA CCTTTCCAGT CGGTCTGGCC 
AGGTGCAATC GCGTGCGTGG CACGGAGAGG ATTCTTTTCG AATACGACGC CGAATGGCTC
GCGTCGCCGG ATCGCTTCTC ACTGGAGCCT GCCCTTGCAC TCACGCGGGG AGCCTTCGCT
CCACCTGCAG GGATGACCAG CTTCGGCTCG ATTGGCGACT CGGCGCCCGA CACCTGGGGC
CGTCGTCTGA TGCAACGTGC CGAGCGGCGG CGTGCCGCGC GGGAGGGTCG CAGTGTTCGA
ACGCTTGCCG AGAGCGATTA CCTGCTGGGA GTGTCTGATG AGACTCGGCT TGGTGCCCTG
CGTTTTCGCC GAGTGGGCGA GACGTCATTT CAGGCGCCGA TTCAGGCTGG CGTTCCTGCC
CTCGTCGAGC TGGGTCGACT GCTTCAGGTC ACAGAGCGTG TCTTGCGTGA CGAGGAAACG
GATGAAGACC TCCAGCTCAT CTTTGCCCCG GGCTCGTCCC TGGGTGGGGC CCGCCCGAAA
GCCTCGGTCA TCGATCAGTA CGGCCACCTC GCGATCGCCA AGTTCCCGAA GGATACCGAC
GAGTACAGTG TGGAGACCTG GGAAGAAATC GCGCTCCGCT TGGCCAACCA GGCTGGCATC
GTGACGCCGC AGCACGAACT GGTCGACGTG GGCGGTCGGG CCGTCATGCT GTCGAGACGA
TTCGATCGAG ATGGCACTAT CCGCATTCCG TTTCTTTCCG CGATGGCAAT GATGGGCGCC
AGGGATGGCG AACCTGGCAG CTATCCCGAA ATGGTCGATG CTCTGACCGC GCATGGCGCT
CAAGGAAAGA CCGACGCGCA CGCACTGTAT CGGCGCGTCG TCTTCAGCGT GATGATCTCC
AATGTCGACG ATCATCTGCG CAATCACGGC TTTCTGTGGT TGGGGAACGC TGGATGGTCA
CTCGCTCCGG CCTATGACCT CAACCCGGTG CCCAGTGACC TCAAGGCGCG GGTGTTAACC
ACGAACATCG ATCTCGACGA GAGCACCTGC TCGGTCGATC TGCTGGAGGC GGCATCCGGT
TATTTTGGGC TCACGCTGGC CAACGCCCGG CGAATCATCA AGGACGTTGC GTCAGTAACC
GCAACCTGGC GGAACACTGC CAGGGCCGTT GGCGCACGTC CGGCCGAGAT CGAGCGTATG
GCCAGCGCCT TCGAGCATGA TGACTTACAG CATGGGCTGG CGTTGTAA
 
Protein sequence
MTEFEVHIDL GGSTFPVGLA RCNRVRGTER ILFEYDAEWL ASPDRFSLEP ALALTRGAFA 
PPAGMTSFGS IGDSAPDTWG RRLMQRAERR RAAREGRSVR TLAESDYLLG VSDETRLGAL
RFRRVGETSF QAPIQAGVPA LVELGRLLQV TERVLRDEET DEDLQLIFAP GSSLGGARPK
ASVIDQYGHL AIAKFPKDTD EYSVETWEEI ALRLANQAGI VTPQHELVDV GGRAVMLSRR
FDRDGTIRIP FLSAMAMMGA RDGEPGSYPE MVDALTAHGA QGKTDAHALY RRVVFSVMIS
NVDDHLRNHG FLWLGNAGWS LAPAYDLNPV PSDLKARVLT TNIDLDESTC SVDLLEAASG
YFGLTLANAR RIIKDVASVT ATWRNTARAV GARPAEIERM ASAFEHDDLQ HGLAL