Gene Csal_2107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2107 
Symbol 
ID4029252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2376671 
End bp2377774 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content63% 
IMG OID637967308 
Producthypothetical protein 
Protein accessionYP_574157 
Protein GI92114229 
COG category[S] Function unknown 
COG ID[COG3021] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATTA CAACGATCAT CGAGACGACG CTGGCACTGC TGGCACTACT GCTCTTCGCG 
GCGACGGCGA TTCCGTGGCT GAACCTGCGC TACTGGTGGG TACGCGGTTT CGACTTTCCC
CGCATGCAAC TGGCGATTCT CGCCGCCGTG ACGCTCCTCG CCCTGCTGAC CACCCTGCCA
TGGGGCGTCT GGGGATGGGT CGGTGCAGTC GCCGCGCTCA TCGTCATAGG AGTACAAGGC
GGCAGCATTT ATCACTGGAC ACCGCTCGCC AGGTGTCATG TGGTCGATGC CGACGGCGAA
GATGCCTCGC GGCAATTCTC GCTGCTGGTC GCCAATGTTC TCACCAGCAA CCGCCAGTCG
GCATCCCTCA TGCGCCAGAT CCGCGAGACC GACCCGGATA TCGTGCTGAC CCTGGAATCG
GATGCCTGGT GGCAGGAGCG CCTCGACGAG ACGCTCGACG AGAGTCATCC GTATGCCACC
CGGATTCCGC TCGACAATCT TTACGGCATG CACCTGTATT CTCGCCTGCC GGTACACGCC
CCTCAGATCG AGTGGCTGAT CCAGGATGAT ATTCCCTCGA TCCACGGCTG GTTCGAACTT
CCCAGCGGCG ACCGTGTGCG GTTTCACGCC GTCCACCCAA GGCCGCCTGC GCCCGGCGAA
AGCGATGAAT CCTTGTGGCG GGATGCCGAA CTCTTGCTGG TCGGTCAGAC GATTCGCGAC
TCGGGGCTCC CCACTCTGGT GGCCGGCGAT CTCAACGATG TGGCATGGTC GAAGACCACC
CGTCTGTTCT GCCGCATCAG CGGAATGCTC GATCCACGCC GGGGTCGCGG TCTCTACAGC
ACCTTTCATG CCGAGTATCG ATGGCTGCGC TGGCCCTTGG ATCATGTCTT CGTCAGCGAA
CATTTCACCC TGGTGGCCCT GCAGCGCCTC TCGGCCTTCG GCTCCGATCA TTTCCCGATC
CTCGCCACCT TTCGCTACCA CCCGGCACGC GCCGACGAAA ACGACAGTCC CGAGGCCGAT
CGCGAAGAAC GCCAGGATGC CGAGGAAACC ATCGAGGAAG CCCACGAACG ACGCGGCGAA
GCGCCCGTCA AGCACGACGA CTGA
 
Protein sequence
MTITTIIETT LALLALLLFA ATAIPWLNLR YWWVRGFDFP RMQLAILAAV TLLALLTTLP 
WGVWGWVGAV AALIVIGVQG GSIYHWTPLA RCHVVDADGE DASRQFSLLV ANVLTSNRQS
ASLMRQIRET DPDIVLTLES DAWWQERLDE TLDESHPYAT RIPLDNLYGM HLYSRLPVHA
PQIEWLIQDD IPSIHGWFEL PSGDRVRFHA VHPRPPAPGE SDESLWRDAE LLLVGQTIRD
SGLPTLVAGD LNDVAWSKTT RLFCRISGML DPRRGRGLYS TFHAEYRWLR WPLDHVFVSE
HFTLVALQRL SAFGSDHFPI LATFRYHPAR ADENDSPEAD REERQDAEET IEEAHERRGE
APVKHDD