Gene Csal_0795 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0795 
Symbol 
ID4026098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp889936 
End bp891225 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content65% 
IMG OID637965961 
Productsodium:dicarboxylate symporter 
Protein accessionYP_572851 
Protein GI92112923 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCGAC ACCATCATGG CCAAACACAG GGAGAAGGAG CGGTGAAAGC ACTGCTGCAT 
GCTTATTTGA ATGCCTCGTT GATTCTGCGC GTCACGCTGG CGCTGATACT GGGTGTCGTG
GTGGGCCTGG CAGGTGGCCA GAGCGTGGCC GACTGGCTGG CACCGTTCGG CGATCTATTG
CTGCGCCTGC TGAAGTTCCT GATCGTGCCG ATCGTGCTGT TCACCCTGCT GGTGGGCATC
AACCAGGCCA GTGCCGGGAG CATGGGGCGC ATCGGGCGCA AGGTGTTTTT CTATTACGTC
GGGACCTCGG CGCTGGCGAT CGTCGTGGGG CTCACGGTGG CCACGTTGCT GTCGCCCGGC
AGCGGCATTA CGCTGGACGA CAGCGCCGAG GTATCGGTGC CCGAGAATCC CGGTATCGCG
CAGGTAGTGC TGGGCGTGGT GCCGGACAAT ATCATCGGTG CCTTTGCCGA GCTGAACCTG
CTGGGCATCA TCTTTACCGC CATCGTCTTC GGCATTGCCT TGTTGAAGCT GCGCGCGTCC
GAGTCCCACG GCGCGCTGGC CGAGCAGCTG TTCCGGGTCA TCGAGGCGCT CAACGAGGTC
ACGCTCAAGG TGATGTCGGG GGTGCTGCAT TACGTGCCGA TCGGGGTGTT CGCCATCGTG
GCCGGCACCG TGGCCGAGCA GGGAATGGCT ACGCTGCTGT CGCTGGGCGA CATGGTGCTG
GTGCTGTACC TCGCGCTGGG CGTGCATGTC CTGCTGTATT GCGTCTTGAT GGGCGTCTTC
GGCGTCAAGT TGCGGGATTT CTTTCGCGAA GCGCGCACGC CGATGGCGAC CGCCTTCGCC
ACCCAGAGCA GCTCCGGCAC GCTGCCGCTG ACGCTGGACG CGGCCCGGCG CATGGGGCTG
TCGCGTGGGG TCTACGGCTT CAGCCTGCCG CTGGGGGCGA CCATCAACAT GGATGGCGCC
GCCATTCGCA TCGCCATTTC CGCCGTGTTT GCGGCCAACG TGATCGGCGC GCCGCTGGAT
TTTGCCAGCA TGCTGGAAAT CGTGCTGATC GGCACCCTGG TCTCCATCGG GACCGCCGGC
GTGCCGGGCG CGGGCATCAT CATGATCGCC ACGGTGTTCT CCCAGGTGGG CCTGCCCATC
GAGACGGTAG CGTTGCTGAC CGCCATCGAT GCCCTGGTGG GCATGGGCTG CACGGCGCTC
AACGTCACCG GCGACATGGT AGGGGCGTCG ATCATCGGGC GCAGCGAGCG GACACGGAAT
GCCGAGTGCG TCGATGCGCC GCAAGCGTAG
 
Protein sequence
MERHHHGQTQ GEGAVKALLH AYLNASLILR VTLALILGVV VGLAGGQSVA DWLAPFGDLL 
LRLLKFLIVP IVLFTLLVGI NQASAGSMGR IGRKVFFYYV GTSALAIVVG LTVATLLSPG
SGITLDDSAE VSVPENPGIA QVVLGVVPDN IIGAFAELNL LGIIFTAIVF GIALLKLRAS
ESHGALAEQL FRVIEALNEV TLKVMSGVLH YVPIGVFAIV AGTVAEQGMA TLLSLGDMVL
VLYLALGVHV LLYCVLMGVF GVKLRDFFRE ARTPMATAFA TQSSSGTLPL TLDAARRMGL
SRGVYGFSLP LGATINMDGA AIRIAISAVF AANVIGAPLD FASMLEIVLI GTLVSIGTAG
VPGAGIIMIA TVFSQVGLPI ETVALLTAID ALVGMGCTAL NVTGDMVGAS IIGRSERTRN
AECVDAPQA