Gene Clim_1794 details


Gene Information

Locus tag: Clim_1794
Symbol:
ID: 6354623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1971254
End bp: 1973038
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 56%
IMG OID: 642669397
Product: Na+/solute symporter
Protein accession: YP_001943812
Protein GI: 189347283
COG category: [R] General function prediction only
COG ID: [COG4147] Predicted symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
            [TIGR03648] probable sodium:solute symporter, VC_2705 subfamily
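
The coordinate and length fields in this record are linked by simple arithmetic. The following is a minimal Python sketch of that consistency check, not part of the original record. It assumes that the start and end coordinates are inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1971254
end_bp = 1973038
gene_length_bp = 1785
protein_length_aa = 594

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
predicted_protein_aa = codons - 1

print(span_bp, predicted_protein_aa)
```

If both assumptions hold, the computed span matches the Gene Length field and the predicted residue count matches the Protein Length field.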


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0136163
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGTAC AAACGTGGAC GTATATCATC GTAGGCGCTA CGTTTCTGAT CTATATCGCA 
ATCGCGATAT GGGCAAAAGC TGGCTCTACA AAGGAATTCT ACGTTGCCGG TGCAGGCGTT
CCGCCTATCA TCAACGGCAT GGCAACCGCT GCGGACTGGA TGTCGGCGGC ATCATTCATC
TCGATGGCCG GTCTTATCTC CTTTATGGGT TACGACGGCT CGGTTTACCT GATGGGCTGG
ACGGGCGGTT ACGTGCTGCT CGCGCTTCTG CTCGCTCCTT ACCTGAGAAA GTTCGGCAAG
TTCACCGTTC CCGACTTCGT CGGCGACCGA TACTACTCCA ACGTAGCGCG TACGGTAGCC
GTTATCTGCG CAATCTTCGT CTCCTTCACC TATGTCGCAG GCCAGATGCG CGGCGTCGGC
GTGGTGTTCT CCCGCTTCCT CGAAGTCGAT ATCAACACCG GCATCCTGCT CGGCATGGGC
ATCGTCTTCT TCTATGCGGT GCTCGGAGGC ATGAAAGGCA TCACCTACAC CCAGGTTGCC
CAGTACTGGG TACTGATTTT CGCGTACATG GTTCCCGCCA TCTTCCTCTC AATCATGATC
GCCGGCAATC CCATCCCGCA GTTCGGCATG GGCGGCGCAG GCTCTGACGG TGTTTACCTG
CTCGACAAGC TCGACGGCCT GCATCAACAA CTCGGGTTTG CGGCCTACAC AACCGGCTCG
AAACCCATGA TCGACGTATT TGCCATTACC GTTGCCCTTA TGGTCGGCAC CGCCGGTCTG
CCGCACGTTA TTGTCCGTTT CTTCACCGTG CCGAGAGTCC GTGACGCTCG CATCTCTGCC
GGATGGGCGC TTATCTTCAT CGCCCTGCTC TACACCACGG CGCCGGCAAT CGCCACCTTC
GCCCGCCTCA ACCTGATCCA GACCGTCAGC AACAACAGCT ACACCGACAT GCCCGGCTGG
TTCAAAAAGT GGGAAAAAAC CGGTCTGCTT GCATGGATGG ACAAGAACAA CGACGGCAAA
ATTCAGTACA CCGGCAAGAA CGCCGGCGGC GGAGATCCGT TTGAAGGCAA GAAACCGGAA
TTCACCAAGG AAAAAGGGCA GCACGGAGAG CTTCTCATGT CGAACAAGCC CACGGACAAC
TCCAATGAGC TCTTCATCGA CAAGGACATC ATGGTGCTCG CCAACCCGGA AATCGGCAAC
CTGCCGAATT GGGTAATCGC ACTGGTAGCC GCCGGCGGTC TTGCTGCAGC TCTATCGACG
GCAGCGGGTC TCCTGCTGGT CATCTCGACC TCTATTTCGC ACGATCTCAT CAAAAAGCAG
ATCAATCCGA ACATCAGCGA AAAAGGCGAA CTGATGTACG CCCGTATCGC GGTCGGCGTA
GCCATCGTGG TTGCCGGATA CTTCGGCATC AACCCTCCCG GATTCGTGGC CGAAGTGGTG
GCCTTCGCCT TCGGTCTTGC TGCGGCCTCT TTCTTCCCGG TCATCATTCT CGGCATCTTC
TCCAAGAGAA TGAACAAGGA GGGAGCCATC TCGGGCATGA TCACCGGACT GCTCTTCACT
GCGGCCTACA TCGTCTATTT CAAATTCATC AGTCCGGATA TGAACAAGCC GGAATTCTGG
TGGTTCGGCA TATCACCGGA AGGCATCGGA ACTCTCGGCA TGCTGATCAA CGTTGCCGTG
AGCTTTGTCG TTTCGCGCAT CACCCCTGCA CCTCCCCAGG AAATCCAGGA GCTGGTCGAC
AGCCTGAGAT ACCCGAAAGG AGCCGGAGAG GCTTCTGCAC ACTGA
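
The gene sequence above is displayed in blocks of 10 bases, so the spaces and line breaks must be removed before any analysis. The following is a minimal Python sketch of that cleanup, with a simple GC-percentage helper. The function names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tool. Only the first and last display lines are embedded here, so the sketch does not reproduce the whole-gene GC content reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last display lines of the gene sequence shown above.
first_line = "ATGGATGTAC AAACGTGGAC GTATATCATC GTAGGCGCTA CGTTTCTGAT CTATATCGCA"
last_line = "AGCCTGAGAT ACCCGAAAGG AGCCGGAGAG GCTTCTGCAC ACTGA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

# The CDS should begin with a start codon and end with a stop codon.
print(len(head), head[:3], tail[-3:])
```

To check the reported GC content, paste the complete gene sequence into `clean_sequence` and pass the result to `gc_percent`.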
 
Protein sequence
MDVQTWTYII VGATFLIYIA IAIWAKAGST KEFYVAGAGV PPIINGMATA ADWMSAASFI 
SMAGLISFMG YDGSVYLMGW TGGYVLLALL LAPYLRKFGK FTVPDFVGDR YYSNVARTVA
VICAIFVSFT YVAGQMRGVG VVFSRFLEVD INTGILLGMG IVFFYAVLGG MKGITYTQVA
QYWVLIFAYM VPAIFLSIMI AGNPIPQFGM GGAGSDGVYL LDKLDGLHQQ LGFAAYTTGS
KPMIDVFAIT VALMVGTAGL PHVIVRFFTV PRVRDARISA GWALIFIALL YTTAPAIATF
ARLNLIQTVS NNSYTDMPGW FKKWEKTGLL AWMDKNNDGK IQYTGKNAGG GDPFEGKKPE
FTKEKGQHGE LLMSNKPTDN SNELFIDKDI MVLANPEIGN LPNWVIALVA AGGLAAALST
AAGLLLVIST SISHDLIKKQ INPNISEKGE LMYARIAVGV AIVVAGYFGI NPPGFVAEVV
AFAFGLAAAS FFPVIILGIF SKRMNKEGAI SGMITGLLFT AAYIVYFKFI SPDMNKPEFW
WFGISPEGIG TLGMLINVAV SFVVSRITPA PPQEIQELVD SLRYPKGAGE ASAH