Gene Information |
Locus tag | Csal_0871 |
Symbol | |
ID | 4027874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 978229 |
End bp | 980010 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637966040 |
Product | Na+/solute symporter |
Protein accession | YP_572927 |
Protein GI | 92112999 |
COG category | [R] General function prediction only |
COG ID | [COG4147] Predicted symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family; [TIGR03648] probable sodium:solute symporter, VC_2705 subfamily |
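The coordinate and length fields in this record are consistent with one another, and the relationship can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues: codon count minus one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(978229, 980010)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the sketch gives 1782 bp and 593 aa, matching the Gene Length and Protein Length fields.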
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAAT TTGCCATCAA TATCCTGTTC GTCGGCGCCT CCTTCGCCCT CTACATCGGT ATCGCGATCT GGGCCAAGGC GGGCTCGACC AAGGACTTCT ATGTCGCCGG GGGCGGTGTC CACCCGATCA CGAACGGCAT GGCCATCGGG GCCGACTGGA TGTCGGCCGC CTCCTTCATT TCCATGGCGG GCCTCATCGC CGCTGGAGGG TATGCCAACT CGACCTTCCT CATGGGCTGG ACCGGCGGTT ATGTACTCCT GGCCCTGCTG TTGGCGCCCT ACCTGAGAAA ATTCGGCAAG TTCACGGTAC CCGAGTTCAT CGGCGACCGT TTCTACAGCC GTACGGCACG CATGGTGGCC GTCGTCTGTC TGATCGTGGC GTCCCTGACA TACGTCATCG GTCAGATGGC GGGGGCAGGC GTGGCGTTCT CGCGCTTTCT GGAAGTCGAT GCGACCACCG GCCTGATCAT CGCCGCCGTC GTGGTGTTCT TCTACTCGGT GCTGGGGGGC ATGAAAGGCA TCACCTATAC CCAGGTGGCC CAGTACATCG TCCTGATCGT CGCCTATACC ATTCCAGCAG TGTTCATCTC ACTGGAACTG ACCGGCAACC CGCTGCCGCC GCTCGGCCTG TTCTCCGACC ATAGCGCGTC GGGCGAACCG CTGCTCGCCA AGCTCAACGA GGTCGTGACC GCACTGGGCT TCAACGCCTA CACCGCCATG GTCGACAACC AGCTCAACAT GGTGCTGTTC ACGCTCTCGC TGATGATCGG CACGGCCGGC TTGCCGCACG TCATCATGCG CTTCTTCACC GTGCCCAAGG TCGCCGATGC ACGCTGGTCC GCCGGCTGGG CGCTGGTGTT CATCGGTCTG CTCTATCTGA CCGCACCGGC GGTGGCCTCC ATGGCACGCT TGAACCTGAT GACCACCATC TACCCGGACA TGGCGGGTCA GGTCGAGAAT TACGACGCCG CCGCCTCCAA CGCCATCGCC TACGGAGACC GTCCCGACTG GATTCGGACC TGGGAAGAAA CCGGGCTGAT CACCTTCGAG GACAAGAACA ACGACGGCAA CATCCAGTTC TACAACGACA GCACCGATTT CTCGGATCGC GGCTGGCAAG GCAACGAGTT GACCGTCAAC AACGACATCC TGGTCCTGGC CAACCCGGAG ATCGCCAACC TGCCCGGCTG GGTCATCGGG CTGATCGCCG CGGGCGGCCT GGCAGCGGCG CTCTCCACGG CAGCGGGCCT GCTGCTGGCC ATCTCGTCGG CGATCAGTCA TGACCTGATA AAGGGGGCCA TCAATCCCAA CATCACGGAA CGCGGGGAGC TGAAAGCGGC ACGCATATCG ATGTCGATCG CCATCGTGGT CGCCACGTAC CTGGGGGCCA ATCCGCCAGG GTTCGCGGCA CAGGTCGTGG CGCTGGCCTT CGGCATCGCG GCCGCCTCGC TGTTCCCGGC GCTGATGATG GGGATCTTCT CCAAGCGCGT GAACAACAAG GGCGCCATCG CCGGGATGTT GTCGGGGCTG ACCTTCACCC TTGTCTACAT CTTCGTTTAC AAGGGCTGGT TCTTCATCCC CGGCACCAAC AACCTGGCCG ACACTCCGGA AAACTGGGTA CTGGGCATCT CCCCGCTCTC GATTGGTGCA GTGGGTGCCA TCATCAACTT CGCGGTGGCC TACCTCGTCT CCAAATCCTC CGAGGAGCCG CCCCTGGAGA TCCAGGAACT GGTGGAAAGC GTGCGCTACC CGAAAGGTGC TGGTGGTGCC GTCAATCACT AA
|
Protein sequence | MSQFAINILF VGASFALYIG IAIWAKAGST KDFYVAGGGV HPITNGMAIG ADWMSAASFI SMAGLIAAGG YANSTFLMGW TGGYVLLALL LAPYLRKFGK FTVPEFIGDR FYSRTARMVA VVCLIVASLT YVIGQMAGAG VAFSRFLEVD ATTGLIIAAV VVFFYSVLGG MKGITYTQVA QYIVLIVAYT IPAVFISLEL TGNPLPPLGL FSDHSASGEP LLAKLNEVVT ALGFNAYTAM VDNQLNMVLF TLSLMIGTAG LPHVIMRFFT VPKVADARWS AGWALVFIGL LYLTAPAVAS MARLNLMTTI YPDMAGQVEN YDAAASNAIA YGDRPDWIRT WEETGLITFE DKNNDGNIQF YNDSTDFSDR GWQGNELTVN NDILVLANPE IANLPGWVIG LIAAGGLAAA LSTAAGLLLA ISSAISHDLI KGAINPNITE RGELKAARIS MSIAIVVATY LGANPPGFAA QVVALAFGIA AASLFPALMM GIFSKRVNNK GAIAGMLSGL TFTLVYIFVY KGWFFIPGTN NLADTPENWV LGISPLSIGA VGAIINFAVA YLVSKSSEEP PLEIQELVES VRYPKGAGGA VNH
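The two sequence fields are displayed in space-separated groups of ten characters. The following is a minimal sketch, not the database's own tooling, of how the grouped text could be cleaned, translated, and summarised. All helper names are hypothetical. The codon table uses the standard NCBI ordering; translation table 11 shares its amino acid assignments with the standard code, and the sketch does not handle alternative start codons, which is acceptable here because the gene sequence begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; same
# amino acid assignments as translation table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def strip_groups(seq: str) -> str:
    """Remove the whitespace that splits a sequence into display groups."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = strip_groups(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a sequence."""
    dna = strip_groups(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 15 bases and last 12 bases of the gene sequence above.
print(translate("ATGAGTCAAT TTGCC"))
print(translate("GTCAATCACT AA"))
```

The two fragments translate to the first five and last three residues of the protein sequence shown above. Applied to the complete gene sequence, `translate` would be expected to reproduce the full protein, and `gc_percent` can be compared against the GC content field.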