Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1248 |
Symbol | |
ID | 4027626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 1424136 |
End bp | 1425626 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637966426 |
Product | Sodium/proline symporter |
Protein accession | YP_573302 |
Protein GI | 92113374 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family [TIGR02121] sodium/proline symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.126391 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTATTG GTGTCTGGAT AAGCCTTTTC GCCTACTTTG CGCTCATGAT CGGCATCGGC CTTTATGCCA TGCGCAAGTC CACGTCCACA TCCGAAGACT ACATACTGGG TGGCCGTACC CTCAGTCCCA AAGTGGCGGC GCTGTCGGCC GGCGCTTCGG ACATGAGCGG CTGGCTGCTG CTGGGGCTGC CCGGGGCAAT GTTCGTCTCC GGCCTGGGCT CGGCATGGAT CGGCATCGGC CTGCTGGTGG GGGCTTTCTT CAACTGGCTC CTGGTCGCGC CACGACTGCG TGAACAGACG GTTCACTATG GCAACGCGAT CACCATCCCC TCCTTCCTGG CCAACCGGTT CCCGACCCAG GCCTTGTCGT TGAGGACGGT TTCCGCCATC GTCATCGTCG TCTTCTTCGC GGTCTACACG GCGTCCGGCC TGGTGGCAGG CGGCAAGCTG TTCGAAAGCG CGTTTTCCGG GGTCATCAAC ATCGGCGGGC TCAGCGATTA CGCGGTAGGC ATCATCATCA CCCTGGGCGT GGTGCTGGTC TACACGGTCG TCGGCGGCTT CCTGGCCGTG AGCATGACGG ACTTCGTGCA GGGCTGCATC ATGATGCTGG CGCTGGTGAT CATGCCGGCA GTCGTGATCT TCGGCGAAGG CGGCGGCGGC TTCTCCCAGG CCTCGCAGAC GCTGAACGAA GTCGACCCCA CGCTCTTGTC ATGGACGGAC GGCCTGACCT TCATCGGCTG GTTGTCCGCC GTGACCTGGG GCCTGGGCTA TTTCGGTCAG CCGCACATCA TCGTGCGCTT CATGGCCATC CGGTCGCTGA AGGACGTGCC CGTCGCCCGC AACATCGGCA TGAGCTGGAT GCTCATTTCC CTGATCGGCG CGATCTCGCT CGGGATCTTC GGCCGCGCCT ACGCCATCCG CAATGGCATG GACGTAGAGG ATCCGGAAAC GATCTTCATC ATCCTGGCGG ACCTGCTGTT CCACCCGCTG ATCACCGGCT TCCTCTACGC CGCATTGCTG GCCGCGATCA TGAGTACCGT GTCCAGCCAG TTGCTGGTGG CCTCCTCGTC GCTGACCGAA GACTTCTACC GCCTGTTCCT GCGCAAGGAA GCCACGGAGA AAGAGACCGT CGGCATCGGC CGGGTCAGTG TCGTGCTGGT CGGCCTGGTG GCGGCCGTGA TTGCATCCGA TCCGGACTCT CAGGTGCTGG GACTGGTGAG CAACGCCTGG GCAGGTTTCG GCGCGGCATT CGGTCCGCTG ATCATCCTGT CGCTGATGTG GTCGCGCACG AACGGTGCCG GCGCCATCGC GGGCATGGTC GTGGGTGCCG CTACGGTCAT GATCTGGATT TCTCTGGGCT GGAACGCATC GTTCATGGGC GGTCCCGGCG TTTACGAGAT CATCCCCGGC TTCATCGCTG CCTTCATCGC GATCCTGGTG GTGAGCAGCC TGACCAGCGA CGCCGGAGAA TACAAGCACA TCTCCCGCTG A
|
Protein sequence | MAIGVWISLF AYFALMIGIG LYAMRKSTST SEDYILGGRT LSPKVAALSA GASDMSGWLL LGLPGAMFVS GLGSAWIGIG LLVGAFFNWL LVAPRLREQT VHYGNAITIP SFLANRFPTQ ALSLRTVSAI VIVVFFAVYT ASGLVAGGKL FESAFSGVIN IGGLSDYAVG IIITLGVVLV YTVVGGFLAV SMTDFVQGCI MMLALVIMPA VVIFGEGGGG FSQASQTLNE VDPTLLSWTD GLTFIGWLSA VTWGLGYFGQ PHIIVRFMAI RSLKDVPVAR NIGMSWMLIS LIGAISLGIF GRAYAIRNGM DVEDPETIFI ILADLLFHPL ITGFLYAALL AAIMSTVSSQ LLVASSSLTE DFYRLFLRKE ATEKETVGIG RVSVVLVGLV AAVIASDPDS QVLGLVSNAW AGFGAAFGPL IILSLMWSRT NGAGAIAGMV VGAATVMIWI SLGWNASFMG GPGVYEIIPG FIAAFIAILV VSSLTSDAGE YKHISR
|
| |