Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0089 |
Symbol | |
ID | 4026011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 112276 |
End bp | 113766 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637965240 |
Product | Na+/solute symporter |
Protein accession | YP_572152 |
Protein GI | 92112224 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000762337 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGTC ACGTATTCCT CGGTGCCTTC GTCACCTACG TGGTGGCCAT GATCGCGTTC GGGTGGTGGG TCTCCCGCCA TAGCCGAAGC AATGGCGATG ACTTCCTGCT CGGTGGGCGC AGCATCCCGA TCTTCCTGAC GATCGGGACC ACCGTCGCCA CCATGGTGGG TACCGGCTCG AGCATGGGCG CGGTCGGCTT CGGCTATGCC AACGGCTGGG CCGGGGCCCT CTACGGCATC GGCGGCTCCA TCGGCGTGCT GCTGCTGGCG GCGTGGTTCG CCCCCGTACG CAAGCTGCGC TTCATGACCA TGAGCGAGGA GCTTTCCTAT TATGTCGGCG CCAACCGCTG GGTGCGCAAC ATCGTCGCGG TGCTGATCTA CATCGCCTGT ATCGGCTGGC TCGGCGCGCA CATTCTCGGT GGCGGGCTCT ATCTATCCTG GATGGCCGAC ATCGACCTCA CCACCGCCCG CGTTCTGGTG GCGCTGGGCT TCGGCATCTA CTGCGTGATC GGCGGTTACA TGGCCGTGGT GTGGACCGAT ACCGTCCAGG CGGTGATATT GTTCGTCGGC TTCATCGTGA TGGCCATCAT CGCATTGTTC GAAGTGGGTG GATTCTCGGG ACTGGGCGCC AACATGGACG TCGCCACCAC CAGTTTCCTG GGCATCGACA AGATCGGCGT GGTGCCGGCA CTGTCGCTGG CGGCGGTCAT CGCCGTCGGC GTGCTGGCAA CCCCCTCCTA CCGTCAGCGC ATCTACTCCG GCAACTCGGT CCATTCCGTA CGCAAGTCAT TCGTGATCAC CGGCATTCTT TACATGATCT TCTCGATCAT CCCGGCGATC ATCGGCATGG CCACCCACGC GCTCAACCCC GGTCTGGACA ACAGCAACTT CGCCTTTCCG TTTCTGGCGA CCGAGATCCT GCCGCTGTGG CTGGGTTTGT TGCTGCTCGT CGCGGGCCTC TCGGCGACCA TGTCGAGCGC CAGCTCGGAC GCCATCGCCG GCGTCTCGAT CCTGTTGCGC GACGTCTACG TGCTGTTCAC CGGGCATACG CCTTCGGCAC ACAAGGTCGT CTTGCTCTCG CGTCTCGCGC TGATCGCCAC CATCGGCCTG GCGCTGCTGT TCGCATTGAC CTCCGACAAC GTCATCGACT ACATCACCAG CATGATCGCC ACGGTCATGG CCGGCATGTT CGTGTGTGGC GTGCTGGGTC GCTTCTGGCC GCGCTACAAC TGGCAGGGTG CCGTCGCCAC GCTCATCGCG GCCTCCGCCA CCTCGCTCGC CGTCATCGGC GTCGATAGCT GGAGTGCCTT CTGGGGCAAC CCGAGCATTC CCGCCGTGCT GGTCGCACTC CTCGTGGGTG TCGTGGTCAG CCTGATCACC CCCGCTTCGA AGGTGACACC GGAGCAGGCG CTCAAGATCA TCGACGACGA GCGTGAGCGC ATGGAGATGA CCGAGGACCC CGCCGAGGCC ATTCATCACA AGACGGCATA A
|
Protein sequence | MNSHVFLGAF VTYVVAMIAF GWWVSRHSRS NGDDFLLGGR SIPIFLTIGT TVATMVGTGS SMGAVGFGYA NGWAGALYGI GGSIGVLLLA AWFAPVRKLR FMTMSEELSY YVGANRWVRN IVAVLIYIAC IGWLGAHILG GGLYLSWMAD IDLTTARVLV ALGFGIYCVI GGYMAVVWTD TVQAVILFVG FIVMAIIALF EVGGFSGLGA NMDVATTSFL GIDKIGVVPA LSLAAVIAVG VLATPSYRQR IYSGNSVHSV RKSFVITGIL YMIFSIIPAI IGMATHALNP GLDNSNFAFP FLATEILPLW LGLLLLVAGL SATMSSASSD AIAGVSILLR DVYVLFTGHT PSAHKVVLLS RLALIATIGL ALLFALTSDN VIDYITSMIA TVMAGMFVCG VLGRFWPRYN WQGAVATLIA ASATSLAVIG VDSWSAFWGN PSIPAVLVAL LVGVVVSLIT PASKVTPEQA LKIIDDERER MEMTEDPAEA IHHKTA
|
| |