Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1159 |
Symbol | |
ID | 4028098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1325439 |
End bp | 1326746 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637966336 |
Product | major facilitator transporter |
Protein accession | YP_573214 |
Protein GI | 92113286 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTACTT CGCAAGAGAC CGACGCGAGT GCGTCGCGCC CGCTGAATCG TCGCGACGTC AAGGTGCTGT CGCTGTCCGC GCTCGGCGGC GCCCTGGAAT TCTACGACTT CATCATTTTC GTGTACTTCG CGACGGTGGT GGGGCAACTG TTCTTTCCCC CTGAAATGCC CGAATGGCTG CGTCAGATCC AGACCTTCGG GATCTTCGCC GCCGGCTACC TGGCACGCCC GCTGGGCGGC ATCATCATGG CGCATTTCGG CGATCTGCTG GGGCGCAAGA AGATGTTCAC CCTGTCGATC TTCCTGATGT CGGTGCCGAC GCTGCTGATC GGCGTGATGC CCACCTACGA CACCCTGGGG TATGCGGCGC CCTTGCTGCT GGTGGCGCTG CGCATCCTGC AAGGGGCCGC CGTAGGCGGT GAAGTCCCCG GCGCCTGGGT GTTCGTCACC GAGCACGTCA AGCGCCACCA TGTGGGCTTC GCCTGCGGCA CGCTGTCCGC CGGCCTGGTA TCGGGCATTC TCATCGGCTC GCTGATGTCG GCGTTCATCA AGACCACCTA CAGTGATGCG GAGCTGGCCG CGTATGCCTG GCGGATTCCG TTCTTGATCG GCGGGGTGTT CGGATTGCTG GCCGTGTATC TGCGGCGCTG GCTGCACGAG ACGCCGGTCT TCGCCGAAAT GCAGCAGAAA AAGGCGCTGG CCGAGGAACT GCCGGTCAAG AGCGTCTTGC GCAACCACTT GCCGAGCGTG GTGCTGTCGA TGGGCGTGAC CTGGATTCTC ACCGCCGCCA TCGTGGTGGT GATCCTGATG ACGCCGAGTC TGCTCGAGAC GCGTTATGGC CTGGATGCCT CGCTGGCCAA TGTGTATGCC ATTCTCGGTG TCGTGGTGGG GAGTCTGGCT TCCGGCTGGT GCGCGGATCG CCTCGGCAGC GGCCCGACCA TCGCCTTTTG GGGCGTGCTG CTGGCCATCA GCTACTGGGT GATGATGACC ACCGTGAGCA CGCACCCCGA ATGGCTGACG CCGCTCTATA TCCTGAGCGG CTTCGCGGTG GGGATCGTCG GTGTGGTACC CACCATCGCC GTCAAGTCGT TTCCGGCGGT GGTGCGTTTC ACGGGACTGT CGTTTTCCTA TAACGTCGCC TACGCCATCT TCGGTGGCTT CACGCCCATC GTGGTGTCGG TATTGATGAC CGTGCATCCG CTGTTCCCTG CGGTCTACGT CGCCGCGCTG GGCGGGCTGG GCGTGATCAT CGGCGTGTAC TTGATGCAAA CGTCCAGCGG GCGCCGACTG GCGGTCATGC CATCGTGA
|
Protein sequence | MATSQETDAS ASRPLNRRDV KVLSLSALGG ALEFYDFIIF VYFATVVGQL FFPPEMPEWL RQIQTFGIFA AGYLARPLGG IIMAHFGDLL GRKKMFTLSI FLMSVPTLLI GVMPTYDTLG YAAPLLLVAL RILQGAAVGG EVPGAWVFVT EHVKRHHVGF ACGTLSAGLV SGILIGSLMS AFIKTTYSDA ELAAYAWRIP FLIGGVFGLL AVYLRRWLHE TPVFAEMQQK KALAEELPVK SVLRNHLPSV VLSMGVTWIL TAAIVVVILM TPSLLETRYG LDASLANVYA ILGVVVGSLA SGWCADRLGS GPTIAFWGVL LAISYWVMMT TVSTHPEWLT PLYILSGFAV GIVGVVPTIA VKSFPAVVRF TGLSFSYNVA YAIFGGFTPI VVSVLMTVHP LFPAVYVAAL GGLGVIIGVY LMQTSSGRRL AVMPS
|
| |