Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0203 |
Symbol | |
ID | 4027168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 222948 |
End bp | 224279 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637965354 |
Product | major facilitator transporter |
Protein accession | YP_572266 |
Protein GI | 92112338 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.371325 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGCCG CCACCACTCG TCCCACCACC TTGTCCGCTG CCTGGAGCCT ATGGACGCTG CTGCTCGGCG TGGGGCTTTT GATGATGGGC AATGGCCTGC AGGGCTCGCT GCTCGGGGTG CGGGCCTCGG CGGCGGATTT CGGCAACACG GTCACGGGGC TGGTGATGTC GTCCTACTTC GTGGGCTTTC TGGTCGGCTC GGTGCTGACG CCCCGCAAGC TGCGTGAGGT CGGACACGTG CGGGTCTTCG CCGCGCTGGC CTCGATCACC TCGGTGGCGA TCCTGATTCA TGCCCTGTTC GTCGAGCCTC TGGTGTGGGC GGCCATGCGC TTTGTCACCG GCTTTACGTA CGCCGGACTC TACGTCGTCG CCGAAAGTTG GCTCAACGGT TATGCCTCCA ACCGGCTGCG CGGGCGCCTG CTGGCGATCT ACATGGTGAT CAGCTATCTC GGCATGGGCG GCGGGCAACT GCTGCTGGGC AGTGCCGACC CCTCGGGCAT GGCGTTGTTC CTGCTGGTGT CGATCCTGGT CTCGCTCGCC CTGGTGCCGA TCTTGATCAG CTACACGCCG CAGCCCGAAC TCAGTCAGCC CGAGGCGATG AGCCTGCGCG CGTTGTACCG GCTCTCGCCG CTGGGCACGG TGGGCTGCTT CATGACCGGC ATCACCAATG GCGCCGTGTT CGGCATGGGG GCGGTGTTCG CCACCAACAG CGGCTTGAGC GTCGCCCAGG TCTCGGTCTT CATGAGCGCC TTCATCTTCG GCGGCGCGAT CCTGCAGTGG CCGCTGGGCA AGTTGTCCGA CAAGGCCGAC CGCCAGTGGG TGATCGTCGG CGTCGCCCTG GTCGCGGTGA TGCTGGCGCT GGTTGGTGCG CTCGTCAGCG GCTGGTCGCC GATGGCCTTG ACGCTGCTCG GGGCGCTGCT GGGCGCGACG ACACTCACGC TCTACTCGAT CTTTCTGGCC TGCGCCAATG ACTTCCTCAC CGATCAGCAG ACCGTGGCGG CCAGTGCCAG TCTCGTTCTG GCGTTGGGCA TCGGCGCGAT TCTGGGACCG GCCAGCGCCG GGGTGCTGAT GGAATGGCTG GGACCGGATG GTTTCCTGTG GGACCTGGCC GTCATGCACA TCGTCATGGT GTTGTTCGGG CTTTACTGCA TTCGTCACTA TCCCACCAGC GAATCGCCCG AGCAGGGCCA CTACGTGATG GTGGCCTCGG ACACCACGCC GCTGGGCACG GCCTGGACGG AAGAAGCCGC GCAGGAAGAA GGGCAACTGG AACTGGCGCT GGAGCTGGAA GGTGAGGGTG ACGATGAGAG TGCCGAGGGC ATGACGAGAT AG
|
Protein sequence | MKAATTRPTT LSAAWSLWTL LLGVGLLMMG NGLQGSLLGV RASAADFGNT VTGLVMSSYF VGFLVGSVLT PRKLREVGHV RVFAALASIT SVAILIHALF VEPLVWAAMR FVTGFTYAGL YVVAESWLNG YASNRLRGRL LAIYMVISYL GMGGGQLLLG SADPSGMALF LLVSILVSLA LVPILISYTP QPELSQPEAM SLRALYRLSP LGTVGCFMTG ITNGAVFGMG AVFATNSGLS VAQVSVFMSA FIFGGAILQW PLGKLSDKAD RQWVIVGVAL VAVMLALVGA LVSGWSPMAL TLLGALLGAT TLTLYSIFLA CANDFLTDQQ TVAASASLVL ALGIGAILGP ASAGVLMEWL GPDGFLWDLA VMHIVMVLFG LYCIRHYPTS ESPEQGHYVM VASDTTPLGT AWTEEAAQEE GQLELALELE GEGDDESAEG MTR
|
| |