Gene Information |
Locus tag | Csal_1821 |
Symbol | |
ID | 4027378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 2071097 |
End bp | 2072431 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637967010 |
Product | major facilitator transporter |
Protein accession | YP_573872 |
Protein GI | 92113944 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
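The coordinate and length fields in this table can be checked against each other. The short sketch below does that arithmetic. It is illustrative only: the variable names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
start_bp = 2071097
end_bp = 2072431

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein is one residue shorter than the codon count.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under those assumptions the result agrees with the listed Gene Length (1335 bp) and Protein Length (444 aa).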
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000983336 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCGCAGCT TGCCCGCAAC CCTTCCCATT CTATTTCTTA GCGAAATCAG TTTCTTGCTT GGCCATGGGC TGATCATGAC CTTGCTCGGG GTGCGCATGT CGCTGGAGGG GTTCCCCTCC CAGATGGCGG GTTTGCTGAT GTCGAGTTTC TCGTTGGGCT TCGTGATCGG AAGCTATCTG ATCGAGAAGC GTATTCGCAC GGTGGGGCAC ATTCGGGTTT TCGCGGCGTG TGCCGCGGTA CTGGCCGTGA CGGCGATGTT GCATGGGCTC TGGGTCAATC CCTGGTCGTG GCTGGTCTGG CGACTGTTCG GGGGCGCAGC GACCGCGGGG TTGCTGATGG TGATGGAGTC CTGGGTCTCG GGCGAATCGA CCAACGACAA TCGCGGGCGT GTGCTCGGCT GGTACCTGGT CATCTCGACA TCGTCGTTGG CGCTGGGGCA ATGGCTGCTC AATGCCGCCG ACCCCGCAAC CTTCGTGCTG TTTTCCGTGT CCGGCATTCT CTTCGCACTG TCGCTGGTGC CCCTGTCGAT TTATCGTATC CACGGGCCGA CGCGTTCGTC GCACGATGTC TCGCGCAAGA TTTCACTGAA GGAACTGTAT CGGCGTGCGC CAGTGGGGCT GGTGAGTGCC TTCACGGCGG GGCTGATGGT GCAGGCCTTC TTCGCCATGA CGCCTTTCTA CGGGCAGGAA ATCGGTCTGT CGACGTCACA GACCGCGCAA TTCATGGCCA TCACCACGCT GGTCGCGCTG GTGGCGCAGT GGACCCTGGG GCGCATTTCC GATCGTTTCG ACCGCCGCAA GGTGATTCTC TGCATGGCGC TGGTCATGGC CATCTCGGGG GCCATGATCT CGGTCGCGGC GCGTTTCGAC TTCTGGGTGC TGCTGGTGGT GGCCTGCTTC CATACCGCCA TGCTGCATAC GCTGTATTCC TTGAGCCTGT CGCATACCAA CGACTGGCTC GAGCCGGAGG AAACCATTCA AGCCAACGCC AAGCTCTTGA TCTGGTATGG CATCGGTTCG GTGATCGGGC CTTACAGCGC TTCACTGATC ATGGAGCTGA CCGGGCCGGA CGGGCTGTGG CTTTTCCTGG GCGGCGTGGC GTTGACGCTG GCCATGTTCG TGATGGTGCG CCTGCACGGC CATCACGGCA TTCCCCCGGA AGTCGAGCAG GAGCCCTATG TGGCGGCCGT CCCCATGGTG GAGTCGACGC ACTATCTCAG CGAGATGGAT CCGCGTTTCG AACCTCAGCA GTTCGAGCTC GACTTCGAGC CGGATGACGA AGACTGGTCG TCATATGACG ACCACGCCCA CGAGTCTCGA CACCAAGAAG CCTAG
|
Protein sequence | MRSLPATLPI LFLSEISFLL GHGLIMTLLG VRMSLEGFPS QMAGLLMSSF SLGFVIGSYL IEKRIRTVGH IRVFAACAAV LAVTAMLHGL WVNPWSWLVW RLFGGAATAG LLMVMESWVS GESTNDNRGR VLGWYLVIST SSLALGQWLL NAADPATFVL FSVSGILFAL SLVPLSIYRI HGPTRSSHDV SRKISLKELY RRAPVGLVSA FTAGLMVQAF FAMTPFYGQE IGLSTSQTAQ FMAITTLVAL VAQWTLGRIS DRFDRRKVIL CMALVMAISG AMISVAARFD FWVLLVVACF HTAMLHTLYS LSLSHTNDWL EPEETIQANA KLLIWYGIGS VIGPYSASLI MELTGPDGLW LFLGGVALTL AMFVMVRLHG HHGIPPEVEQ EPYVAAVPMV ESTHYLSEMD PRFEPQQFEL DFEPDDEDWS SYDDHAHESR HQEA
|
| |
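The gene sequence begins with TTG, while the protein sequence begins with M. This is consistent with translation table 11, in which TTG is one of the accepted alternative start codons and an initiator codon is read as methionine. The sketch below illustrates that relationship using only the Python standard library. The function name `translate_table11` is hypothetical, the start-codon set is written out from the NCBI definition of table 11 as an assumption to verify against that reference, and only the first 30 and last 15 bases of the gene are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order.
# Table 11 uses the same codon assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons permitted by translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna):
    """Translate a DNA string, reading a valid first codon as M
    and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS_TABLE_11:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "TTGCGCAGCT TGCCCGCAAC CCTTCCCATT"  # start of the gene sequence
last_15 = "CACCAAGAAG CCTAG"                   # end of the gene sequence

print(translate_table11(first_30))
print(translate_table11(last_15))
```

The first call yields MRSLPATLPI and the second yields HQEA, matching the first ten and last four residues of the listed protein sequence. Passing the full gene sequence to the same function should reproduce the complete protein, although that has not been checked here.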