Gene Csal_1159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1159 
Symbol 
ID4028098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1325439 
End bp1326746 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content64% 
IMG OID637966336 
Productmajor facilitator transporter 
Protein accessionYP_573214 
Protein GI92113286 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTACTT CGCAAGAGAC CGACGCGAGT GCGTCGCGCC CGCTGAATCG TCGCGACGTC 
AAGGTGCTGT CGCTGTCCGC GCTCGGCGGC GCCCTGGAAT TCTACGACTT CATCATTTTC
GTGTACTTCG CGACGGTGGT GGGGCAACTG TTCTTTCCCC CTGAAATGCC CGAATGGCTG
CGTCAGATCC AGACCTTCGG GATCTTCGCC GCCGGCTACC TGGCACGCCC GCTGGGCGGC
ATCATCATGG CGCATTTCGG CGATCTGCTG GGGCGCAAGA AGATGTTCAC CCTGTCGATC
TTCCTGATGT CGGTGCCGAC GCTGCTGATC GGCGTGATGC CCACCTACGA CACCCTGGGG
TATGCGGCGC CCTTGCTGCT GGTGGCGCTG CGCATCCTGC AAGGGGCCGC CGTAGGCGGT
GAAGTCCCCG GCGCCTGGGT GTTCGTCACC GAGCACGTCA AGCGCCACCA TGTGGGCTTC
GCCTGCGGCA CGCTGTCCGC CGGCCTGGTA TCGGGCATTC TCATCGGCTC GCTGATGTCG
GCGTTCATCA AGACCACCTA CAGTGATGCG GAGCTGGCCG CGTATGCCTG GCGGATTCCG
TTCTTGATCG GCGGGGTGTT CGGATTGCTG GCCGTGTATC TGCGGCGCTG GCTGCACGAG
ACGCCGGTCT TCGCCGAAAT GCAGCAGAAA AAGGCGCTGG CCGAGGAACT GCCGGTCAAG
AGCGTCTTGC GCAACCACTT GCCGAGCGTG GTGCTGTCGA TGGGCGTGAC CTGGATTCTC
ACCGCCGCCA TCGTGGTGGT GATCCTGATG ACGCCGAGTC TGCTCGAGAC GCGTTATGGC
CTGGATGCCT CGCTGGCCAA TGTGTATGCC ATTCTCGGTG TCGTGGTGGG GAGTCTGGCT
TCCGGCTGGT GCGCGGATCG CCTCGGCAGC GGCCCGACCA TCGCCTTTTG GGGCGTGCTG
CTGGCCATCA GCTACTGGGT GATGATGACC ACCGTGAGCA CGCACCCCGA ATGGCTGACG
CCGCTCTATA TCCTGAGCGG CTTCGCGGTG GGGATCGTCG GTGTGGTACC CACCATCGCC
GTCAAGTCGT TTCCGGCGGT GGTGCGTTTC ACGGGACTGT CGTTTTCCTA TAACGTCGCC
TACGCCATCT TCGGTGGCTT CACGCCCATC GTGGTGTCGG TATTGATGAC CGTGCATCCG
CTGTTCCCTG CGGTCTACGT CGCCGCGCTG GGCGGGCTGG GCGTGATCAT CGGCGTGTAC
TTGATGCAAA CGTCCAGCGG GCGCCGACTG GCGGTCATGC CATCGTGA
 
Protein sequence
MATSQETDAS ASRPLNRRDV KVLSLSALGG ALEFYDFIIF VYFATVVGQL FFPPEMPEWL 
RQIQTFGIFA AGYLARPLGG IIMAHFGDLL GRKKMFTLSI FLMSVPTLLI GVMPTYDTLG
YAAPLLLVAL RILQGAAVGG EVPGAWVFVT EHVKRHHVGF ACGTLSAGLV SGILIGSLMS
AFIKTTYSDA ELAAYAWRIP FLIGGVFGLL AVYLRRWLHE TPVFAEMQQK KALAEELPVK
SVLRNHLPSV VLSMGVTWIL TAAIVVVILM TPSLLETRYG LDASLANVYA ILGVVVGSLA
SGWCADRLGS GPTIAFWGVL LAISYWVMMT TVSTHPEWLT PLYILSGFAV GIVGVVPTIA
VKSFPAVVRF TGLSFSYNVA YAIFGGFTPI VVSVLMTVHP LFPAVYVAAL GGLGVIIGVY
LMQTSSGRRL AVMPS