Gene Csal_1675 details

Gene Information

Locus tag: Csal_1675
Symbol:
ID: 4028687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1904107
End bp: 1905324
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 64%
IMG OID: 637966864
Product: major facilitator transporter
Protein accession: YP_573727
Protein GI: 92113799
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
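
The coordinate and length fields in this record are consistent with one another, and the relationship can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names `gene_length` and `protein_length` are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon
    that is not counted as a residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1904107, 1905324)
print(length, protein_length(length))  # prints: 1218 405
```

Under these assumptions the computed values match the reported Gene Length (1218 bp) and Protein Length (405 aa).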


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.816846
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAACCTG CCCGCCTGGA AAAGCGTAAC GTGGCCATTC TGGTCAGCAG TCAGATTCTG 
TTCATGGTGG CCTCGATCAC GGTCATGACG CTGAGCGGGA TGGTCGGCCT GCAGTTGAGC
CCGACCTCTG GGCTCGCCAC GCTGCCCATT GCCATCTCGA TGCTGGGTAC GGTGGCCTCG
ACCCTGCCGG CTTCGCTCTA CATGAAGCGC GTGGGCAGGC GCCGCGGGTT CATCACCGGC
ACGATCCTGG GCGGCATCGC GGGTGGCTTG CTGAGTTTCG TGGCCATTGC CCAGCAGTCG
TTCTGGCTGT TCTGCGTCGG CAACCTGCTG CTGGGGCTCT ACCAAGGATT CGCCATGTAC
TATCGCTTTG CCGCTCTGGA CGTGGCGAGC CCTGCCTTTC GCAGCCGGGC GATTTCTTTC
GTCATGGCGG GGGGCGTGGT GGCTGCGTTC CTCGGCCCCT GGAACGTCAG TGCCACGGCC
GACTGGATCG CCGGCGTGCC GTCCGGTGGG CCTTACCTGG TGATCGCCAT TCTCGCCCTG
TTGGCCACCG GCCTGCTGAC CCAGCTCAAG ATGCCCGCCA GTGAGGAACC GCAACCCGGC
GAGACGTCTC GACCCATGCC GGTCATTGCC ACTCAGGCGG GTTTCATGGT CGCCTTGCTG
GCCGGCGCGG TGGGCTACGC CATCATGACA CTGGTCATGA CGGCCACGCC GCTGGCCATG
CGCGCGCATG GCTTCGGGAT GGAGCAGATT GCCTTCATCA TGCAGTGGCA TGTGCTAGGC
ATGTTCGCCC CCTCCTTCGT GACCGGCAGC CTCATCGCCC GCTTCGGGAT ACCGCGCATG
CTGCTGACCG GCACGCTCTT GATGGCCGGC ACGGCCCTGA TCAGCAATCT TGGCGTTAGC
CTGGCCCATT TCTGGGTGGC CCTGGTACTG CTGGGTATCG GCTGGAACTT CCTGTTCGTG
GGCGGCAGCA CCCTGCTCTC GGCCGCCCAT ACGGATGCCG AACGCGGCAA GGTACAGGGC
ATCAATGATC TGGTCATCTT CTCCCTGGTC GCCCTCGGCT CTCTGATGTC GGGCGCATTG
TCGTACCACC TTGGCTGGAA GGCGCTCAAT CTGGCGATGC TGCCCCCCAT TGTGCTGGTG
GCCCTGGCCA CGCTCTGGTA TCGCTGGCAC GCCGCCGCGA AGCCTTCCAT CAGCCTGGCG
CCTCAATCCA AGAAGTGA
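
The GC content field can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch, assuming GC content means the percentage of G and C among unambiguous A/C/G/T bases. The helper name `gc_percent` is hypothetical, and the database may round or handle ambiguity codes differently.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among A/C/G/T characters.

    Whitespace and any other characters are ignored, so the
    space-separated 10-base blocks can be pasted in directly.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence, used only as a small example.
print(round(gc_percent("GTGGAACCTG CCCGCCTGGA"), 1))
```

Passing the full gene sequence to the same function gives a value that can be compared with the reported 64%.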
 
Protein sequence
MEPARLEKRN VAILVSSQIL FMVASITVMT LSGMVGLQLS PTSGLATLPI AISMLGTVAS 
TLPASLYMKR VGRRRGFITG TILGGIAGGL LSFVAIAQQS FWLFCVGNLL LGLYQGFAMY
YRFAALDVAS PAFRSRAISF VMAGGVVAAF LGPWNVSATA DWIAGVPSGG PYLVIAILAL
LATGLLTQLK MPASEEPQPG ETSRPMPVIA TQAGFMVALL AGAVGYAIMT LVMTATPLAM
RAHGFGMEQI AFIMQWHVLG MFAPSFVTGS LIARFGIPRM LLTGTLLMAG TALISNLGVS
LAHFWVALVL LGIGWNFLFV GGSTLLSAAH TDAERGKVQG INDLVIFSLV ALGSLMSGAL
SYHLGWKALN LAMLPPIVLV ALATLWYRWH AAAKPSISLA PQSKK
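
The protein sequence follows from the gene sequence under translation table 11, where an initiating GTG codon is read as methionine rather than valine. The sketch below is a simplified illustration, not the database's pipeline. It assumes the standard codon assignments (which table 11 shares), treats only ATG, GTG, and TTG as alternative initiators (a simplification of the table 11 start set), and stops at the first stop codon. The name `translate` is illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of bacterial initiator codons.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an initiator codon at position 0 as Met."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in ALT_STARTS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("GTGGAACCTG CCCGCCTGGA AAAGCGTAAC"))  # prints: MEPARLEKRN
```

The output matches the first block of the protein sequence, and the final codons of the gene (`CCT CAA TCC AAG AAG TGA`) translate to the C-terminal `PQSKK` followed by a stop.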