Gene Csal_0040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0040 
Symbol 
ID4026383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp48242 
End bp49426 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content71% 
IMG OID637965192 
Productmajor facilitator transporter 
Protein accessionYP_572104 
Protein GI92112176 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCGCA CCGAATTCCG CCTGATCGTC GCCGGCACCA TGCTGATCGG CACCACCTAC 
GGCCTGGCAC GCTTCGCCTA CGGGCTGTTC CTGCCCAGCA TGCGCGACGA AGTCGGCTTG
AGCGCGACCC TGGCCGGCAT CATCGGCAGC GGCGCCTATG TCGGCTACTG CCTCGCCATC
GTGCTCAGCG CGCTGTTGGT GGAACGCTAC GGACCACGCC GGATCGCCGT CGCCGCCGCC
CTGATCGCGG CGGTGGGCAT GGCGGGCGTC GCCGTCAGCA CGCAGGCGAT ATGGCTGGCC
GGCGCCGTAC TGCTCGCCGG AACGAGCACC GGCCTGGCCT CGCCGCCCAT GGCGCAGGCG
GTATCCCGCG CGATCACCGC GCCCCGGCAA GGGCGGGCCA ACACCGTCAT CAATGCCGGT
ACCAGCCTGG GCGTCGCCGT CTCCGGGCCG GTCGCGTTCA TCGCCACCGG CCAGTGGCGG
CTCGCCTACG CCGCCTTTGC CGTCACCGCG TTGCTCAACG CCCTGCTGCT GTTGATCAGC
GTGCCACGCA CCAGCGCCAA CGATACCGCC AAGGCGGCCG ACGGCCAGGA CGACGACCTC
CCCGGCGGGC TCTGGCGGCC TCGTGCAGTG ACGCTGATCG CCGCAGCCAC CGGCATGGGC
GTGGCCAGCG CCGCCTTCTG GACCTTCTCG AGCGAAGTGG TCATCACGCT GGGGCACTTC
GAGCAGGCCA CGGCCAACAT CGCCTGGATC CTGATCGGCG TCGCGGGGCT GGTAGGTGGC
GCGGCAGGCG ACCTGATCGC ACGGCTCGGC CTCAACACCG TGCATCGTGG CAGTCTCGCG
GCGATGGCCG GTGCACTCGG GCTGCTGGTC CTCAGCCCCT CGAACCTGGC GGCGGTGCTC
GTCTCGGGCG CGCTGTTCGG CGCGGCCTAC ATCATGCTGA CCGGCGTCTA TCTCGTCTGG
GGCATCCGGC TGTATGCGGA CCGCCCCGCC ATCGGCCTCG GGCTTCCCTT TCTGATGATC
GCCGCGGGGC AGATCGTCGG CTCGCCCCTC GCGGGCTACC TGATCGGCAG CCGAGGGTAC
CTCGTCTGCT TCATCGCCTT CGCGCTGATC GCAGTGGCCA CGGCTCTCAT CGGCGCCAGG
ACCACCGAGC GCGCGGCGCT CCAACCGGCT TCCACTTCCC CCTGA
 
Protein sequence
MTRTEFRLIV AGTMLIGTTY GLARFAYGLF LPSMRDEVGL SATLAGIIGS GAYVGYCLAI 
VLSALLVERY GPRRIAVAAA LIAAVGMAGV AVSTQAIWLA GAVLLAGTST GLASPPMAQA
VSRAITAPRQ GRANTVINAG TSLGVAVSGP VAFIATGQWR LAYAAFAVTA LLNALLLLIS
VPRTSANDTA KAADGQDDDL PGGLWRPRAV TLIAAATGMG VASAAFWTFS SEVVITLGHF
EQATANIAWI LIGVAGLVGG AAGDLIARLG LNTVHRGSLA AMAGALGLLV LSPSNLAAVL
VSGALFGAAY IMLTGVYLVW GIRLYADRPA IGLGLPFLMI AAGQIVGSPL AGYLIGSRGY
LVCFIAFALI AVATALIGAR TTERAALQPA STSP