Gene Csal_3229 details


Gene Information

Locus tag: Csal_3229
Symbol:
ID: 4028563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 3598610
End bp: 3599827
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 68%
IMG OID: 637968444
Product: major facilitator transporter
Protein accession: YP_575272
Protein GI: 92115344
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
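
The start and end coordinates, gene length, and protein length in this table can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the convention that reproduces the listed gene length) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3598610
end_bp = 3599827
gene_length_bp = 1218
protein_length_aa = 405

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(span, 3)
residues = codons - 1

print("span (bp):", span)
print("codons:", codons, "leftover bases:", remainder)
print("residues excluding stop:", residues)
```

Under these assumptions the computed span and residue count agree with the listed Gene Length and Protein Length.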


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.250614
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACAT CCACGGACAG CTCACGCCAT GCCGGCGGGG CCGCGCATCC TCGTCTGGCG 
GAACTCGCGC TGGCGCTCGG CGGCTTCGGC ATCGGCACCG GCGAATTCGT GATCATGGGG
CTGATGAGCC GCGTCGCCGA GGACCTTCAG GTGTCGGTGC CCGATGTCGG CTACGCCATC
AGCAGTTATG CGCTCGGGGT CGTCGTCGGA GCGCCGATCA TTTCCGCACT GGCGGCGCGT
CTGCCCAAGC GGGCATTGCT GATCGGGCTG ATGCTGCTGT TCGCCATCGG CAATTTCGCC
AGCATCATGG CGCCGCATTT CGGCACCTTC GTGGGCCTGC GCTTCATCGC CGGCCTGCCC
CACGGGGCCT ACTTCGGGGT CGCGGCGCTG GTCGCGGCGG CGGCGGTGCC CGTCGAGCAG
CGCGCCCGCG CGGTGGCCAG GGTGATGACC GGGCTGACCG TGGCGATCCT GATCGGGGCG
CCGTTGGCCA CCTGGACGGG CAACCTGCTC GGCTGGCAGG CGGCCTTCGC CGGCGTGGGC
GGCATTGCGC TGCTGACGGC GCTGATGGTG CGCCTGTGGG TGCCGTATCA GTCCGCCGAC
CACCAGGCCA GTCCCAAGCG CGAGATGACC GCGATGATCA AGCCGCGTGT CCTGTTCACG
CTGGGTGTGG CCTGCTTCGG ATGCGGCGGC ATGTTCGCGG TCTTCAGCTA CGTCATGCCC
ACGCTGACCC AGCAGGCGGG CATGGCCGAG TCGCTGGGGC CGCTGGTGCT GGCGATCTTC
GGGATGGGCA CCATCCTCGG CAATTTCGCG GGTGCGCGCA TCGCCGATTG GAACCTGCTG
CGCGGCATCC CGATCATCCT GCTGTGGGTT GCCTGCGTGC AGGGCGGCTT CTACTTCGCC
GCCAACACGG TATGGACGGG GCTGTTGTTC GTCGCTCTGG TGGGGACCAG CATGGCCGTC
GCCCCGGCGA TGCAGACCCG GTTGATGGAT GTCGCCGAGG ATGCCCAGAC CATGGCCGCC
TCGCTCAATC ACGCTGCCTT CAACATGGCC AATGCCCTGG GCGCCTGGCT GGCCGGCGTC
ACCATCAAGA TGGGGCTGGC GTGGTCTTCC ACCGGCCTGG TCGGCACTTC GCTGGCCCTG
CTGGGCATCG CGATCTTCGC CACGGGGCGC TGGATGGAAA AGCGCGAGGC ACGTCCGCAT
CGCTCCGTCT CGTCTTAA
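
The gene sequence is laid out in space-separated blocks of ten bases, so the spacing has to be stripped before any computation. As a minimal sketch of how a GC content figure like the one in the Gene Information table could be computed, the hypothetical helper below counts G and C bases over the cleaned sequence. The function name and the choice to ignore whitespace are assumptions made for illustration; the database's own method is not documented here.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGAGCACAT CCACGGACAG CTCACGCCAT GCCGGCGGGG CCGCGCATCC TCGTCTGGCG"
print(f"GC fraction of the first 60 bases: {gc_fraction(first_line):.2f}")
```

Passing the full 1218 bp sequence (all lines joined) to `gc_fraction` gives the value to compare against the listed GC content.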
 
Protein sequence
MSTSTDSSRH AGGAAHPRLA ELALALGGFG IGTGEFVIMG LMSRVAEDLQ VSVPDVGYAI 
SSYALGVVVG APIISALAAR LPKRALLIGL MLLFAIGNFA SIMAPHFGTF VGLRFIAGLP
HGAYFGVAAL VAAAAVPVEQ RARAVARVMT GLTVAILIGA PLATWTGNLL GWQAAFAGVG
GIALLTALMV RLWVPYQSAD HQASPKREMT AMIKPRVLFT LGVACFGCGG MFAVFSYVMP
TLTQQAGMAE SLGPLVLAIF GMGTILGNFA GARIADWNLL RGIPIILLWV ACVQGGFYFA
ANTVWTGLLF VALVGTSMAV APAMQTRLMD VAEDAQTMAA SLNHAAFNMA NALGAWLAGV
TIKMGLAWSS TGLVGTSLAL LGIAIFATGR WMEKREARPH RSVSS