Gene Csal_0449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0449 
Symbol 
ID4027023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp496253 
End bp497635 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content66% 
IMG OID637965607 
Productmajor facilitator transporter 
Protein accessionYP_572510 
Protein GI92112582 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.780134 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTCAT CTCTACTTCG TACCGAGCGG CGCGCCATTT TCGGCCTCGC CAGCTTATAC 
GCCACGCGCA TGCTCGGTCT GTTCATGGTA TTACCGGTCC TGGCGGTCTA TGCCGAGGAC
CTGGATGGCG CCACGCCCCT GCTGGTGGGC ATGGCGTTGG GCGGTTACGG TCTGACGCAG
GCGATTCTGC AGATTCCCTT CGGGCTCATG TCGGATCGCT TCGGCCGCAA GCTCATCATC
AGCGTGGGAT TGCTCCTGTT TCTGCTGGGC AGCGTGATCG CGGCGCAGGC GACCTCGATC
GGCTGGGTCA TCGCCGGCCG CTGTCTGCAA GGCAGCGGCG CGGTCGCCGG TGCGATCATG
GCGTTGCTCG CCGACCAGAC GCGGGAGGAA GTCCGCACGG CGGCGATGGC GACCATCGGG
CTGTCCATCG GTATCGCCTT CGGCGTGGCG ATGGTCGTGG GTCCCCTGGT CGCCGACCCT
TTCGGCCTGG CCGGCGTGTT CTGGTTCACG GCCGTGCTGG CGGTGGTCGG TCTGCTGGTG
CTGTGGCGTC TGGTGCCGCG GGCGCCACGC TTGATGGCAC ACCGTGACGT CGGACTCGAT
CGCACGCAGT TGCGCGCGAT GCTCGCGCGG CATGATCTGC TGCGGCTGGA TTTCTCGATT
TTCGCACTGC ATGCCATTCT CACCGCCTGC TTCGTGGCCG TGCCCTTCCG GCTGGAAGCC
CTGGGCATCG CCCCGGCACA CCATGGCTGG GTCTATTTGC CGATCATGGC ACTGGCCTTC
GTGGGCATGG TTCCGCTGGT GATCGTGGCC GAGAAATACC GCAAGATGAA ACCCATCTTT
CTCGGGGCAG TGGCCTGGCT GACCCTCTGT CTGGCCGGGC TGGTGGAGTT TTCCGACGGC
CGCTGGAGCC TGTTTGCCCT TCTCTGGGGC TTCTTCGTGG CATTCAACCT GCTCGAGGCG
ACGCTGCCGT CGATGATCAG CAAGCTGGCA CCGGCGGGTG CCAAGGGCAC GGCGATGGGC
GTGTACTCCA CCAGCCAGTT CCTGGGGGCG TTCCTGGGCG GCACCGCGGG CGGTTTCCTC
TCGCAGCACT ACGGGCTGAG TGCCGTCTTT CTGGGCGCCG CCCTGCTGGG TGCGGTATGG
TTGGCGATTG TCTGGAAGAT GCCGGCACCC CGCCATCTCT CGAGCGAGAT CGTCGCCCTG
GACGAGCGCT CCCTGGGCAC AGTGGATACA CTGATGGACC GCTTCGCCGC CGTGGCCGGT
GTCGAGGATG TCATGGTAGT GCCCGACGAG CGTGTCGCCT ACCTCAAGGT CGACCGGCAG
CGGCTGGACG CGGAGGCGCT GGCCCGCGTT CTGGGTACCG AACCGGATCA CAAGCGCGCC
TGA
 
Protein sequence
MSSSLLRTER RAIFGLASLY ATRMLGLFMV LPVLAVYAED LDGATPLLVG MALGGYGLTQ 
AILQIPFGLM SDRFGRKLII SVGLLLFLLG SVIAAQATSI GWVIAGRCLQ GSGAVAGAIM
ALLADQTREE VRTAAMATIG LSIGIAFGVA MVVGPLVADP FGLAGVFWFT AVLAVVGLLV
LWRLVPRAPR LMAHRDVGLD RTQLRAMLAR HDLLRLDFSI FALHAILTAC FVAVPFRLEA
LGIAPAHHGW VYLPIMALAF VGMVPLVIVA EKYRKMKPIF LGAVAWLTLC LAGLVEFSDG
RWSLFALLWG FFVAFNLLEA TLPSMISKLA PAGAKGTAMG VYSTSQFLGA FLGGTAGGFL
SQHYGLSAVF LGAALLGAVW LAIVWKMPAP RHLSSEIVAL DERSLGTVDT LMDRFAAVAG
VEDVMVVPDE RVAYLKVDRQ RLDAEALARV LGTEPDHKRA