Gene Csal_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0203 
Symbol 
ID4027168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp222948 
End bp224279 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content66% 
IMG OID637965354 
Productmajor facilitator transporter 
Protein accessionYP_572266 
Protein GI92112338 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.371325 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGCCG CCACCACTCG TCCCACCACC TTGTCCGCTG CCTGGAGCCT ATGGACGCTG 
CTGCTCGGCG TGGGGCTTTT GATGATGGGC AATGGCCTGC AGGGCTCGCT GCTCGGGGTG
CGGGCCTCGG CGGCGGATTT CGGCAACACG GTCACGGGGC TGGTGATGTC GTCCTACTTC
GTGGGCTTTC TGGTCGGCTC GGTGCTGACG CCCCGCAAGC TGCGTGAGGT CGGACACGTG
CGGGTCTTCG CCGCGCTGGC CTCGATCACC TCGGTGGCGA TCCTGATTCA TGCCCTGTTC
GTCGAGCCTC TGGTGTGGGC GGCCATGCGC TTTGTCACCG GCTTTACGTA CGCCGGACTC
TACGTCGTCG CCGAAAGTTG GCTCAACGGT TATGCCTCCA ACCGGCTGCG CGGGCGCCTG
CTGGCGATCT ACATGGTGAT CAGCTATCTC GGCATGGGCG GCGGGCAACT GCTGCTGGGC
AGTGCCGACC CCTCGGGCAT GGCGTTGTTC CTGCTGGTGT CGATCCTGGT CTCGCTCGCC
CTGGTGCCGA TCTTGATCAG CTACACGCCG CAGCCCGAAC TCAGTCAGCC CGAGGCGATG
AGCCTGCGCG CGTTGTACCG GCTCTCGCCG CTGGGCACGG TGGGCTGCTT CATGACCGGC
ATCACCAATG GCGCCGTGTT CGGCATGGGG GCGGTGTTCG CCACCAACAG CGGCTTGAGC
GTCGCCCAGG TCTCGGTCTT CATGAGCGCC TTCATCTTCG GCGGCGCGAT CCTGCAGTGG
CCGCTGGGCA AGTTGTCCGA CAAGGCCGAC CGCCAGTGGG TGATCGTCGG CGTCGCCCTG
GTCGCGGTGA TGCTGGCGCT GGTTGGTGCG CTCGTCAGCG GCTGGTCGCC GATGGCCTTG
ACGCTGCTCG GGGCGCTGCT GGGCGCGACG ACACTCACGC TCTACTCGAT CTTTCTGGCC
TGCGCCAATG ACTTCCTCAC CGATCAGCAG ACCGTGGCGG CCAGTGCCAG TCTCGTTCTG
GCGTTGGGCA TCGGCGCGAT TCTGGGACCG GCCAGCGCCG GGGTGCTGAT GGAATGGCTG
GGACCGGATG GTTTCCTGTG GGACCTGGCC GTCATGCACA TCGTCATGGT GTTGTTCGGG
CTTTACTGCA TTCGTCACTA TCCCACCAGC GAATCGCCCG AGCAGGGCCA CTACGTGATG
GTGGCCTCGG ACACCACGCC GCTGGGCACG GCCTGGACGG AAGAAGCCGC GCAGGAAGAA
GGGCAACTGG AACTGGCGCT GGAGCTGGAA GGTGAGGGTG ACGATGAGAG TGCCGAGGGC
ATGACGAGAT AG
 
Protein sequence
MKAATTRPTT LSAAWSLWTL LLGVGLLMMG NGLQGSLLGV RASAADFGNT VTGLVMSSYF 
VGFLVGSVLT PRKLREVGHV RVFAALASIT SVAILIHALF VEPLVWAAMR FVTGFTYAGL
YVVAESWLNG YASNRLRGRL LAIYMVISYL GMGGGQLLLG SADPSGMALF LLVSILVSLA
LVPILISYTP QPELSQPEAM SLRALYRLSP LGTVGCFMTG ITNGAVFGMG AVFATNSGLS
VAQVSVFMSA FIFGGAILQW PLGKLSDKAD RQWVIVGVAL VAVMLALVGA LVSGWSPMAL
TLLGALLGAT TLTLYSIFLA CANDFLTDQQ TVAASASLVL ALGIGAILGP ASAGVLMEWL
GPDGFLWDLA VMHIVMVLFG LYCIRHYPTS ESPEQGHYVM VASDTTPLGT AWTEEAAQEE
GQLELALELE GEGDDESAEG MTR