Gene Csal_1821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1821 
Symbol 
ID4027378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2071097 
End bp2072431 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content61% 
IMG OID637967010 
Productmajor facilitator transporter 
Protein accessionYP_573872 
Protein GI92113944 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000983336 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGCAGCT TGCCCGCAAC CCTTCCCATT CTATTTCTTA GCGAAATCAG TTTCTTGCTT 
GGCCATGGGC TGATCATGAC CTTGCTCGGG GTGCGCATGT CGCTGGAGGG GTTCCCCTCC
CAGATGGCGG GTTTGCTGAT GTCGAGTTTC TCGTTGGGCT TCGTGATCGG AAGCTATCTG
ATCGAGAAGC GTATTCGCAC GGTGGGGCAC ATTCGGGTTT TCGCGGCGTG TGCCGCGGTA
CTGGCCGTGA CGGCGATGTT GCATGGGCTC TGGGTCAATC CCTGGTCGTG GCTGGTCTGG
CGACTGTTCG GGGGCGCAGC GACCGCGGGG TTGCTGATGG TGATGGAGTC CTGGGTCTCG
GGCGAATCGA CCAACGACAA TCGCGGGCGT GTGCTCGGCT GGTACCTGGT CATCTCGACA
TCGTCGTTGG CGCTGGGGCA ATGGCTGCTC AATGCCGCCG ACCCCGCAAC CTTCGTGCTG
TTTTCCGTGT CCGGCATTCT CTTCGCACTG TCGCTGGTGC CCCTGTCGAT TTATCGTATC
CACGGGCCGA CGCGTTCGTC GCACGATGTC TCGCGCAAGA TTTCACTGAA GGAACTGTAT
CGGCGTGCGC CAGTGGGGCT GGTGAGTGCC TTCACGGCGG GGCTGATGGT GCAGGCCTTC
TTCGCCATGA CGCCTTTCTA CGGGCAGGAA ATCGGTCTGT CGACGTCACA GACCGCGCAA
TTCATGGCCA TCACCACGCT GGTCGCGCTG GTGGCGCAGT GGACCCTGGG GCGCATTTCC
GATCGTTTCG ACCGCCGCAA GGTGATTCTC TGCATGGCGC TGGTCATGGC CATCTCGGGG
GCCATGATCT CGGTCGCGGC GCGTTTCGAC TTCTGGGTGC TGCTGGTGGT GGCCTGCTTC
CATACCGCCA TGCTGCATAC GCTGTATTCC TTGAGCCTGT CGCATACCAA CGACTGGCTC
GAGCCGGAGG AAACCATTCA AGCCAACGCC AAGCTCTTGA TCTGGTATGG CATCGGTTCG
GTGATCGGGC CTTACAGCGC TTCACTGATC ATGGAGCTGA CCGGGCCGGA CGGGCTGTGG
CTTTTCCTGG GCGGCGTGGC GTTGACGCTG GCCATGTTCG TGATGGTGCG CCTGCACGGC
CATCACGGCA TTCCCCCGGA AGTCGAGCAG GAGCCCTATG TGGCGGCCGT CCCCATGGTG
GAGTCGACGC ACTATCTCAG CGAGATGGAT CCGCGTTTCG AACCTCAGCA GTTCGAGCTC
GACTTCGAGC CGGATGACGA AGACTGGTCG TCATATGACG ACCACGCCCA CGAGTCTCGA
CACCAAGAAG CCTAG
 
Protein sequence
MRSLPATLPI LFLSEISFLL GHGLIMTLLG VRMSLEGFPS QMAGLLMSSF SLGFVIGSYL 
IEKRIRTVGH IRVFAACAAV LAVTAMLHGL WVNPWSWLVW RLFGGAATAG LLMVMESWVS
GESTNDNRGR VLGWYLVIST SSLALGQWLL NAADPATFVL FSVSGILFAL SLVPLSIYRI
HGPTRSSHDV SRKISLKELY RRAPVGLVSA FTAGLMVQAF FAMTPFYGQE IGLSTSQTAQ
FMAITTLVAL VAQWTLGRIS DRFDRRKVIL CMALVMAISG AMISVAARFD FWVLLVVACF
HTAMLHTLYS LSLSHTNDWL EPEETIQANA KLLIWYGIGS VIGPYSASLI MELTGPDGLW
LFLGGVALTL AMFVMVRLHG HHGIPPEVEQ EPYVAAVPMV ESTHYLSEMD PRFEPQQFEL
DFEPDDEDWS SYDDHAHESR HQEA