Gene Csal_1021 details

Gene Information

Locus tag: Csal_1021
Symbol: araH
ID: 4027867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1152325
End bp: 1153326
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 66%
IMG OID: 637966198
Product: L-arabinose transporter permease protein
Protein accession: YP_573077
Protein GI: 92113149
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components
TIGRFAM ID:
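
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is the convention that reproduces the reported 1002 bp) and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1152325
END_BP = 1153326


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1002
print(protein_length(length_bp))  # 333
```

Both values agree with the Gene Length and Protein Length fields: 1002 bp is 334 codons, one of which is the stop codon.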


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACAG ATAAACTGGC TGCCAGCGAC AACACGTCAT CGGCACCGAA ACCCCGTGCC 
AAACCGCTAC GCACCCTGCT CGACACCTCC GGCCTGATCG CCATATTCCT GGTGCTGTTC
GTGGCCCTGG CGCTGTTCGT GCCGGACTTC CTGACCGGGC GAAATATCGT CGGGTTGCTG
CTCTCGGTGA CCTTGATCGG CACCATCGCC ACGACCATGA TGATGGTCCT CGCGCTCGGT
GAGGTGGATC TTTCGGTGGC CTCGATCGTG GCCTTCACCG GGGTCGTGGC AGCGGTCGTG
ACCTCGGCCA CCGGCAGCGT GTTCGTCGGC GTGCTGGGGG GCGTGGCCGC CGGCGGTGCG
GTGGGGGCGT TCAATGGCTT CGTGGTGGCC AAGTTCGGCA TCAACTCGTT GATCGCCACC
CTGGCGGCGA TGGAGTTCGT GCGCGGCCTG GCGTACATCA CCTCCGGCGG CGACGCGGTG
ATGGTGACCG TGCCGAGCTT CTTCAGTCTG GGGAGCGCTT CTTTCCTGGG GCTGACCCTG
CCGGTGTGGA CGATGATCGT GTGCTTCGTG ATCTTCGGCA TCGTGCTCAA CATGACGGCC
TTCGGTCGCA ACACCCTGGC CACCGGGGGC AACGCCGAAG CGGCGAGCCT GGCGGGGGTC
AACGTGCGTC GCCTGAAAAT CGCGGTGTTC GCGCTGCAGG GCGTCGTCGC CGGGGTCGCC
GGGGTGTTGC TGGCCTCGCG CATGGGCCTG GGCGATCCCA ATACCTCCAT GGGGCTGGAG
CTCGCGGTGA TCTCCGCCTG CGTGCTGGGC GGCGTGTCGC TTTCCGGCGG GGTCGCCTCG
ATCACCGGCG TGCTGGTCGG CGTGCTGATC ATGGGCTGCG TGCAGAACGC CATGGGGCTG
CTCAACGTAC CGACCTTCTA TCAGTACCTG GTACGCGGGG CGATCCTGCT GCTGGCGGTG
ATGTTCGATC GCTGGAAGCA AACCCGGCGC GCCAAGGGAT GA
 
Protein sequence
MSTDKLAASD NTSSAPKPRA KPLRTLLDTS GLIAIFLVLF VALALFVPDF LTGRNIVGLL 
LSVTLIGTIA TTMMMVLALG EVDLSVASIV AFTGVVAAVV TSATGSVFVG VLGGVAAGGA
VGAFNGFVVA KFGINSLIAT LAAMEFVRGL AYITSGGDAV MVTVPSFFSL GSASFLGLTL
PVWTMIVCFV IFGIVLNMTA FGRNTLATGG NAEAASLAGV NVRRLKIAVF ALQGVVAGVA
GVLLASRMGL GDPNTSMGLE LAVISACVLG GVSLSGGVAS ITGVLVGVLI MGCVQNAMGL
LNVPTFYQYL VRGAILLLAV MFDRWKQTRR AKG
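
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch using only the Python standard library, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical. Only the first line of the gene sequence is used as input; the full sequence can be pasted in the same space-separated form.

```python
from itertools import product

# Codon table in TCAG order; the amino-acid assignments are shared by
# the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record's layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above (60 bp, 20 codons).
first_line = "ATGAGCACAG ATAAACTGGC TGCCAGCGAC AACACGTCAT CGGCACCGAA ACCCCGTGCC"
print(translate(first_line))  # MSTDKLAASDNTSSAPKPRA
print(round(gc_content(first_line), 1))
```

The translated fragment matches the first 20 residues of the protein sequence. Running `gc_content` on the full gene sequence gives a value that can be compared with the GC content field in the Gene Information section.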