Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1021 |
Symbol | araH |
ID | 4027867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1152325 |
End bp | 1153326 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637966198 |
Product | L-arabinose transporter permease protein |
Protein accession | YP_573077 |
Protein GI | 92113149 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACAG ATAAACTGGC TGCCAGCGAC AACACGTCAT CGGCACCGAA ACCCCGTGCC AAACCGCTAC GCACCCTGCT CGACACCTCC GGCCTGATCG CCATATTCCT GGTGCTGTTC GTGGCCCTGG CGCTGTTCGT GCCGGACTTC CTGACCGGGC GAAATATCGT CGGGTTGCTG CTCTCGGTGA CCTTGATCGG CACCATCGCC ACGACCATGA TGATGGTCCT CGCGCTCGGT GAGGTGGATC TTTCGGTGGC CTCGATCGTG GCCTTCACCG GGGTCGTGGC AGCGGTCGTG ACCTCGGCCA CCGGCAGCGT GTTCGTCGGC GTGCTGGGGG GCGTGGCCGC CGGCGGTGCG GTGGGGGCGT TCAATGGCTT CGTGGTGGCC AAGTTCGGCA TCAACTCGTT GATCGCCACC CTGGCGGCGA TGGAGTTCGT GCGCGGCCTG GCGTACATCA CCTCCGGCGG CGACGCGGTG ATGGTGACCG TGCCGAGCTT CTTCAGTCTG GGGAGCGCTT CTTTCCTGGG GCTGACCCTG CCGGTGTGGA CGATGATCGT GTGCTTCGTG ATCTTCGGCA TCGTGCTCAA CATGACGGCC TTCGGTCGCA ACACCCTGGC CACCGGGGGC AACGCCGAAG CGGCGAGCCT GGCGGGGGTC AACGTGCGTC GCCTGAAAAT CGCGGTGTTC GCGCTGCAGG GCGTCGTCGC CGGGGTCGCC GGGGTGTTGC TGGCCTCGCG CATGGGCCTG GGCGATCCCA ATACCTCCAT GGGGCTGGAG CTCGCGGTGA TCTCCGCCTG CGTGCTGGGC GGCGTGTCGC TTTCCGGCGG GGTCGCCTCG ATCACCGGCG TGCTGGTCGG CGTGCTGATC ATGGGCTGCG TGCAGAACGC CATGGGGCTG CTCAACGTAC CGACCTTCTA TCAGTACCTG GTACGCGGGG CGATCCTGCT GCTGGCGGTG ATGTTCGATC GCTGGAAGCA AACCCGGCGC GCCAAGGGAT GA
|
Protein sequence | MSTDKLAASD NTSSAPKPRA KPLRTLLDTS GLIAIFLVLF VALALFVPDF LTGRNIVGLL LSVTLIGTIA TTMMMVLALG EVDLSVASIV AFTGVVAAVV TSATGSVFVG VLGGVAAGGA VGAFNGFVVA KFGINSLIAT LAAMEFVRGL AYITSGGDAV MVTVPSFFSL GSASFLGLTL PVWTMIVCFV IFGIVLNMTA FGRNTLATGG NAEAASLAGV NVRRLKIAVF ALQGVVAGVA GVLLASRMGL GDPNTSMGLE LAVISACVLG GVSLSGGVAS ITGVLVGVLI MGCVQNAMGL LNVPTFYQYL VRGAILLLAV MFDRWKQTRR AKG
|
| |