Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_3229 |
Symbol | |
ID | 4028563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 3598610 |
End bp | 3599827 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637968444 |
Product | major facilitator transporter |
Protein accession | YP_575272 |
Protein GI | 92115344 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.250614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACAT CCACGGACAG CTCACGCCAT GCCGGCGGGG CCGCGCATCC TCGTCTGGCG GAACTCGCGC TGGCGCTCGG CGGCTTCGGC ATCGGCACCG GCGAATTCGT GATCATGGGG CTGATGAGCC GCGTCGCCGA GGACCTTCAG GTGTCGGTGC CCGATGTCGG CTACGCCATC AGCAGTTATG CGCTCGGGGT CGTCGTCGGA GCGCCGATCA TTTCCGCACT GGCGGCGCGT CTGCCCAAGC GGGCATTGCT GATCGGGCTG ATGCTGCTGT TCGCCATCGG CAATTTCGCC AGCATCATGG CGCCGCATTT CGGCACCTTC GTGGGCCTGC GCTTCATCGC CGGCCTGCCC CACGGGGCCT ACTTCGGGGT CGCGGCGCTG GTCGCGGCGG CGGCGGTGCC CGTCGAGCAG CGCGCCCGCG CGGTGGCCAG GGTGATGACC GGGCTGACCG TGGCGATCCT GATCGGGGCG CCGTTGGCCA CCTGGACGGG CAACCTGCTC GGCTGGCAGG CGGCCTTCGC CGGCGTGGGC GGCATTGCGC TGCTGACGGC GCTGATGGTG CGCCTGTGGG TGCCGTATCA GTCCGCCGAC CACCAGGCCA GTCCCAAGCG CGAGATGACC GCGATGATCA AGCCGCGTGT CCTGTTCACG CTGGGTGTGG CCTGCTTCGG ATGCGGCGGC ATGTTCGCGG TCTTCAGCTA CGTCATGCCC ACGCTGACCC AGCAGGCGGG CATGGCCGAG TCGCTGGGGC CGCTGGTGCT GGCGATCTTC GGGATGGGCA CCATCCTCGG CAATTTCGCG GGTGCGCGCA TCGCCGATTG GAACCTGCTG CGCGGCATCC CGATCATCCT GCTGTGGGTT GCCTGCGTGC AGGGCGGCTT CTACTTCGCC GCCAACACGG TATGGACGGG GCTGTTGTTC GTCGCTCTGG TGGGGACCAG CATGGCCGTC GCCCCGGCGA TGCAGACCCG GTTGATGGAT GTCGCCGAGG ATGCCCAGAC CATGGCCGCC TCGCTCAATC ACGCTGCCTT CAACATGGCC AATGCCCTGG GCGCCTGGCT GGCCGGCGTC ACCATCAAGA TGGGGCTGGC GTGGTCTTCC ACCGGCCTGG TCGGCACTTC GCTGGCCCTG CTGGGCATCG CGATCTTCGC CACGGGGCGC TGGATGGAAA AGCGCGAGGC ACGTCCGCAT CGCTCCGTCT CGTCTTAA
|
Protein sequence | MSTSTDSSRH AGGAAHPRLA ELALALGGFG IGTGEFVIMG LMSRVAEDLQ VSVPDVGYAI SSYALGVVVG APIISALAAR LPKRALLIGL MLLFAIGNFA SIMAPHFGTF VGLRFIAGLP HGAYFGVAAL VAAAAVPVEQ RARAVARVMT GLTVAILIGA PLATWTGNLL GWQAAFAGVG GIALLTALMV RLWVPYQSAD HQASPKREMT AMIKPRVLFT LGVACFGCGG MFAVFSYVMP TLTQQAGMAE SLGPLVLAIF GMGTILGNFA GARIADWNLL RGIPIILLWV ACVQGGFYFA ANTVWTGLLF VALVGTSMAV APAMQTRLMD VAEDAQTMAA SLNHAAFNMA NALGAWLAGV TIKMGLAWSS TGLVGTSLAL LGIAIFATGR WMEKREARPH RSVSS
|
| |