Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_2117 |
Symbol | |
ID | 8419967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 2407308 |
End bp | 2408483 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645038710 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003198979 |
Protein GI | 258406237 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000191868 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.363164 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCACGAAT TGCTGAAAAA TAGAAATCTC CAAATCCTGT TCGCCGTCAC TTTGATGGTT GTCATGGGGG TTTCGAGTAT CATCCCCGTT CTGCCCACCC TGATCAAGGT CTTTGACCTC ACACCGGAAA CCATCGGACT GGTGCTGACC ACATTTACCC TGCCCGGAGT CCTGTTTACC CCTGTTTTCG GGGTCTTGGC CGATCGAGTC GGACGCAAAA AAATCCTCAT TCCGGCGCTG TTGGTCTTCG GCATTGCCGG GGCCGCCTGC GCGCTGGTCA GAAACTGGGA ACTGCTGCTC CTGTTGCGCT TTGTCCAGGG CACTGGCGCA GCTGCGTTTG GCATGATCAA TATCACGATC ATCGGCGATC TGTTCAGCGG CACGCGTCGC ACTGCCGCGC TTGGCCTCAA TGCGAGCATC CTCAGCGTGG GCACGGCAGT CTATCCGGCC GTGGGTGGCG CACTGGCTAT TTTGGGCTGG TATTTCCCGT TCGCCCTGCC CCTGGCCGCA ATTCCCCTGG CCCTCATTGT CCTCTGGCAC CTTGACAATC CCGAACCCGA CGGCGGGGAA TCCCTGGGTG ACTATTTCCG CAACGCCGCA GGACAAATGC GCAGCAGACA GGCTCTGGGA CTTTTCCTCT GCACCCTGGG GACCTTCATC CTGCTGTACG GGCCGTTTAT CACCTTTTTC CCGATCCTGC TGGACACCCG GTTCCAGGCC ACCGCCCCGC AAATCGGTCT TCTTATTTCA GCCGCGTCAT TTATCACCGC CATCGCCGCC TCCCAGTTGG GGCGATTGGC CGCCCATTTT TCCGAGCTCT GCCTGTTACG CTCGGCCTTC GTCTTCTATG GACTTGCTTT TGCCATCATC CCCTTTTTCA CGAGCTTCGC CTGGCTTTTG GTCCCGGCCT TGCTTTTTGG TGCGGCCCAG GCCCTGAGTA TCCCCAACGT CATGTCCCTG CTCAATGATA TCGCCACACC GCGGACCCGA GCCGCCTTCA TGGCCGCCAA CGGCACCCTG CTCCGGCTTG GCCAGACCCT GGCCCCGCTG TGCATGGGTG GCATCTTTGC CCTCGGCGGA CTTCGAGCTG TGTTCTGGTC CGGGACCCTT CTCGCCGCTT GCTTGTTCCT CTTGACCCAT CTCCTTTTAC GCCCGCAATG CGGCCCGCAA AGCTGA
|
Protein sequence | MHELLKNRNL QILFAVTLMV VMGVSSIIPV LPTLIKVFDL TPETIGLVLT TFTLPGVLFT PVFGVLADRV GRKKILIPAL LVFGIAGAAC ALVRNWELLL LLRFVQGTGA AAFGMINITI IGDLFSGTRR TAALGLNASI LSVGTAVYPA VGGALAILGW YFPFALPLAA IPLALIVLWH LDNPEPDGGE SLGDYFRNAA GQMRSRQALG LFLCTLGTFI LLYGPFITFF PILLDTRFQA TAPQIGLLIS AASFITAIAA SQLGRLAAHF SELCLLRSAF VFYGLAFAII PFFTSFAWLL VPALLFGAAQ ALSIPNVMSL LNDIATPRTR AAFMAANGTL LRLGQTLAPL CMGGIFALGG LRAVFWSGTL LAACLFLLTH LLLRPQCGPQ S
|
| |