Gene Information |
Locus tag | Htur_3924 |
Symbol | |
ID | 8744552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 180990 |
End bp | 182192 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 646514505 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003405452 |
Protein GI | 284167174 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
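The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon (the listed gene sequence ends in TGA). The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length(180990, 182192)
print(length)                  # 1203, matching "Gene Length"
print(protein_length(length))  # 400, matching "Protein Length"
```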
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCAC GACGTGTACG AGGAGACGTT CCCTGGTCCT CCGCTCTCTT TCAGACGGTG CTGGCGTGTT CGCTCATCGG TGTGATGGGG GTGCCGCTGA TCAGCCCTAT TCTCCCGTCC CTCCGTTCGG TATTCGGAAT TAGCGATGCG CAGATAGGAC TCGTAATCAC AGCGTATACG CTCCCAGGCG TGTTTCTGAC GCCGTTTATC GGCCTGATAT CCGATCGTCT GGGACGCCGA GCGGTCGTTC TTCCCCTCCT GACACTTTTC GGTCTGGCCG GAGGAGGTAT CGCCCTCGGC CCGCCGTTTC GCGGCGTTCT GGCTCTTCGG TTCGTCCAAG GGATCGGCGG AAGCGGACTG ATGGTACTGG CGATAACGCT CATCGGCGAT TTCTACGCGG GAGAACGACG GAATACGGTG ATGGGAATCA ACGGAAGCGC AATCGGTATC GGTGCCGCCG CGTATCCTCT GCTCGGTGGC GTCTTAGCTG CCGTCCGCTG GAACGTTCCG TTCGCGTTCT TCGGTCTCAG TCTCGTCGTC GGTGCGATCG CTCTCTTCAC GCTCGAGGAA CCCGCGGTAG ACGATCCGCC TGCGATTGGA CCGTACGTAT CACGGGTGAT TTCCGTCGTT CTCGTCCCGC GAGCGCTCGG TCTCTGGGCC GCAGCGTTTC TGACGTTTTT CCTGTTTTAC GGGTGTATTC TAACCGTACT ATCGCTGCTT CTGAGCGACG TTTACGGCCT TTCTTCCGGG CAGATCGGCT TGTTATTCGG CGCGGTCTCT ATCGCTAACG CGTCCATCGC GTCCCAGTAC GGACGTGTCT CCCGTTACTT CGAGGCAGAG GAGCTGATCG CCCTCGGTTT CGTGGGATTC GGCATCAGCC TCCTCGGCGT CTGGGCGGCT TCCACGCCGG TACTGATCGG CGTGATGTTG CTCTGTTTCG GTCTGGGATT CGGACTCGTG ATGCCCTCGC TCGACACGAC CGTCGTCGGC CTCGTCTCGG GTCAACTCCG CGCGAGCATG ATGGGCGTCC TGACGAGTAT GCTCTGGCTC GGGCAGACCG TCGGTCCGAT CGCGTTCACC GGTTTCGCGG GCGTCGCTTT CGACGAACCG GTGACCGGAT ATCGGTTTCT ACTGTTGTTC TGGGGGGTGG CCACGCTCGT CAGCGGCGGA CTCGCGTTCC TGGCTCTCGA TCGTCGATCC TGA
|
Protein sequence | MPARRVRGDV PWSSALFQTV LACSLIGVMG VPLISPILPS LRSVFGISDA QIGLVITAYT LPGVFLTPFI GLISDRLGRR AVVLPLLTLF GLAGGGIALG PPFRGVLALR FVQGIGGSGL MVLAITLIGD FYAGERRNTV MGINGSAIGI GAAAYPLLGG VLAAVRWNVP FAFFGLSLVV GAIALFTLEE PAVDDPPAIG PYVSRVISVV LVPRALGLWA AAFLTFFLFY GCILTVLSLL LSDVYGLSSG QIGLLFGAVS IANASIASQY GRVSRYFEAE ELIALGFVGF GISLLGVWAA STPVLIGVML LCFGLGFGLV MPSLDTTVVG LVSGQLRASM MGVLTSMLWL GQTVGPIAFT GFAGVAFDEP VTGYRFLLLF WGVATLVSGG LAFLALDRRS
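Both sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of how one might do that and then compute a GC fraction or split the gene sequence into codons. The helper names are hypothetical, and the short inputs are only the first 30 bases of the gene sequence plus a toy string, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove display whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def codons(raw: str) -> list[str]:
    """Split a sequence into complete triplets, starting at the first base."""
    seq = clean_sequence(raw)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return [seq[i:i + 3] for i in range(0, usable, 3)]


first_blocks = "ATGCCCGCAC GACGTGTACG AGGAGACGTT"  # first three blocks of the gene sequence
print(len(clean_sequence(first_blocks)))  # 30
print(codons(first_blocks)[:4])           # ['ATG', 'CCC', 'GCA', 'CGA']
print(gc_content("ATGC GGCC"))            # 0.75 (toy input)
```

Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field, and the codon list can be translated with the genetic code named in the Translation table field to compare against the protein sequence.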