Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4214 |
Symbol | |
ID | 8744842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 485956 |
End bp | 487200 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 646514760 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003405707 |
Protein GI | 284167429 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGACGG ACTACTCGGA GTCTATCCGC TGGCCGATGT TGATAACGCT GGTGCTGATC GGCTTTATCC CGGCTTTCTC GGGGGCACTG ATCAACCCGA CCATCCCGGC GATCCAGGAG ACATTCTCGC ACGAGCCGTA CTCGGAGACG TTGGCACAGC TCGTCAGTTC GATGTCGGCC TGGATCGTGA TCGTCGTCGC ACCCCTCACG GGATACGTGC TGGACAAGTA CGCGCGCAAG CCGATCCTGA TCGGGGCGGT CATCGTCTAC GGGGCCGGGA CGAGCATCGC GTTCTTCCTC GACTCGATCT ACCTGATCCT GGCGACCCGC GTGCTCGACG GCATCGCCGT CGGCGCGCTC ATGGTGACGG TGCCGACGCT CATCGCGGAC TACTACTCGG GCGGCCGCCG CGAGTCCATC ATGGGCTACT ACAGCGCCGT GACGGCCGCC GGCGGCGCCA TCGCCGCGGT CATGGGCGGC TACATCGCGG GGACCCTCGG TTGGCGGTAC ATCTTCCTCG TCTTCGCGGG GGCGCTCCTG TTCGTCCCGC CGATCCTCCG GTTCCTGCCG GAACCCGACG TGGAGGAGTC GGCTAGGGAC GACGGCGTCG GCCGCCTCGA GGCGATCAAG AAACTCCTGC GTGAGTCGCC CGTAAAGCTG GTCGCCGGCA TCTACGCCAT CGTGTTGGTG GGAATGCTGG TGAACAACCT CGTCATGATC GAGGTGCCGT ACTATCTCCA GAGCTCCCTC ACCGTGACCG ATTCGCAGAC CGGCCTCGTG ATCTCGGGTG TGATGATCGG CAGCTTCGTC TCCGCCTCGA TGTACGGTCG GATCAAACAG CGCATGCGTC ACGTCACGGT GATGGCGCTC GGTTTCGTCA TCGCAGCGAT GGGATTCCTC CTGTTTACCG CCGCTGATAC CCTCCCGGTC GTGATCGCCG GCGTCATCGT CAGCGGGGCG ACGGGATTCG GCATCATCCA GCCCACGGCG AACGATTGGG TCGCGTCGGT CGTCCCGGGA GAGGTCCGCG GTCGCGCCCT CAGCGGCGTG ACGATGATGA TGTACGGTGG GTTCGCGCTC TCGCCGTTCG CCCCGATACC GCTGGTCGAC GCGTTCGGTC GTAGGGGGAT GCTGCAGACC GCCGGCTACG TCATGCTCAT CGTCGGGGGC GCCCTCCTAA CGGTCTGGTT CGTCAGTCGA TCTACCGTCG ACTCGACAGC GAAGGTGTCA TCCTCTGACG ACTGA
|
Protein sequence | MSTDYSESIR WPMLITLVLI GFIPAFSGAL INPTIPAIQE TFSHEPYSET LAQLVSSMSA WIVIVVAPLT GYVLDKYARK PILIGAVIVY GAGTSIAFFL DSIYLILATR VLDGIAVGAL MVTVPTLIAD YYSGGRRESI MGYYSAVTAA GGAIAAVMGG YIAGTLGWRY IFLVFAGALL FVPPILRFLP EPDVEESARD DGVGRLEAIK KLLRESPVKL VAGIYAIVLV GMLVNNLVMI EVPYYLQSSL TVTDSQTGLV ISGVMIGSFV SASMYGRIKQ RMRHVTVMAL GFVIAAMGFL LFTAADTLPV VIAGVIVSGA TGFGIIQPTA NDWVASVVPG EVRGRALSGV TMMMYGGFAL SPFAPIPLVD AFGRRGMLQT AGYVMLIVGG ALLTVWFVSR STVDSTAKVS SSDD
|
| |