Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4231 |
Symbol | |
ID | 8744859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 502212 |
End bp | 503459 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646514777 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003405724 |
Protein GI | 284167446 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCACG GAATCGAATC TGGCGGACGA GCGGAATCGA CGGACGAACC GAGCGGGACC GTGCCGTGGG GATCCCGGAC GGTCCAGATC GTGTTGACGA GTACGGCGCT CGCACCGCTC GGCGTGCCAC TCATCAGCCC CGCACTGCCG GTCTTTCGCG ACGTGTTTGG GATCACCGAC GCACAGGCGA GCCTCCTGGT GAGCACGTAC TTCCTCGTCG GGATCGTCCT CTCGCCGTTC ATCGGCGTCC TCGCCGATCG AGTCGGCCGA AAGCGGGTTC TGGTCGGGGG ACTACTCGCG TTCGGCGTCC TCGGCGGTGC GATGGCGCTC GCGCCGACGT TCGAAGCCCT GCTCGCGCTG CGCGTCGCAC AGGGGACCGC AGCGGCGGCG ATCTTCATCA CGACCGTCAC GATCGTGGGC GACGCGTTCG ACGGCGTCCA GCGAAACGCA GTCCTGGGGG CAAATGTCGC GGTCCTCTCG GCTACCGCCG CGCTGTTTCC CGTCCTCGGC GGGTTCCTCG CAGGAATCGC GTGGAACGCG CCGTTTCTCG CGTACCTAGC CGCGATCCCG ATCGCCGCGT TCGCGCAGGC CGCGCTGGAC GAACCACAGC GCGTCGACGA CAGAGACGGG GTTTCGTACC TCGTCGATGC CGCACGAGCG GTTCTCACGC CGGCGCTCGC GGCGCTGTTC GCCGTCGCGT TCCTCACGGA GTTCCTGCTG TTCGGCGTGA TCTTCACGGC GATGCCGTTT GTCCTCGCGG CGACGCTCGC CCCCGTACTG ATCGGGGTCG TGATCCTGGT CTCCGAGACG GCGTCGATGC TGGTCGCGCT CTCGAGCGGC CGCCTGGCGC GGCACCTCTC GAACGAGTGG GTGATCGCGA CTGGATTCGC CTGCTACGCT ATCGGGTTCG CGGCCGCGTG GGCCGCGACC GGACTCGTCG GTACGATGGG AGCGGTCGTG GCCATCGGCG TCGGCGTCGG ACTCCTGATG CCGGTCGTCG ACGCCGCCGT GAGCGATCGG GTCACCACCG AGTACCTGGC CGGGGCGATG AGTCTGCGCA ACAGCACCAC CTTCCTCGGA CGGACCGCCG GTCCGATCGC GTTCGCTGGC TTGGCGATCT CCACCGGGAT CGGATACGAA CCACTCCTGC TCGCCTCAAG TCTCGTCGCG GTCGTCGCGA CCGGCGTTGC CGTCATCGCC GGACCCGTTC GCCTCGCTCG AGTGACTGCT CGCCAACCGT CGACGTGA
|
Protein sequence | MDHGIESGGR AESTDEPSGT VPWGSRTVQI VLTSTALAPL GVPLISPALP VFRDVFGITD AQASLLVSTY FLVGIVLSPF IGVLADRVGR KRVLVGGLLA FGVLGGAMAL APTFEALLAL RVAQGTAAAA IFITTVTIVG DAFDGVQRNA VLGANVAVLS ATAALFPVLG GFLAGIAWNA PFLAYLAAIP IAAFAQAALD EPQRVDDRDG VSYLVDAARA VLTPALAALF AVAFLTEFLL FGVIFTAMPF VLAATLAPVL IGVVILVSET ASMLVALSSG RLARHLSNEW VIATGFACYA IGFAAAWAAT GLVGTMGAVV AIGVGVGLLM PVVDAAVSDR VTTEYLAGAM SLRNSTTFLG RTAGPIAFAG LAISTGIGYE PLLLASSLVA VVATGVAVIA GPVRLARVTA RQPST
|
| |