Gene Htur_4231 details


Gene Information

Locus tag: Htur_4231
Symbol:
ID: 8744859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013744
Strand:
Start bp: 502212
End bp: 503459
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 68%
IMG OID: 646514777
Product: major facilitator superfamily MFS_1
Protein accession: YP_003405724
Protein GI: 284167446
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
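
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the record states neither convention explicitly, so treat both as assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 502212
end_bp = 503459
gene_length_bp = 1248
protein_length_aa = 415

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons.
# Assumption: the last codon is a stop and yields no residue.
codons = span // 3

assert span == gene_length_bp
assert span % 3 == 0
assert codons - 1 == protein_length_aa
```

Under these assumptions the three fields agree: 1248 bases give 416 codons, one of which is the stop.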


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCACG GAATCGAATC TGGCGGACGA GCGGAATCGA CGGACGAACC GAGCGGGACC 
GTGCCGTGGG GATCCCGGAC GGTCCAGATC GTGTTGACGA GTACGGCGCT CGCACCGCTC
GGCGTGCCAC TCATCAGCCC CGCACTGCCG GTCTTTCGCG ACGTGTTTGG GATCACCGAC
GCACAGGCGA GCCTCCTGGT GAGCACGTAC TTCCTCGTCG GGATCGTCCT CTCGCCGTTC
ATCGGCGTCC TCGCCGATCG AGTCGGCCGA AAGCGGGTTC TGGTCGGGGG ACTACTCGCG
TTCGGCGTCC TCGGCGGTGC GATGGCGCTC GCGCCGACGT TCGAAGCCCT GCTCGCGCTG
CGCGTCGCAC AGGGGACCGC AGCGGCGGCG ATCTTCATCA CGACCGTCAC GATCGTGGGC
GACGCGTTCG ACGGCGTCCA GCGAAACGCA GTCCTGGGGG CAAATGTCGC GGTCCTCTCG
GCTACCGCCG CGCTGTTTCC CGTCCTCGGC GGGTTCCTCG CAGGAATCGC GTGGAACGCG
CCGTTTCTCG CGTACCTAGC CGCGATCCCG ATCGCCGCGT TCGCGCAGGC CGCGCTGGAC
GAACCACAGC GCGTCGACGA CAGAGACGGG GTTTCGTACC TCGTCGATGC CGCACGAGCG
GTTCTCACGC CGGCGCTCGC GGCGCTGTTC GCCGTCGCGT TCCTCACGGA GTTCCTGCTG
TTCGGCGTGA TCTTCACGGC GATGCCGTTT GTCCTCGCGG CGACGCTCGC CCCCGTACTG
ATCGGGGTCG TGATCCTGGT CTCCGAGACG GCGTCGATGC TGGTCGCGCT CTCGAGCGGC
CGCCTGGCGC GGCACCTCTC GAACGAGTGG GTGATCGCGA CTGGATTCGC CTGCTACGCT
ATCGGGTTCG CGGCCGCGTG GGCCGCGACC GGACTCGTCG GTACGATGGG AGCGGTCGTG
GCCATCGGCG TCGGCGTCGG ACTCCTGATG CCGGTCGTCG ACGCCGCCGT GAGCGATCGG
GTCACCACCG AGTACCTGGC CGGGGCGATG AGTCTGCGCA ACAGCACCAC CTTCCTCGGA
CGGACCGCCG GTCCGATCGC GTTCGCTGGC TTGGCGATCT CCACCGGGAT CGGATACGAA
CCACTCCTGC TCGCCTCAAG TCTCGTCGCG GTCGTCGCGA CCGGCGTTGC CGTCATCGCC
GGACCCGTTC GCCTCGCTCG AGTGACTGCT CGCCAACCGT CGACGTGA
 
Protein sequence
MDHGIESGGR AESTDEPSGT VPWGSRTVQI VLTSTALAPL GVPLISPALP VFRDVFGITD 
AQASLLVSTY FLVGIVLSPF IGVLADRVGR KRVLVGGLLA FGVLGGAMAL APTFEALLAL
RVAQGTAAAA IFITTVTIVG DAFDGVQRNA VLGANVAVLS ATAALFPVLG GFLAGIAWNA
PFLAYLAAIP IAAFAQAALD EPQRVDDRDG VSYLVDAARA VLTPALAALF AVAFLTEFLL
FGVIFTAMPF VLAATLAPVL IGVVILVSET ASMLVALSSG RLARHLSNEW VIATGFACYA
IGFAAAWAAT GLVGTMGAVV AIGVGVGLLM PVVDAAVSDR VTTEYLAGAM SLRNSTTFLG
RTAGPIAFAG LAISTGIGYE PLLLASSLVA VVATGVAVIA GPVRLARVTA RQPST
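
The gene and protein sequences above can be compared programmatically. The following is a minimal sketch, not the database's own method: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is built by hand from the codon assignments of translation table 11, which match the standard code. Alternative start codons, which table 11 permits, are not handled; the gene here starts with ATG, so that does not matter for this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_content(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons and drop a trailing stop, if present."""
    dna = clean(dna)
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3)]
    return "".join(residues).rstrip("*")


# The first and last codons of the gene sequence listed above.
print(translate("ATGGACCACG GAATCGAA"))  # MDHGIE
print(translate("CAACCGTCGA CGTGA"))     # QPST
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would check the two listings against each other; `gc_content` can be compared with the GC content field in the same way.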