Gene Htur_4403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4403 
Symbol 
ID8745031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp674661 
End bp676139 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content63% 
IMG OID646514940 
Productsugar transporter 
Protein accessionYP_003405887 
Protein GI284167609 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.92249 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTGA TTCACCGACT ATTGCCGGTA GGAGACGACG ATATCGGTCC GTTTGTTATC 
GTTATCTCCG CGCTCGCCGC GCTGAACGGA CTACTGTTCG GGTTCGACAC CGGCGTTATC
TCGGGGGCGT TGCTCTACAT GTCCGAGACG TTCCCCCAAC TCGAGGCGAA CGCGTTCTTG
CAGGGAACCG TCGTCAGCGG TGCGATGGTC GGCGCGATCG TCGGCGCGGC CTTCGGCGGC
CGGCTCGCGG ATCGGATCGG GCGGCGCCGG CTCATCCTGC TCGGCGCCGT CCTGTTCTTC
GTCGGGTCAT TCATCATGGC GGTCGCCCCC ACGGTCGAGA TTCTGATCCT CGGCCGACTC
CTCGACGGGA TCGGGATCGG CTTCGCGTCC GTCGTCGGAC CGCTGTACAT CTCGGAGATG
GCACCGGCGA AGATCCGCGG ATCGCTCGTG ACGCTCAACA ACGTCGCTAT CACGGGGGGA
ATCCTCGTGT CCTACATAAC GAACCAGCTC ATCGCAAACA TGGCATTCGA CGCCGGCCTC
TCGTGGCGGA TCATGCTCGG GCTCGGGATG CTCCCCGCCG TGGTCCTGTT CGGCGGGATC
ATCTTCATGC CGGAGAGTCC GCGGTGGCTC GTCGAAAAGG ACCGAGAGCA GGAGGCTCGA
TCCATCCTGA GTCGCGTCAG GAACGGCACT AACATCGATG CCGAAATGAA GGATATCATG
CAGATGTCCA AGCGCGAGCA GGGGAGCTTT CGCGACCTCC TGCAGCCGTG GCTTCGCCCG
GTCCTGATCG TGGGCCTCGG CCTCGCGATG TTACAGCAGG TCTCGGGAAT CAACGCGGTC
GTCTACTACG CGCCGACGAT ACTGGAGTCG TCCGGATACA GCGACATCGC GTCCCTCTTC
GGGACGATCG GAATCGGCTC GATCAACGTG TTGCTGACGG TCGCCGCGCT GTTCCTGGTC
GACCGCGTCG GCCGTCGACC GCTGTTGCTC TTCGGCCTCG TCGGGATGTG TATCTCGGTG
ACCGTCCTCG CCGGGGCCTA CATGGTTCCC AGCATGGGCG GGATCATCGG TCCGATTACG
GTCGTGAGCC TCATGCTGTT CGTCGGCTTC CACGCGGTCA GTCTCGGCTC GGTCGTCTGG
CTGGTCATCT CCGAAATCTT CCCGCTGAAC GTCCGCGGGG CCGCGATGGG AGTGACGACG
TTGGTCCTCT GGTTCTCGAA CTTCCTCGTC GCACAGTTCT TCCCGTCGCT GTTCGAGATC
GGCCCCACGG TCGCGTTCGG CGTGTTCGCG GGGATCGCGG CGGCCGGGTT CGTCTTCGTG
TACGCGCTGG TCCCGGAGAC GAAAGGCCGG ACCCTCGAGG AGATCGAGGC CGATCTGCGC
GAAACGGGCG TCGCCGACGA TAATCTGGCG CTCAGCGAGC AGGCCGAACA GGTCGATCCG
ACTGAGCAGG TCGATCAGAC CGATCACGTC AACGACTGA
 
Protein sequence
MSLIHRLLPV GDDDIGPFVI VISALAALNG LLFGFDTGVI SGALLYMSET FPQLEANAFL 
QGTVVSGAMV GAIVGAAFGG RLADRIGRRR LILLGAVLFF VGSFIMAVAP TVEILILGRL
LDGIGIGFAS VVGPLYISEM APAKIRGSLV TLNNVAITGG ILVSYITNQL IANMAFDAGL
SWRIMLGLGM LPAVVLFGGI IFMPESPRWL VEKDREQEAR SILSRVRNGT NIDAEMKDIM
QMSKREQGSF RDLLQPWLRP VLIVGLGLAM LQQVSGINAV VYYAPTILES SGYSDIASLF
GTIGIGSINV LLTVAALFLV DRVGRRPLLL FGLVGMCISV TVLAGAYMVP SMGGIIGPIT
VVSLMLFVGF HAVSLGSVVW LVISEIFPLN VRGAAMGVTT LVLWFSNFLV AQFFPSLFEI
GPTVAFGVFA GIAAAGFVFV YALVPETKGR TLEEIEADLR ETGVADDNLA LSEQAEQVDP
TEQVDQTDHV ND