Gene Htur_3042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3042 
Symbol 
ID8743661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3125475 
End bp3126578 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content70% 
IMG OID646513627 
Productsulphate transporter 
Protein accessionYP_003404582 
Protein GI284166303 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCATT CGTTCCGGTC TGGGGCCGGC TCCGCGCTCG AGTTTTCGAC GGGCGAGCTG 
ACAGGAGCGC TAGGTGATTC GGTTACGGTA TTGCCGCTGG TAGTCGCGCT GGCGGCGACG
ACGAGTGTCT CCCTGCCTCA CGTACTGGTC GGCTTCGGCG TCTTCCAGAT CGTCTGGGGA
CTCTACTACG GACTACCGCT GTCCGTCGAA CCGATGAAGG CCTTGATCGG GCTGGCGATC
GTCGGGACGC TCACCTATGT GGAACTCGCC GCCGCCGGCC TGGTAGCGGG GGGCATACTG
CTCGCGGTGG GGAAACTGGG GCTCGTCGGC CGACTCCAGC GGGTCGTCGG CGAACCCGTG
ATCCGCGGCG TACAGTTCGC CGTCGCCTTG CTCCTCCTCG AGGCGGCCGT CGACCTCTCG
ACGGGGAACC TCCCGGTCGC GATCGGCGGG CTAGCCGTCG TCGGCCTGCT AGCGCTGGTC
GGCTACCGGC AGGCCAGCGT GCTGGTCGTG CTCGGGCTCG GCGCCCTCAC GGCCGTCACG
ACGACGGGAA TCCCGACACC GCAGGTGCCC GCTCTCGCCG TCTTCCCGGC GGGCGGGCCG
ACCCTGTCTT CCGCCGCGCT CGAGGGGACC GTCGCACAGT TGGGGATGAC GGTCGGGAAC
GCGGCGATCG CGACTGCCCT GCTCTGTGGC GATCTCTACG ACCGGGATAT CTCGCCAGAC
GCGCTCTCGA CGAGTATGGG CGTGACCTGT CTGGCGGCGA TTCCGCTCGG CGGCGTGCCG
ATGTGCCACG GCAGCGGCGG ACTCGCGGGG AAGTACGCCT TCGGCGCTCG CACCGGCGGT
GCGAACGTGC TGCTCGGGGT CGGCTACCTC GCGCTGGCGC TCGTGGCCAC CGGGGCCCTG
CTGGCCGCAT TCCCGCTTGC GGTTCTCGGC GTCCTGCTCG TCGTCGTCTC CCTCGAGTTG
GCTCGAGCGG CGTTCGAGCC GGTCTCGGGC CGCCGTTCGC TGGCGTTCGT GCTGGGCGTC
GGCGCCATCG GCCTGTTCAT CAACGTCGGC GTCGCGTTCG TCCTCGGCGC TGGCCTGTTC
TGGGCGTTGG CTGGAGCGGA GTGA
 
Protein sequence
MAHSFRSGAG SALEFSTGEL TGALGDSVTV LPLVVALAAT TSVSLPHVLV GFGVFQIVWG 
LYYGLPLSVE PMKALIGLAI VGTLTYVELA AAGLVAGGIL LAVGKLGLVG RLQRVVGEPV
IRGVQFAVAL LLLEAAVDLS TGNLPVAIGG LAVVGLLALV GYRQASVLVV LGLGALTAVT
TTGIPTPQVP ALAVFPAGGP TLSSAALEGT VAQLGMTVGN AAIATALLCG DLYDRDISPD
ALSTSMGVTC LAAIPLGGVP MCHGSGGLAG KYAFGARTGG ANVLLGVGYL ALALVATGAL
LAAFPLAVLG VLLVVVSLEL ARAAFEPVSG RRSLAFVLGV GAIGLFINVG VAFVLGAGLF
WALAGAE