Gene Htur_2304 details


Gene Information

Locus tag: Htur_2304
Symbol:
ID: 8742910
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 2371598
End bp: 2372929
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 72%
IMG OID: 646512889
Product: glucose sorbosone dehydrogenase
Protein accession: YP_003403857
Protein GI: 284165578
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
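
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; neither convention is stated in this record, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_length = span_length(2371598, 2372929)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)
```

Under those assumptions the computed values agree with the listed Gene Length (1332 bp) and Protein Length (443 aa).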


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGCTCTC CGACGCGCCG ACGCCTGCTG GCGGCGACCG GAGCCGCGGT TCCCGCGCTC 
GCGGGCTGTT TCACGGAGTC GGGATCCGAC GCGACGGAAC TCGCGACGCC CGAGTCCGTC
CCCGCCGACG ACTGGGTCAA ACCCGACTGG CGGCCGGCCG ACGCCGTCCC GAGCGAGGAC
GACGTCGCGG CGACGACGGT CGTCTCCGAT CTCGCGATTC CGTGGGATCT CACCGTCGCC
GCCGGCGACG CCTTCGTCAC CGAACGCGAC GGCGGCGTTC GTCGATTCGA TGCGGACACG
CTGGCCGAAG ACGCCGATCT GGGACCCAAC GACGGCGAGA CACTCCTCGA GAGCGCGTCC
CTCCCCGATC GCGCGTCGCC CGGCGAGGGC GGGACCCTCG GCGTCGCGGC CCACCCCGAC
TATCCCGACA CCCCCGACCT GTTCGTCTAC TACACGGCCG ACGACGGGGG CGTCTCGAAC
CGGGTCGTCC GCTACGACCT CGAGGCCGAC GCCCTCGAGA CGATCCTCGA GGGAATTCCG
GGGTCGTCGG TCCACAACGG CGGGCGGATC GCGTTCGGCC CCGACGACCA CCTCTGGGTG
CTGACGGGCG ACGCGAGGGA GCCGGCGCTG TCGCAGGATC CCGGGTCCCG CGCGGGTGCC
GTCTTGCGGG TGACGCCCGA CGGCGAGCCC CACCCCGAGA ATCCCGACTG GGGCGACGAC
GGCGACCGAC GCACGTACAC GCTCGGCCAC CGCAACCCGC AGGGACTCGA CTTCACGCCG
CAGGGAACGC CGATCCTCGC CGAACACGGG CCGGGCGCGC GGGACGAACT CTCGATCCTC
CGGCCGGGCG GCAACTACGG CTGGGATATC GTCCGCGGCG GGCCGGACGA CCCCGAGTAC
GGGAGCTACG ACGAGTACGA GGCGGCGACG CCGCCGGTCG TCAACACCGG CCCCAAAACG
ACGTGGGCGC CCTCCGGACT GGCGTTCTAT GACGACGACG CGATCGGCCC GTGGGAGAAT
ACCGTCCTCG TCTGCGGGCT CACCTCGAGC GCGCTGTCCG TCGTCGGGCT CACGCCCCGA
AGCGACTCGG ACGGCGACGA CGAGGCGAGT TCTGACGACA CCGACGGCGT CCGGTACGAC
GCCGACTGGC TCGACGATCG CGTTACGGCG ACGGTCCATC GGCTGTTCGC CGACGAGTGG
GGTCGCCTTC GACACGTCGA GCCCGGGCCC GACGGCTCGC TGTACCTGCT CACGTCGAAC
CGGGACGGTC GCGCGGACGG CCCGTTTCCC CGGACGAACG ACGACCGGAT CGTCAGGCTG
GACCCGCGGT AG
 
Protein sequence
MCSPTRRRLL AATGAAVPAL AGCFTESGSD ATELATPESV PADDWVKPDW RPADAVPSED 
DVAATTVVSD LAIPWDLTVA AGDAFVTERD GGVRRFDADT LAEDADLGPN DGETLLESAS
LPDRASPGEG GTLGVAAHPD YPDTPDLFVY YTADDGGVSN RVVRYDLEAD ALETILEGIP
GSSVHNGGRI AFGPDDHLWV LTGDAREPAL SQDPGSRAGA VLRVTPDGEP HPENPDWGDD
GDRRTYTLGH RNPQGLDFTP QGTPILAEHG PGARDELSIL RPGGNYGWDI VRGGPDDPEY
GSYDEYEAAT PPVVNTGPKT TWAPSGLAFY DDDAIGPWEN TVLVCGLTSS ALSVVGLTPR
SDSDGDDEAS SDDTDGVRYD ADWLDDRVTA TVHRLFADEW GRLRHVEPGP DGSLYLLTSN
RDGRADGPFP RTNDDRIVRL DPR
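
The two sequence blocks above can be related to each other and to the GC content field with a short script. This is a sketch only, not the method used by the source database: the helper names are hypothetical, the codon table is built from the standard amino-acid assignment shared by translation tables 1 and 11, and the special handling of alternative start codons in table 11 is ignored (the gene here begins with ATG, so that simplification should not matter for this record).

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino-acid assignment
# (the assignment is the same for translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter display groups."""
    return "".join(seq_block.split()).upper()


def gc_percent(seq_block: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq_block)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq_block: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq_block)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First display line of the gene sequence above.
first_line = "ATGTGCTCTC CGACGCGCCG ACGCCTGCTG GCGGCGACCG GAGCCGCGGT TCCCGCGCTC"
print(translate(first_line))
```

Passing the full Gene sequence block to `translate` and `gc_percent` allows the result to be compared with the Protein sequence block and the listed GC content.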