Gene Htur_2110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_2110 
Symbol 
ID8742710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp2179654 
End bp2180973 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content62% 
IMG OID646512692 
Productalpha amylase catalytic region 
Protein accessionYP_003403666 
Protein GI284165387 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGACG ATACCGATCG ACGATCCGGT GACGGAATCG ACGACGCGAT GAGTAGACGA 
GCCCTCATCA GCACCGCCGC GATGGCCGGT GTTTCCCTCA CCGGCGTCGG GTCCGCCTCG
GCGGGCACGG GCAGCGGCGA ACGGGTGTTC TTCCAGTACT TCCACGAGAC GTGGCCGACG
ATCACGGACA CCCTCTCGAC AGTTGCGGAC CGCGGCTACG ACGGCGTCTG GATCCAGGCG
CCTCAAGAGA GCGAGCTGAC CTGGAGCGAC CAGGACGGCC GGAACGATCC GCCGTTGGGC
TACCAGCCGG TCGACTTCCG CTCCTTCGAC AGCGAGTTCG GGACCGAAGC GGACCTCAAC
CGGCTCGTCG AGACCGCACA CGAACACGGC CTCGAGGTGT ACGTCGACTG CGTAATGAAC
CACATGGCCG CAAATCGCGG CTACGACTTC CCGCAGTTCA AGGAGAAACA CTTCCACACT
CACGTCGGTT CGATCGACGA CTGGGACGAC GAACACCAGG TCGAGCACGG GAACCTCCTC
GGGTTGAAGG ACCTCGCGCA ACTCGAGGAC CACGGACACG AGGACACCGC GCCGTACGTC
CGCGAGCAGC TGTACGACTA CATGAAAAAG ATCGCGGACA CCGGGGCCGA CGGCTACCGC
TACGATGCGG TCAAACACGT CGAGCGCGAA TACTGGGAGC AGTACGCCAA TCAGTGGGCT
GACGAGTTCG GCATGAGTCG AGTCGGAGAG GTGTTTGACG GCGGCGTCGA CTACGTGCAG
AACTACATCG ATACCGGAAT GAACGCCTTC GACTACCCGC TGTACTTCGT CATGGAGGAG
GTCTTCGACT ATGGTGATAT GAGCAAACTC GACGGTGCGG GAGTTGTCGC CCAGGATCCG
TTCCACTCTT GGCCGTTCGT TCAGAATCAC GACGAGGGCG CGCCGCCACA GTACCACCTC
GCACACGCCT TCGTTCTCAC GATCGAGGGA ACGCCGATGG TATACAATCT CTACCCCGAC
GAGATCCTCG ACGACGACGC GATCACCAAC ATGGTGTGGG TCAAGACGAA CCTCGCCGGC
GGTACGACCT ACTGGCGACA CACCGATTCC GACCTCGCAG TCTACGAGCG GCAGAACAAC
CTGCTCGTCG GTCTCAACAA CAATACCGAC AGCTGGCGAA GTAAGTGGGT GTACACGACC
TGGAGCGACG AGACGCTCAA AGACTACAGT GGCAACGCCG ACGACATCGA CGTCAACGGT
GACGGCTGGG TCGAGGTCTC GGTTCCGCCC GAGGGGTGGG TGTTCTACGC GCCGTACTGA
 
Protein sequence
MSDDTDRRSG DGIDDAMSRR ALISTAAMAG VSLTGVGSAS AGTGSGERVF FQYFHETWPT 
ITDTLSTVAD RGYDGVWIQA PQESELTWSD QDGRNDPPLG YQPVDFRSFD SEFGTEADLN
RLVETAHEHG LEVYVDCVMN HMAANRGYDF PQFKEKHFHT HVGSIDDWDD EHQVEHGNLL
GLKDLAQLED HGHEDTAPYV REQLYDYMKK IADTGADGYR YDAVKHVERE YWEQYANQWA
DEFGMSRVGE VFDGGVDYVQ NYIDTGMNAF DYPLYFVMEE VFDYGDMSKL DGAGVVAQDP
FHSWPFVQNH DEGAPPQYHL AHAFVLTIEG TPMVYNLYPD EILDDDAITN MVWVKTNLAG
GTTYWRHTDS DLAVYERQNN LLVGLNNNTD SWRSKWVYTT WSDETLKDYS GNADDIDVNG
DGWVEVSVPP EGWVFYAPY