Gene Htur_4097 details


Gene Information

Locus tag: Htur_4097
Symbol:
ID: 8744725
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013744
Strand:
Start bp: 359200
End bp: 360555
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 60%
IMG OID: 646514657
Product: Beta-fructofuranosidase
Protein accession: YP_003405604
Protein GI: 284167326
COG category:
COG ID:
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
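
The start, end, gene length, and protein length fields can be cross-checked against each other with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(359200, 360555)
print(length)                  # 1356
print(protein_length(length))  # 451
```

Both results agree with the Gene Length and Protein Length values listed in the record.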


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAGATA AGAGTACTAA TCTTGAGCTA AGCAGGCGCA GTATACTCAA AACAACTGGC 
GCAATTGGTG CGGCCGGTCT CGCGCTACCG TTTGGGACTG GCTCCGGAGC CGCTCTCGAG
ACAGGTGAAG CGAGCCAGTG GACTCGCGAA CACGCAAACA GTATCGAACT TACAGACGAT
ACGACTGCTC CGGTCATCGA TGAGAACTCC GACGTAATCT CGGACGACTA CTGGATCTGG
GACACGTGGC CGCTTCGATA CCGCGACGGC TCGATTGCAA AGATCGACGG CTGGCAGGTC
GTCTTCTCCC TCACGGCGTC AAAGGACCTC GTACCGGGCG CTCGGCACAA TGAAGCGACG
ATTCGCTACT TCTACTCCCG GAATGGTCAC GACTGGCAGG AGGGCGGGAC GGCCTTCGAG
AACCCGCTTG GGCACCACCA GTGGGCCGGC TCTGCGATGT ACGATCAGAG CGAGGACCAG
ATTTACCACT TCTACACAGC GACCAGCCCG GAACCGGAGT TCCGCCAGCG ACTCGCACTC
GGCAAGGGCG CGTCCCTCCG GACCAGTCCG CACGGCGTCG AGCTGACCGG TGACCAGGAG
CACGTTATCA TCGGCGAAGC CGACGGGGAC CTCTATCAGA CCCTAGAGCA GTCCCGAGAA
CAGGGCATCG TCTACGCGTT CCGTGACCCG TGGTACTTCG AGCATCCCGA GACCGGCGAG
GATCTCGTCG TGTTCGAGGG GAACACGCCC ACGGGAGGCG ATAGCCCAGA TGATCCGCAA
AGCTACAACG GGAACGTTGG CGTAATGCGA GCGACCAACG ACGAACTCAC TGAGTGGGAG
CTCCTCCCGC CTAACCTGGA GGCGATCGAG GTCAACCAAC AGTTGGAACG CCCGCATTAC
GTCTTCAATA ACGGGAAGTG GTATCTGTTC GTCCTCAGCC ACGAGTTTAC GTTCGCTCCC
GGCCTCAGCG GTCCCGACGC GCTGTACGGG TTCGTGAGCG ATTCGCTCTA CGGAGAATAC
GAACCGCTCA ACGGGAGCGG ACTGGTCCTC GCAAACCCCG AGTCGGCGCC CTTCCAAGCG
TATTCGTGGC TGGCGATGCC CCACGGGAAC GACGTGCTGA TCGAAAGCTT CGAGAACTTC
CGCGGGCTCG ACGACACGTC TCGGGGCGAG ATCAGCCTCG ACGAGGTCGG CCATCTGCCC
CCCGAAGAGC AGAAGGAACT GTTCGGTGGA ACGCTTGCAC CGAGCCTGAA GCTACAACTC
GAGGGGACTA AAACGCGGAT CGTCAGCGAA CTCAATGACG GCCACTTTCT TCCCTCGGGT
GGATCGAACA AGGGGACGAA CGGAAATAAT CAGTAA
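
The gene sequence is printed in space-separated blocks of 10 bases, 60 per line. A minimal sketch of how such text could be normalized and its GC fraction computed is shown below. The helper names are hypothetical, and the example runs on the first line of the sequence only, so its GC value is not expected to match the 60% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGTAGATA AGAGTACTAA TCTTGAGCTA AGCAGGCGCA GTATACTCAA AACAACTGGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Passing the full sequence text through `clean_sequence` first gives a string whose length can be compared with the Gene Length field.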
 
Protein sequence
MVDKSTNLEL SRRSILKTTG AIGAAGLALP FGTGSGAALE TGEASQWTRE HANSIELTDD 
TTAPVIDENS DVISDDYWIW DTWPLRYRDG SIAKIDGWQV VFSLTASKDL VPGARHNEAT
IRYFYSRNGH DWQEGGTAFE NPLGHHQWAG SAMYDQSEDQ IYHFYTATSP EPEFRQRLAL
GKGASLRTSP HGVELTGDQE HVIIGEADGD LYQTLEQSRE QGIVYAFRDP WYFEHPETGE
DLVVFEGNTP TGGDSPDDPQ SYNGNVGVMR ATNDELTEWE LLPPNLEAIE VNQQLERPHY
VFNNGKWYLF VLSHEFTFAP GLSGPDALYG FVSDSLYGEY EPLNGSGLVL ANPESAPFQA
YSWLAMPHGN DVLIESFENF RGLDDTSRGE ISLDEVGHLP PEEQKELFGG TLAPSLKLQL
EGTKTRIVSE LNDGHFLPSG GSNKGTNGNN Q
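
The protein sequence can be related back to the gene sequence by translating codon by codon. The sketch below builds the standard codon table, on the assumption that translation table 11 assigns the same amino acids to internal codons as the standard code and differs only in which start codons it permits. The `translate` helper is illustrative, and it stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases and last 36 bases of the gene sequence in the record.
print(translate("ATGGTAGATA AGAGTACTAA T"))                 # MVDKSTN
print(translate("GGATCGAACA AGGGGACGAA CGGAAATAAT CAGTAA"))  # GSNKGTNGNNQ
```

The two outputs match the first seven and last eleven residues of the protein sequence listed above.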