Gene Htur_3989 details

Gene Information

Locus tag: Htur_3989
Symbol:
ID: 8744617
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013744
Strand:
Start bp: 244512
End bp: 245732
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 67%
IMG OID: 646514564
Product: hypothetical protein
Protein accession: YP_003405511
Protein GI: 284167233
COG category: [S] Function unknown
COG ID: [COG5441] Uncharacterized conserved protein
TIGRFAM ID:
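
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 244512, End bp 245732.
length = gene_length_bp(244512, 245732)
print(length)                     # 1221
print(protein_length_aa(length))  # 406
```

Under these assumptions both results agree with the Gene Length and Protein Length fields in the record.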


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.138068
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGTCG TCATCATCGG AACGTTGGAT ACGAAAGCGG AAGAGATCGG CTTCGCCAGG 
GACGTCCTCG AAGCCCAGGG TGTGGACGTC CACGTCGTCG ACGCGGGCGT GATGGGCGAA
CCGGGATTCG AGCCGGAGAC GACTGCGAGC GAGGTCGCCG ATGCAGCGGG AACGACCCTC
GAGCACCTCC GCGAGGAGGC TGACCGCGGC GAGGCGATAG AGGCGATGGG CGATGGCGCG
GCCGAGGTCG CCCAGCGACT CCACGATGAG GGTGTCCTCG ACGGCGTCCT CGGATTGGGC
GGGTCGGGAA ACACCTCGAT CGCGACGGCG GCCATGCGGG CGCTGCCCGT CGGCGTCCCG
AAGGTCATGG TTTCGACGAT GGCGTCGGGC GATACGGAGC CCTACGTCGG GTCCCGGGAC
GTCACGATGA TGTACTCGGT CGCGGATATC GAGGGGTTGA ACCAACTTTC GCGGCGGATC
ATCTCCAACG CCGCGCTGGC GATGGTCGGC ATGGTCTCGA ACGACCCCGA CGTCGACGTA
GAAGAGCGAC CCACGATCGC CATGACGATG TTCGGCGTCA CGACGCCCTG CGTGCAGGCG
GCCCGCGAGC GACTCGAGGA CATGGGCTAC GAGGCGATCG TCTTCCACGC CACCGGGACC
GGCGGGCGCG CAATGGAATC GCTCGTCGAG GAGGGTGTCG TCGACGGTGT GCTCGACGTC
ACGACGACCG AGTGGGCCGA CGAACTGGTC GGCGGCGTCC TGAGCGCCGG TCCGGACCGA
CTCGAGGCGG CCGGCGACGA GGGCATCCCG CAGGTCGTGT CGACGGGCGC GCTCGACATG
GTCAACTTCG GCCCTCGCGA TTCGGTCCCC GAGGAGTTCG AGGGCCGTCA GTTCCACGTT
CACAACCCGC AGGTGACACT CATGCGGACG ACGCCCGAGG AAAACGCCGA ACTCGGGGAG
ATCATCTCCG AGAAGCTCAA CGACGCGACC GGACCCACCG CGCTCGTCCT TCCCCTCGAG
GGCGTCTCGG CGATCGACGT CGAGGGAGAG GACTTCTATG ATCCCGAGGC CGACGCGGCG
CTGTTCGACG CGCTCCGGTC GTCGCTCGAA GACGACGTCG AACTCCTCGA GATGGAGACC
GACATCAACG ACGAGGCCTT CGCGGCGAAA CTGGCGGAGA CCCTCGACGG GTACATGCGA
GAGGCTGGAC GAGCCCCGTA A
 
Protein sequence
MSVVIIGTLD TKAEEIGFAR DVLEAQGVDV HVVDAGVMGE PGFEPETTAS EVADAAGTTL 
EHLREEADRG EAIEAMGDGA AEVAQRLHDE GVLDGVLGLG GSGNTSIATA AMRALPVGVP
KVMVSTMASG DTEPYVGSRD VTMMYSVADI EGLNQLSRRI ISNAALAMVG MVSNDPDVDV
EERPTIAMTM FGVTTPCVQA ARERLEDMGY EAIVFHATGT GGRAMESLVE EGVVDGVLDV
TTTEWADELV GGVLSAGPDR LEAAGDEGIP QVVSTGALDM VNFGPRDSVP EEFEGRQFHV
HNPQVTLMRT TPEENAELGE IISEKLNDAT GPTALVLPLE GVSAIDVEGE DFYDPEADAA
LFDALRSSLE DDVELLEMET DINDEAFAAK LAETLDGYMR EAGRAP
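
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-percentage calculation; the helper names are hypothetical, and the sketch assumes the input contains only unambiguous bases (A, C, G, T) and whitespace.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and line breaks alike.
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGAGCGTCG TCATCATCGG AACGTTGGAT ACGAAAGCGG AAGAGATCGG CTTCGCCAGG"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))
```

Applying the same two functions to the full gene sequence should give a value close to the GC content field in the record, provided every line is pasted in unchanged.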