Gene Htur_5112 details


Gene Information

Locus tag: Htur_5112
Symbol:
ID: 8745660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013747
Strand:
Start bp: 6099
End bp: 7109
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 65%
IMG OID: 646515469
Product: hypothetical protein
Protein accession: YP_003406416
Protein GI: 284176139
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5653] Protein involved in cellulose biosynthesis (CelD)
TIGRFAM ID:
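
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record.
start_bp = 6099
end_bp = 7109
gene_length_bp = 1011
protein_length_aa = 336


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Both checks hold for this record: 7109 - 6099 + 1 = 1011 bp, and 1011 / 3 - 1 = 336 aa.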


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0964659
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTATCG AAGTAACGAC GCTCGATCCG CGGGCCGACG CCGACGAGTG GAACCGATAC 
GTCGAGCGCT CGGACGGGAC GAACCCGTTT TACCGAGCCG AAGCGCTCCG ACTGCAGGCG
ACGGACACCG GGTCGACGCC GCACCTGCTC GTCGGGTTTA AGGGCCAAGA GCCGGTCGGG
CTCTTCCCCG TCTTCGAGTA CGCCAAGGGA CCGATCACCG GAGCGTTCTC GCCAGCGCCG
TTCTCGTGGT CGTGTTACCT CGGACCGGCG CTGTTGAACG TCGACAAACT CAAACAGCGC
AAGGCCGACC GCCGAACGCG GCGGTTCCTC GAGGGTAGTC TCGCCTACAT CGATCGGAAA
ATCTCGCCGG TGTACGCCAA GTTCGTCACC GCCGAGTTCG ACGACCTCCG GACGTTCGCC
TGGAACGAGT ACACCGTCGA GCCGGGCTAC ACCTACGTCG TCGACCTCGA GGGAAGCGAG
GACGACCTGT TGAAGCGGTT CAGCAGCGAC GCGCGGAGCA ACGTCCGTAA CGCCGATCCG
GACGCGTACG TCGTCGAAGA GGGCGACGGG GACGACGTCG ATCGCATCGT CGAGCAGGTC
GCGGCCCGCT ACGAGAGTCA GGGCAAGCCG TTCCAGCTGA GCACGGCGTT CGCCCGTTCG
ATGTACGAAC GGCTGCCCGA CGGCGCGATC CGGCCGTACG TCTGTCGCGT CGACGGGGCG
TTCGTCGGCG GCATCCTCGT CGTCGAGTCC GAGCGGACCC GCTACCGGTG GCAGGGCGGC
GTCAAACCCG ACACCGACGT CGATGTCCCG ATCAACGACC TGCTCGACTG GCACGTCATG
CGCGACGGTC TTCGCGACGG GCTCGAGCGA TACGACCTCG TCGGCGCCGG CGTCCCGAGC
ATCAACCGGT ACAAGGCGAA GTTCAACCCG CGCCTCGAAA CCCACTACGA GATCACGGCG
GGCTCGTTCG GAATCGATCT GCTGATCGAT CGCTACCGAA AACACAGCTG A
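
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation such as GC content. The following is a minimal sketch of that step; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used as sample input, so the result applies to that row rather than to the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace between sequence blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, used here only as sample input.
first_row = "ATGAGTATCG AAGTAACGAC GCTCGATCCG CGGGCCGACG CCGACGAGTG GAACCGATAC"
seq = join_blocks(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence to the same functions would give the value to compare with the GC content listed in the record.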
 
Protein sequence
MSIEVTTLDP RADADEWNRY VERSDGTNPF YRAEALRLQA TDTGSTPHLL VGFKGQEPVG 
LFPVFEYAKG PITGAFSPAP FSWSCYLGPA LLNVDKLKQR KADRRTRRFL EGSLAYIDRK
ISPVYAKFVT AEFDDLRTFA WNEYTVEPGY TYVVDLEGSE DDLLKRFSSD ARSNVRNADP
DAYVVEEGDG DDVDRIVEQV AARYESQGKP FQLSTAFARS MYERLPDGAI RPYVCRVDGA
FVGGILVVES ERTRYRWQGG VKPDTDVDVP INDLLDWHVM RDGLRDGLER YDLVGAGVPS
INRYKAKFNP RLETHYEITA GSFGIDLLID RYRKHS
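
The protein sequence can be related to the gene sequence by codon-wise translation. The sketch below builds the codon table in the NCBI base order (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, which does not matter here because the gene starts with ATG. The `translate` function is a hypothetical helper written for illustration, not the tool used to produce this record.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAGTATCG AAGTAACGAC GCTCGATCCG"))
```

For the first 30 bases this yields MSIEVTTLDP, which matches the first block of the protein sequence.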