Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_5112 |
Symbol | |
ID | 8745660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013747 |
Strand | - |
Start bp | 6099 |
End bp | 7109 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646515469 |
Product | hypothetical protein |
Protein accession | YP_003406416 |
Protein GI | 284176139 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0964659 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATCG AAGTAACGAC GCTCGATCCG CGGGCCGACG CCGACGAGTG GAACCGATAC GTCGAGCGCT CGGACGGGAC GAACCCGTTT TACCGAGCCG AAGCGCTCCG ACTGCAGGCG ACGGACACCG GGTCGACGCC GCACCTGCTC GTCGGGTTTA AGGGCCAAGA GCCGGTCGGG CTCTTCCCCG TCTTCGAGTA CGCCAAGGGA CCGATCACCG GAGCGTTCTC GCCAGCGCCG TTCTCGTGGT CGTGTTACCT CGGACCGGCG CTGTTGAACG TCGACAAACT CAAACAGCGC AAGGCCGACC GCCGAACGCG GCGGTTCCTC GAGGGTAGTC TCGCCTACAT CGATCGGAAA ATCTCGCCGG TGTACGCCAA GTTCGTCACC GCCGAGTTCG ACGACCTCCG GACGTTCGCC TGGAACGAGT ACACCGTCGA GCCGGGCTAC ACCTACGTCG TCGACCTCGA GGGAAGCGAG GACGACCTGT TGAAGCGGTT CAGCAGCGAC GCGCGGAGCA ACGTCCGTAA CGCCGATCCG GACGCGTACG TCGTCGAAGA GGGCGACGGG GACGACGTCG ATCGCATCGT CGAGCAGGTC GCGGCCCGCT ACGAGAGTCA GGGCAAGCCG TTCCAGCTGA GCACGGCGTT CGCCCGTTCG ATGTACGAAC GGCTGCCCGA CGGCGCGATC CGGCCGTACG TCTGTCGCGT CGACGGGGCG TTCGTCGGCG GCATCCTCGT CGTCGAGTCC GAGCGGACCC GCTACCGGTG GCAGGGCGGC GTCAAACCCG ACACCGACGT CGATGTCCCG ATCAACGACC TGCTCGACTG GCACGTCATG CGCGACGGTC TTCGCGACGG GCTCGAGCGA TACGACCTCG TCGGCGCCGG CGTCCCGAGC ATCAACCGGT ACAAGGCGAA GTTCAACCCG CGCCTCGAAA CCCACTACGA GATCACGGCG GGCTCGTTCG GAATCGATCT GCTGATCGAT CGCTACCGAA AACACAGCTG A
|
Protein sequence | MSIEVTTLDP RADADEWNRY VERSDGTNPF YRAEALRLQA TDTGSTPHLL VGFKGQEPVG LFPVFEYAKG PITGAFSPAP FSWSCYLGPA LLNVDKLKQR KADRRTRRFL EGSLAYIDRK ISPVYAKFVT AEFDDLRTFA WNEYTVEPGY TYVVDLEGSE DDLLKRFSSD ARSNVRNADP DAYVVEEGDG DDVDRIVEQV AARYESQGKP FQLSTAFARS MYERLPDGAI RPYVCRVDGA FVGGILVVES ERTRYRWQGG VKPDTDVDVP INDLLDWHVM RDGLRDGLER YDLVGAGVPS INRYKAKFNP RLETHYEITA GSFGIDLLID RYRKHS
|
| |