Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4783 |
Symbol | |
ID | 8745373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | - |
Start bp | 398391 |
End bp | 400496 |
Gene Length | 2106 bp |
Protein Length | 701 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646515281 |
Product | hypothetical protein |
Protein accession | YP_003406228 |
Protein GI | 284172846 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3866] Pectate lyase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.292143 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACACA AACGACGATC CTTCCTGCGA GCGATCGGTG CGGGGAGCCT CGGACTGACG GCAGCAGCGG TTACCAGTGG CACGGCCGCG GCGGCGACCA TCATCACGAT TCGCGGCGGT GGTGCGGACA TCTGGAGTAC GGCGGACGCG TTCCACTACT ACTACGACAA CGTCAGCGGA GACTTCGACG TACAAGTGCG AAACACCGCG CTCGAGAACA CTGACCCCAA CGCGAAGACC GGCATCATGA TCCGGGAGTC ACTGGATCCC ACGGTGAAGA ACGTGATGCT CCGGCGGACG CCCAGCGGCG AGGCGTCGCT CCAGTGGCGG CCGGAAGCCG GCGTCGATAC GGTCAGCACG ACGTCGGGCG GCGAAGACGA GAGCGAAGTC GACGGTGGGA GCCTCGAGGC CGAGTGGCTG CGCCTGAAGC GGAGCGGCGA CGTCTTCGAG GCGTACGGCT CGAATGACGG GGAGAGCTGG ACGCTGATCG CCGATATCGA CGCGGAACAC GTCGAATTGA GCGACGACGC GTACGTCGGC CTCCCCGTGA CGAGCCACAA CGTCGGCACG CTCTGTACGG CCGAACTACG CGATCTGACG GGACTCGAAC CGACCGCCAA CCGCGATATC GGCGACGTCG ACGTCGCCGG AAGCGTCGAC GTTGAAGAGG GCGTCCCGTT CGTCTCGACC GGCGATGCGA CCGACGTGAC GGCCACGGGG GCGACGGTAC GCGGCGAACT GACCGATCTG GGCGGCGCCG AGTCGGCCGA CTGCTACGTC GAGTATCGGG AGGTTCCGAC CGAGTCCTGG GCGACGACCG CCGCGGGAAC GCTCGAGGAG ACGGGGGCGT TCGGCGTCCG CCTCGACGGC CTCACGAGCA GGCGGTACTA CGAGTACCGC GCGGTCATCG AGACGAGCGA CGGGGATTGG GCGACCGGGT CGACCAGAAC GGTCGGGACG CCCGGTCGGT CGAACGGCCG GACGGTTCGA AACGGGCCAC GAAGCGCATC ATACGTCGAC CTCGCCGACG GGTTCGCGGA TCCGGCGCCG TGGCTGGACG ACGACACGCC CGTCATCAAG ATCACCGAGC CGACGCGACG CCAGCTGTCG GCCGCGGTCG GGGTCGACGG GCCGCGTCTG GTCGTCTTCG AGACCAGCGG TGTGATCGAT CTCGAGGAGC AGCGCCTGAC GGTGGTCAAC GATGAACTCT ACCTCGCGGG ACAGACGGCC CCGTCGCCGG GGATCACGCT CACGCGCGGC GATCTCTGGA TCGACGCGGA CGATTGCGTC GTCCAGCACC TGCGGGTCCG GCCCGGCGAC GCCAACCTGA CCGAGGAGAG CGACTGGGAA CCCGACGGGA TCAGAACGGG AGACGGGACC GAGAACAACG TCATCGACCA CTGCACGGCG ACGTGGGGGG TCGACGAGAA CCTCTCGGTC GGCTACGACA CCGAAAACAC GACGGTCTCG AACTGCCTGA TCGCCGAGCC GCTGCAGGAC GCGACCCATC ACAAGGGCGA TCACGGCTAC GGTTCGCTGA TCGGTAACAA CGCGGAAAAC GTCGCGCTCG CGGGCAACGT CTGGGCGCAC AACTACGATC GGAACCCGCG CCTCAAGGAG GGGACCAGAA CCGTCGTTTC GAACAACGTT ATGTATCACT ACAGGGACGG GGCCTGGATG GATCCCGACA CGGAGGCGAG CATCGAAGGC AACGTCTTCC GGCGACCGGT CAGCGACCAG CCCAACGTCT TCGGCGACGG CGACGCGTAC GTCGCCGACA ACGTCCTCGA GGGCGGCGAC AATCCGATGG TCGGCGACGG AATCACGCGA CTCGACTCGC GGCCGCTCTG GCCCGAGGAC CTCGAGGTCT TCGACTCGGA GGACGTCGTC GAACACAACC TCGAGAACGT CGGCGCGCGG CCGGCCGACC GAACCGCTCA CGACGAGCGC GTCCTCGAGC AGCTCCGTAC CGGTGACGGC ACGTACATCG ACAGCCAGGA AGAGGTCGGC GGTTACCCCG ACCTCGAGGT CAACCGGCGG CGGCTGGACG TCCCCCAGAA CGGAACGCAC GCCTGGCTTC GCGCGAAAGC TCGCAGCGTC GAATAG
|
Protein sequence | MAHKRRSFLR AIGAGSLGLT AAAVTSGTAA AATIITIRGG GADIWSTADA FHYYYDNVSG DFDVQVRNTA LENTDPNAKT GIMIRESLDP TVKNVMLRRT PSGEASLQWR PEAGVDTVST TSGGEDESEV DGGSLEAEWL RLKRSGDVFE AYGSNDGESW TLIADIDAEH VELSDDAYVG LPVTSHNVGT LCTAELRDLT GLEPTANRDI GDVDVAGSVD VEEGVPFVST GDATDVTATG ATVRGELTDL GGAESADCYV EYREVPTESW ATTAAGTLEE TGAFGVRLDG LTSRRYYEYR AVIETSDGDW ATGSTRTVGT PGRSNGRTVR NGPRSASYVD LADGFADPAP WLDDDTPVIK ITEPTRRQLS AAVGVDGPRL VVFETSGVID LEEQRLTVVN DELYLAGQTA PSPGITLTRG DLWIDADDCV VQHLRVRPGD ANLTEESDWE PDGIRTGDGT ENNVIDHCTA TWGVDENLSV GYDTENTTVS NCLIAEPLQD ATHHKGDHGY GSLIGNNAEN VALAGNVWAH NYDRNPRLKE GTRTVVSNNV MYHYRDGAWM DPDTEASIEG NVFRRPVSDQ PNVFGDGDAY VADNVLEGGD NPMVGDGITR LDSRPLWPED LEVFDSEDVV EHNLENVGAR PADRTAHDER VLEQLRTGDG TYIDSQEEVG GYPDLEVNRR RLDVPQNGTH AWLRAKARSV E
|
| |