Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4790 |
Symbol | |
ID | 8745380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 412019 |
End bp | 413644 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646515288 |
Product | glycoside hydrolase family 28 |
Protein accession | YP_003406235 |
Protein GI | 284172853 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0680897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGGAA TCGCCGGCCA ACGGCGACCC GCCGACGACG CAACGTTTAT TTCCGTACTC CGAAATACCC AGACGATGGC AGCACCTGAA ACGGACGGAT TCGATATCCG CGAGTACGGC GCGACCGGCG ACTCGGACGC GCTCGATACC GAGGCGATCC AGACGGCGCT CGACGAGTGC GCCGAGTCCG GCGGGACGGT GTACGTCCCG TCGGGCACGT ACGTGACCGG CCCGCTCCGG GTCGGCGACC AGACGACCCT GCATCTGGAC GCGGGGGCGA CGCTCCAGTT CGTCGGCGAC TACGAGGCAT TCCCGACGGT CCAGAGCCGC TGGGAGGGGT GGAACCAGTA CGGCTTCCAC CCGTGTCTGC TCGTCGACGA CGCCGAGAAC GTCTCGATCA CCGGCCGCGG GACGATCGAC GGCGGCGGCG AGTACTGGTG GCAGTTCTAC GACGCCCCCG AGTCGGAGAT CCCAGACGGA CTCCAGGAGC GACTGGCCGA GTTCGAGGAG AAAAACGAGA AACAGGACGA CGTCAGCAGC TTCACCCACC GGCCGCCGCT GTTCCAGATC TCCGAGTCGG AGAACGTCAG CGTGTCCGGC GTGACCCTCG AGAACTCGCC GTTCTGGAAC ACTCACGTCG TCTACTCCGA GAACGTCACG ATCACCGACG TCAACATCGC GAACCCGGCC GACGCGCCTA ACGGAGACGG GATCGACATC GATTCCTCGC GGTACGTCCG CATCAGTGAC ACCTACATCA ACGCGGGCGA CGACGCGATC TGTATCAAGT CCGGGAAGAA CGCCGAGGGC CGCGAGGTCG GCGAACCGGC GTCCCAGATC ACCGTCACCA ACTGTACGGT CGAGGCCGGT CACGGCGGCG TCGTCATCGG TAGCGAGATG TCCGGCGACG TGCGGGACGT GACGGTTTCT AACTGCACCT TCACCGACAC CGATCGAGGG GTCCGGATCA AGACCGCTCG CGATCGCGGC GGCGTCGTCG AGGACCTCCG GTTCGACAAC ATCGTCATGC GACGGATCGC CTGCCCGTTC ACCATCAACG GCTACTACTT CATGCCCCTC GACAGCGATT CCGAACCGGT CGACGAGGGG ACGCCGATGG TGCGGAACGT CTCCTTTACC AACATCACTG CCCGTCAGGT CGAGACCGCC GGCTTCTTCG CCGGCCTCCC CGAGCAGTAC TTCGAGGGAA TTTCGTTCAA CGACGTGCAG ATCGACGCCA CCCGGCCGCT CGACGCGACG GACCTCGATC CCGCGATGGC CTACGACTAC GAGCAGACCC ACGCCCTCTT CTGTAAGTCC ATCGCCGACA TCTCGTTCAC CGACGTCCGG ATCCGGACGC CCGGCGGCCC GGCGATGCAG TTCGAGGACA CCGAGGACGT GACGATCGAC GGCCTGCGGG TCCCCGACGA CCAGGACGCG CCCGTGGTCT CCCTGACGAA CGTCGATCGG ACTCGAGTCC GCGGCTGTGC GCCCCAGTCG GAGGGACCGT TCCTCGAGGC CGGCGGGTCG GAAACGCGCG AAATTTCGTT CGCCGGTAAC CACGGATCGC TGGCCGACGA CACCGAAATC GCCGACGACA GTGACGCGAC GATCGACCGC TCCTGA
|
Protein sequence | MSGIAGQRRP ADDATFISVL RNTQTMAAPE TDGFDIREYG ATGDSDALDT EAIQTALDEC AESGGTVYVP SGTYVTGPLR VGDQTTLHLD AGATLQFVGD YEAFPTVQSR WEGWNQYGFH PCLLVDDAEN VSITGRGTID GGGEYWWQFY DAPESEIPDG LQERLAEFEE KNEKQDDVSS FTHRPPLFQI SESENVSVSG VTLENSPFWN THVVYSENVT ITDVNIANPA DAPNGDGIDI DSSRYVRISD TYINAGDDAI CIKSGKNAEG REVGEPASQI TVTNCTVEAG HGGVVIGSEM SGDVRDVTVS NCTFTDTDRG VRIKTARDRG GVVEDLRFDN IVMRRIACPF TINGYYFMPL DSDSEPVDEG TPMVRNVSFT NITARQVETA GFFAGLPEQY FEGISFNDVQ IDATRPLDAT DLDPAMAYDY EQTHALFCKS IADISFTDVR IRTPGGPAMQ FEDTEDVTID GLRVPDDQDA PVVSLTNVDR TRVRGCAPQS EGPFLEAGGS ETREISFAGN HGSLADDTEI ADDSDATIDR S
|
| |