Gene Information |
Locus tag | Htur_3579 |
Symbol | |
ID | 8744199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 3681235 |
End bp | 3683559 |
Gene Length | 2325 bp |
Protein Length | 774 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646514160 |
Product | transglutaminase domain protein |
Protein accession | YP_003405114 |
Protein GI | 284166835 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
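The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed sequence ends in TGA). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table for Htur_3579.
start_bp, end_bp = 3681235, 3683559
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints "2325 774"
```

Under these assumptions the computed values agree with the listed Gene Length (2325 bp) and Protein Length (774 aa).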
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACGA ACTCCGGTTC CCAGACGGGC GAGCGGACGA TCGAGATCGC GCCGGACGGA TCGGTCGGTC CCGGAACATT CCGGCTGCTC GCGCTCTGTT GCGTGCTGGT CCTGACGGCG TCGTACGTGA GCGTCCTGCG CGACGTGACG CGAGTCGTCG GCGGGACCGA GACGCTGTTG GCCCTCGTCG TCGCGATGGG AGTCGCGGCG ACGGTCCTCG CGTGGGCGAT CCGCCCGCGG ACCGCGACGG TCGCCGCGCT CGTCGCGGGC GCGCTCGGCT TCGCCTACTA CCTCGAGTAC ACCGGCCTCG GCGTCGAGGT GCTGCTCTCG GCGGGCGACG TGTTGCTCTC GGACGCCGCC GCGCTCGCGA CCGGCCTCCC CCTGCTCCGG ATGGTCGAAG CGGGCGTCTG GACGCTCGCG TTCGTCCCCG CGCCCGTCTT CCTCTCGTGG TATCTCGCCG TCCGGGGTCG GTACGCGCTC GGCGTCGTTC CCGGCGGTAT CGCGCTGCTC TTTCTCGTCC TGACCGGCGA CGCCGGGACG GTACCCACGC TGGTGGGGAC GCTGGCCGGG ATCGGCGCCG TCGGCTGCGG CGAACTCGAG CGCCGCGGCG GTTCGGTCGC CCAGGCGGAC CTGCTCACGG CCCTGTTCGC GCTGATCGTC GTCCTCTCGC TGACCGTCAC GGTCGTCCCC GGCGGCTCGG GGACACCGGG CCACTTCGCT GGGACCGAGA CCGGGACGCT CGAGGGGACC ATCGACCACG CCTCGGATCG CTCGAGCATC GGCGGCTCGG TCGAGCTCTC GCCGGAGGTC CGGTTTACGG TCGAAGCCGA CCAGGCCTCC TACTGGCGGA CGGGCGTCTA CGATCGCTTC GCCGGCGACG AGTGGGTCCG GACCGGCGAC CGCAGCGACT ACGGGGGTCC GATCGGGAAG CCGCCGGGCG AGAGCGAACG GGTGCACCAG CTCGTCACGG CCGAAACCAC GCTCGGCGTC ATGCCAGCCG CTCCGCAACC GGTCACGGTC GACGGGGATC TCACCCAGCA CACCGAGGTC TCGAGCCACG GGCAGATCCA TCCGAGCAAC CCCCTCGTGG AGGGCGACAC GTACGCCGTC GAGAGCGCGG TCATCGATCC CGACGCCGCC GCCCTCCAGC GGGCGGGAAC GGACTACCCC GAGCCGATCT CCGAGCGGTA CCTCCAGACG CCCGAGGACA CCTCCAAGGA GTTCGAGGCC CGAACGGCCG AGATCACCGC CGACGCCGAC ACGCCCTACG AGAAAGCCGT CGCGATCGAG GACTACCTCC GGTCGACGAA GGGCTACTCC CTCGAGGTCG ACCGACCGAA CGGCAACGTC GCCGAGGCGT TCTTACTCGA GATGGACCAG GGCTACTGCG TCTACTTCGC GACGACGATG GCCCAGATGC TGCGAGCCGA GGACGTTCCG ACTCGCTACG TCACCGGCTA CACGAGCGGC CAGCAGGTCG ACGACGACGA GTACGTCGTC CGCGGCACCG ACGCCCACGC CTGGGTCGAG GTCTACTTCC CCGACCACGG CTGGGTCGCC TTCGAACCGA CGCCGTCCGG CCCCCGCAAC GCGGCCCACA ACGAACAGGT CGAGCAGGCC CGCGAGGACG GCGCCGAGGA CGTCGACACC GACTCGAGCG AGGACGTGCC GCTCTCCGAG GAGGAAGCGG AGAACGAACC CGGCGAGTCC CCGTCGGAGA TCATCGACGA CAATGAGAGC GAGGCCCGGA ACGGCTCCGA ACCCGACACC GGGTACGGGC CGGACAACGA GACCGACAAC 
GACTCGACCT CGCCCCAACC GGAGCCCGAT CCGTCCGATC CGAACGCGGA CAACGACTCG CTCACCGACG AGGAGGACGA CGACGGCTTG CTCAGGGCGC TCGTCGAGCG GGTATCGATC TCCCGCGAGG CAGCCACCGT CGCCCTCGTC GCCCTCACCG GGTTAGTCGC GAGCGTCCAT CACACCGGCG CGGTCGCGCG GCTCCGCCGA ACGGTCGGCC GCTACTGGCA GCGGCGCAGC GACGACCCCG ACCGCGACGC CGAACGCGCC TATCGGCGGC TCGAGCGCCT GCTGGCCGCC TCCCACCGCC CCCGCGAGCG CTCAGAGTCC GCGCGCGGCT ACCTCGAGGC GCTGGCCGAG GAATCCGACG TGGGAGTCGA TCCGCGGGCG AAAACCGTCC TCGAGCGGTA CGAACGGGCG GTGTACGGCG GCGGCGTCGA TCGCGAGGGG GCCGACGAGG CGATGGCGAT CGTCGACGAA CTCGCGCGTG AGCACCTGCC CGGCGTCGGT CGACGGCGAA AGTGA
|
Protein sequence | MSTNSGSQTG ERTIEIAPDG SVGPGTFRLL ALCCVLVLTA SYVSVLRDVT RVVGGTETLL ALVVAMGVAA TVLAWAIRPR TATVAALVAG ALGFAYYLEY TGLGVEVLLS AGDVLLSDAA ALATGLPLLR MVEAGVWTLA FVPAPVFLSW YLAVRGRYAL GVVPGGIALL FLVLTGDAGT VPTLVGTLAG IGAVGCGELE RRGGSVAQAD LLTALFALIV VLSLTVTVVP GGSGTPGHFA GTETGTLEGT IDHASDRSSI GGSVELSPEV RFTVEADQAS YWRTGVYDRF AGDEWVRTGD RSDYGGPIGK PPGESERVHQ LVTAETTLGV MPAAPQPVTV DGDLTQHTEV SSHGQIHPSN PLVEGDTYAV ESAVIDPDAA ALQRAGTDYP EPISERYLQT PEDTSKEFEA RTAEITADAD TPYEKAVAIE DYLRSTKGYS LEVDRPNGNV AEAFLLEMDQ GYCVYFATTM AQMLRAEDVP TRYVTGYTSG QQVDDDEYVV RGTDAHAWVE VYFPDHGWVA FEPTPSGPRN AAHNEQVEQA REDGAEDVDT DSSEDVPLSE EEAENEPGES PSEIIDDNES EARNGSEPDT GYGPDNETDN DSTSPQPEPD PSDPNADNDS LTDEEDDDGL LRALVERVSI SREAATVALV ALTGLVASVH HTGAVARLRR TVGRYWQRRS DDPDRDAERA YRRLERLLAA SHRPRERSES ARGYLEALAE ESDVGVDPRA KTVLERYERA VYGGGVDREG ADEAMAIVDE LAREHLPGVG RRRK
|
| |