Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0964 |
Symbol | |
ID | 5732850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1104270 |
End bp | 1106672 |
Gene Length | 2403 bp |
Protein Length | 800 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278096 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001543740 |
Protein GI | 159897493 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.490195 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCAAC GAATTCGCGC TTTTGTTCAA CCAGGCTATG GGTTATTAAC CTGGACGCTT GCCAGCCTGA TGGCCTTGTT GGTTGTGACC AATGTGGTCG AATCGGAATG GGTCGAAGGT TTTGAACGCA TGCGGATTGT GGCACTCGGC GGCTTGATTG TGGCAGCCTT GTTTGCGCGG TGGCAAGCCT TGCCCTCGTT TTTGGCCCAT GTACTTAGTT TTTTCATGGG CTTGGCATGG TCAATTCGCT TTTTACCGCT TGATGAACGG CTCGATAGCC AAGTTGATCG CTGGACGGAT ATGCTGATTC GGGGGATTGC AGCAATTCGC GCGGTTAAAA CTGGCGAGCA AATTGAAGAT TATTATCTCT TTTCGTTGGC AGGCCTACTA CTGGCTTGGA TTTTTGCCTA TAGCACGATT TGGCTGCTGT TTCGTTGGAA CTGGAATTGG CGTTTGGTGC TGCTGAATGG TAGCCTGTTG ATGGCCAATT TGACCTATGC GATTCCCAAA CCAATTGCCT CGTTCTGGCT GTTTTTACTG GTTGCCCTGC TCTTTTTGGT CACGCAGACC TATGCCCAAC GCCAAAATGG CTGGGATGCT GGCTTGCTTG AGCAACAAGA ATGGCTTTCG CTGCGCTATT TGTGGGCGGG AGTTTTAGCC TGTTTTAGTT TGGTGTTTAT GGCGGCCTTG ATGCCTGCCA ATATTACCAA TGCCCAATTG CGGATTTTTG GCGAAAATCT GAGCCGCCCG ATCGATTTCT TTCTGCCTGA TAATGGGGCT GAACGCGCCG ATCGTGGGCC GCAAGGTGTG GGGAATGTGG TCGTGCCCAA TGGTAGTGGG GCTAGTTTTT CCAATAATAC GGTCAACCTT GGCGGGGCAC GCTCAGCCAC CAACGAAGTC GTGCTTGAAG TCAAAGCACC ATCAGCCGAG TATTGGCGCT CGAATGCTTG GGATTTATAC ACAGGCAAAG GTTGGCAAAA CACCACTGGC GAGTTGGCTT GGCAAGTGCG CCAAACTCCA ACCCGCCGCG AAGCCTTGAC CCCAATTAAT CCCGAAGATA CGCTGACCCA GCTTGATACA ACTGGTCGTG TGCCATTCAC CCAAACGATT AAATTGATGC AAACCCGTGG CGATCAGCAA TTGCCTGCCG CAACCTCGCC GCTGACTTGG AGCGTGCCAG TCTTGGTGCA ACACTCATTT ATCGTTTCGG ATACCGAAAA TACCTTGCCC AACTTTGCCG ATAGTGCGAT CTATTTCAAT CAAGGCCCAG CCAACGAAGG CTTTGAATAC AGCGTAGTTT CGTTGATCAG CAATATTGAT AAGCAAAGTT TGCGTGGAGC AGCGACCGAT TATCCGGCTT GGTTGCAGCG CTATGTTCAG TTGCCCGATA CGCAATCGAT GCGCAATATT GCTGGTTTAT CGCGCCAATT AACGACTGCG GCAGGAGCTG AAACCGCCTA TGATAAAGCG GTGGCAATCG AACGTTATTT GCGCGAATTT CCCTATGATG ATCAAATTCC TGCGCCGCCT GCTGATGCCG ACCCGATCGA GAATTTCTTG TTCAATCTAC GGCGGGGCTA CTGCGATTAT TTTGCTGGCT CGATGGTGCT GATGTTGCGG GCGCAAGGGA TTCCAGCGCG TTGGGTTCAG GGCTATGCCA CCGGCGATTA TGATGCTGAT CGCCAAGTCT ATGTCGTGCG TGATACGATT GCCCATAGCT GGCCCGAAGT CTATTTTATG GGCTATGGCT GGATTCGCTT CGAGCCAACC CCAGCGGGCT ATGTAACTGT GCCAATTCGG CCTGATGGCC CACCCCAAGC CATAGATAAC AGCGAAAACC CTGATGATAT TGGGCCAAAT GCGGGGATTG TTGATCAACC AACTACCAAT TTTGATGCCC TGCAAGATTT GCGCGAAAAC TTCCAAACCA CGCCAATTCC AACGGTCGAA CCTCAAGCGT TGCCAGCTGA AGATGTAGCG GCGCAGGTTG TCGAAAGCAG CCCATTCTGG CGTTGGCTGG CCATTATTTG TGGAGTGATT GGCTTAATTA TTTTGTTGCT ATGGTTGCTG TTGCAACGTG AATTTCGTGG TTTGCGACCT GCGGCACGGG CCTATGGGTT TATTGGCTTA TTGGCGCGTT GGGCTGGTTT AGAGCAGCGA CCCGAACGCA CACCCCAAGA GTTTGCCTGC GATTTGGCCA AAGAAATGCC TGGCCAGCGC CGAACTCTGC GGCGCTTGGC CGATGCCTAT AGTGCTGAGC AATATGGCAG CCAAGTGCGG CTTGACCCTG AGGAAATTGA AAATGATCGA GCTATGGTCA GTCGCACGCT TTGGCCGCGC TCGATCAGCC GTGGTTGGCG CTCGATTTTG CAACAAATCT TGCATCCGCG CTGGCGACGC TAG
|
Protein sequence | MLQRIRAFVQ PGYGLLTWTL ASLMALLVVT NVVESEWVEG FERMRIVALG GLIVAALFAR WQALPSFLAH VLSFFMGLAW SIRFLPLDER LDSQVDRWTD MLIRGIAAIR AVKTGEQIED YYLFSLAGLL LAWIFAYSTI WLLFRWNWNW RLVLLNGSLL MANLTYAIPK PIASFWLFLL VALLFLVTQT YAQRQNGWDA GLLEQQEWLS LRYLWAGVLA CFSLVFMAAL MPANITNAQL RIFGENLSRP IDFFLPDNGA ERADRGPQGV GNVVVPNGSG ASFSNNTVNL GGARSATNEV VLEVKAPSAE YWRSNAWDLY TGKGWQNTTG ELAWQVRQTP TRREALTPIN PEDTLTQLDT TGRVPFTQTI KLMQTRGDQQ LPAATSPLTW SVPVLVQHSF IVSDTENTLP NFADSAIYFN QGPANEGFEY SVVSLISNID KQSLRGAATD YPAWLQRYVQ LPDTQSMRNI AGLSRQLTTA AGAETAYDKA VAIERYLREF PYDDQIPAPP ADADPIENFL FNLRRGYCDY FAGSMVLMLR AQGIPARWVQ GYATGDYDAD RQVYVVRDTI AHSWPEVYFM GYGWIRFEPT PAGYVTVPIR PDGPPQAIDN SENPDDIGPN AGIVDQPTTN FDALQDLREN FQTTPIPTVE PQALPAEDVA AQVVESSPFW RWLAIICGVI GLIILLLWLL LQREFRGLRP AARAYGFIGL LARWAGLEQR PERTPQEFAC DLAKEMPGQR RTLRRLADAY SAEQYGSQVR LDPEEIENDR AMVSRTLWPR SISRGWRSIL QQILHPRWRR
|
| |