Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3412 |
Symbol | |
ID | 5735273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4298232 |
End bp | 4299188 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280559 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001546176 |
Protein GI | 159899929 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACTACA CCATTCGCCA CGTTACCCGC TTTCGTTATA GCAACCCGAT TAGCGAAAGC ATGATGGAAG TTCGCAAACA ACCGCGCACC GAGGGCGATC AACGCTGCTT ATCGTTTGAC ATAACCACCA AGCCAAAATC AAAAGTCTTG GTTTATTACG ATTCGCAGGG CAATACGGTG CATCATTTTG GCATTCCACG GGCGCATACT CAGCTTGAAA TTATCACCCA AGCCTATGTT GAAAATCGGA TTCAAGCGCC AGCCGACGAT CTCATGCTTG ATCTCAATAC CTGGCAAGGT TTGGAAGATC ATGCCTTGAA CGGCCACGAT TGGGATTATC TTAATTCCAG CCAATTTGTG CACTCGACCG ATTTGCTGCA AGCGTTTGCC AACGAAATTG GCCTGAGCAA ACAGCTTGAT CCACTTACTT TGATTCGCCA AATTACCCAA GCAATCTACG ATAAATTTGA GTATGTTCCG CAGAGCACCC GCGCTGATTC GCCAATCGAT GAAGCTTTGG CCACCCGACG CGGCGTTTGC CAAGATTACA CCCATATTAT GCTCAGCCTC TTGCGTTGGT TACAAATTCC GGCGCGGTAT GTGAGCGGCT ATCTGTTTCA TCGCACCGAC GACAGTGTGC GTTCGGCGGC TGATGCCTCG CATGCCTGGG TTGAGGCGTG GTTGCCCAGC CTTGGCTGGG TTGGCTTTGA TCCAACCAAC AATGTGGTGG TAGCCGATCG GCATATTCGG GTGGCGCTTG GCCGTGATTA TGCTGATGTG CCGCCGACCC ATGGCATTTT CAAGGGCGAA ACCCAAAGTA CACTCGAAGT AGCTGTCCAA GTGTGCCTTG CCGACGAGCC ATTACAAGCC GATCAACCAT CGAAACTAGA AGGCTGGTCG GTGGTTGAGG GTGGCGATGT GGCGGCGGTG TTGCAACAAA TGCAGCAACA ACAATAA
|
Protein sequence | MYYTIRHVTR FRYSNPISES MMEVRKQPRT EGDQRCLSFD ITTKPKSKVL VYYDSQGNTV HHFGIPRAHT QLEIITQAYV ENRIQAPADD LMLDLNTWQG LEDHALNGHD WDYLNSSQFV HSTDLLQAFA NEIGLSKQLD PLTLIRQITQ AIYDKFEYVP QSTRADSPID EALATRRGVC QDYTHIMLSL LRWLQIPARY VSGYLFHRTD DSVRSAADAS HAWVEAWLPS LGWVGFDPTN NVVVADRHIR VALGRDYADV PPTHGIFKGE TQSTLEVAVQ VCLADEPLQA DQPSKLEGWS VVEGGDVAAV LQQMQQQQ
|
| |