Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3231 |
Symbol | |
ID | 5735099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4091173 |
End bp | 4092123 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280377 |
Product | glycosyl transferase family protein |
Protein accession | YP_001545996 |
Protein GI | 159899749 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000094491 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTATC TATCGTTAAT TTGTACAGTT AAAAACGAGG CTGATAATAT CGCCGATTTG CTAGATTCGA TGTTGGCACA AAGCCGCCAA CCTGATGAAA TTGTGGTCAA TGATTGTGGC TCAACCGACT CAACCGCCGC GATTGTCCAA ACCTATATTG AGCGTGGTGC ACCAATTCGC TTGGTCCATG GTGGTTTTAA CATCTCTTCT GGTCGGAATA ACGCCATTGT GCATGCCCAA GGCGACTTAA TTGCCTCGAC CGATGCTGGC TTAGCGCTCG ATCGCACATG GCTCGAACGG ATTATCGCGC CACTCGAAGC AGATCAGGCC GATTTGGTGG CTGGCTTTTA TCAAGCAGCG CCGCGCAGCG ATCTGGAAAC CGCAATTGGT TCGACCAACT ATCCGCTGGC TGAAGAAGTT GATCCAAGCC GATTTTTGGC GGCTGGGCAA TCGGTGGCCT TTCGCAAAGT TGTGTGGCAA ACCGTGGGTG GTTACCCCGA ATGGCTCGAC CATTGCGAAG ATTTGGTGTT TGATCGGGCA GCAGTAGCGG CGGGCTTTCG CAGCACAGCG GTGCTCGATG CAGTTGTGCA TTTTCAGCCG CGCTCCAGTT TTCGTGCCCT CTTTCGCCAA TATTTCTTCT ATGCACGGGG CGATGGGGTT GCCAACCTTT GGCCGTTACG CCATGCGATT CGCTATGCCA CCTACCTCGG CCTACTCCTT TTGATCCGCA ACCTGCCCCA ACGCCCATGG CTTCTCGGTG TTTTAGGTTT GGGTATTGCT GGCTACACTC GCAAACCCTA TCGACGGTTG TGGCGAGCAA CCAAAGGCTG GTCATTCACT CGCCGCAGCA AAACCTTGGG TTTACCGCCA TTAATTCGCA TGGTTGGCGA TCTTGCCAAA ATGCTCGGCT ACCCGGTTGG CTGGCTGGTA CGTCTGCGCA AACGTCGATA A
|
Protein sequence | MTYLSLICTV KNEADNIADL LDSMLAQSRQ PDEIVVNDCG STDSTAAIVQ TYIERGAPIR LVHGGFNISS GRNNAIVHAQ GDLIASTDAG LALDRTWLER IIAPLEADQA DLVAGFYQAA PRSDLETAIG STNYPLAEEV DPSRFLAAGQ SVAFRKVVWQ TVGGYPEWLD HCEDLVFDRA AVAAGFRSTA VLDAVVHFQP RSSFRALFRQ YFFYARGDGV ANLWPLRHAI RYATYLGLLL LIRNLPQRPW LLGVLGLGIA GYTRKPYRRL WRATKGWSFT RRSKTLGLPP LIRMVGDLAK MLGYPVGWLV RLRKRR
|
| |