Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3354 |
Symbol | |
ID | 5735224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4231317 |
End bp | 4232546 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280501 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001546118 |
Protein GI | 159899871 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGCGGG TCGTTTTTCT GATCACGATG GGTATTGAAC GACCTTCGGG TCAGCGCTAC TTTAATCTGG CCAAGGAATT GGTGCGCCAA GGCGATCAGG TACGCATCTT GGCCTTGCAC CCCGATTTAG AACAATGTCA ACAACGCCGC TTTGTGCAGG ATGGAGTTGA AATTTGGTAT GTTGGCCAGA TGCACTCGCG CAAACGCCAT GGCGAGGTAC TGCCATTTTC ACCCTTGCAG TTGCTAGCGG TCTTGATTCG TGCCACATTA GGCATGCTGT GGGCCATTCT CTGCTCGCCC GCTGAGATTT ACCACCTTGG CAAACCACAA CCTGTCAATG GTTTGGCGGC GATTTTGGGG GTGTTGTTGG GGCGTGGTCA ACGCTTTTGG GTCGATTGCG ACGATGATGA AGTTGGTAGC AATCGCCTGA CCAACCAAGC GCAGCGTATG GTGTTTGGCT TTTGGCAATG GCTCTTGCCG CGTTTAGCCC AAGGCGTAAC CGTCAATACC CAAGCCTTAG CGGCGCGAAT GCTTGCTGCG CGGGTTGCCA ACATTGTCTA TGTGCCCAAC GGCGTTGATT TGAGCTTGTT CCAAGCGCCA GAGCCAGCAG TTTTGGCAGG CTTGCGCACA AGTTTAGGTT TGGATGGAAT GCAGGTGATC GCCTATGTTG GCACAATCGC CCTGCACAAC CACCCAATCA ACCTTTTGCT CGAAGCCTTT GATCAACAAT TAAAGCATGA TCCTCTGATT CGACTGGTGC TCGTTGGTGG CGGCGAAGAT CTGGGTTATG TTCAAACTTG GATTCGCGAC CATGGCTACC ACGATCGCAT CTTCTGTCTT GGGCATCAAG ATCGGCGCAG CATTCCCATG TGGATGGCGC TTGCCAATGT GACGGTTGAT CCAGTGTATG ATGATATGGT GGCACAAGCC CGTTCGCCCT TGAAGCTGTT TGAAAGCTTG GCCTTGGGAA TTCCCCCAGT TACTGGCGAT GTTGGCGATC GGCGGCTTGT CTTGAATTTT GCTGCTGATC GGTTGATCGC TCCGGCTGGT GATGCATTGG CGTTAGCTCA AACCCTGCAA GCTGTGCTAG CAACCTGGAG TACAACTAAT CGCCAAGCCT GCTTTGAACG AGCGACTGAC TATGCATGGT CGGCGTTGGC GCTACGTTGG CGAGCAGGCT ACGAGGAATC TTATGCGAAC CAGGCACTTG ACTCAACCTC AACCAAATAG
|
Protein sequence | MRRVVFLITM GIERPSGQRY FNLAKELVRQ GDQVRILALH PDLEQCQQRR FVQDGVEIWY VGQMHSRKRH GEVLPFSPLQ LLAVLIRATL GMLWAILCSP AEIYHLGKPQ PVNGLAAILG VLLGRGQRFW VDCDDDEVGS NRLTNQAQRM VFGFWQWLLP RLAQGVTVNT QALAARMLAA RVANIVYVPN GVDLSLFQAP EPAVLAGLRT SLGLDGMQVI AYVGTIALHN HPINLLLEAF DQQLKHDPLI RLVLVGGGED LGYVQTWIRD HGYHDRIFCL GHQDRRSIPM WMALANVTVD PVYDDMVAQA RSPLKLFESL ALGIPPVTGD VGDRRLVLNF AADRLIAPAG DALALAQTLQ AVLATWSTTN RQACFERATD YAWSALALRW RAGYEESYAN QALDSTSTK
|
| |