Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3300 |
Symbol | |
ID | 5735170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4165051 |
End bp | 4166178 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280447 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001546064 |
Protein GI | 159899817 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00341584 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGTGTGG CTCCCAAAAT TGCTTTTATT CGTAAGGGGC GCTGGCCGTT GGCGAATGTG CGAACCGCCG AAGCACTGCG TGCTCAATTC CCCGAATACG AACTCCGTGA GATCGATTTA ATTCCCATCA TTCGACGCAA GCCTGCCTTG GTTGCATTAA ACGGCTGGTG GACATTACGC CAATATGCTG GCGATTTGGC CATGCGGCGA CGTGGCCCCA AAGATGCCTT TTTGATCACT AGTTATATTT TCCGTGCTGT CAAGCATTTA GTGGCCGATT TACTGCGCGA TGACGACTAT CTGTTTAGTT TTCAGATGCA ATCATTATTT GATGCTAGCG TGCCAAATAT CCCGCATTTT GTCTATACCG ACCATACCTT GCTGGCAAAT CGTCAATATC CAGGCTTTAA TCCGGCTTCA CTCTATCACC CTGAATGGAT GAAGTTAGAG CCAACAATTT ATCAAAATGC CAATTTAGTC TTTACACGTT CCAATCATGT TTCACGTTCA CTAGTCGAAG ATTACCATTG TGATCCAGCC AAAGTACGTT GTGTTTACGC TGGCAGCAAT GCTCCGGTGA TCAGCGAACC GCCTGATCCA GCTCGCTATG CCAGCCAAAA TATTGTCTAT GTTGGGATAG ATTGGGAGCG TAAAGGCGGC CCTGAATTGC TGCAAGCTTT CGCCCAAGTG CGGGCGGTTT ATCCCAATGC CACCTTGACG ATCATCGGCG CAAACCCTCA AACCAATCAG CCAGGGGTTG AGGTGATTGG GCGGATTCCG GTTGAGCAAT TACCGCACTA TTATCAACGC GCCGCCGTCT TTTGCATGCC CACCAAACTT GAGCCATTTG GCATCGTCAC GATTGAAGCC ATGAACTATT GGCTGCCTGT GGTTTCAACC AATCTTGGAG CCATGCCCGA TTTTATCGAG CACGATCACA ACGGCTATTT GGTCGAACCA GGCACGGTTG ATCAACTAGC CACCGCCTTG ATCAAGCTGG TTGGCGATCC TGAACGCTGT CGGCGTTTTG GGGCACGCAG TGTCGAAATT GCGGCACGGT ATCGCTGGGA ATCGGTTGGT TCGGCTATGC GTGAGGCAAT TATCCAGAAC ATAGAGCATA GAACATAA
|
Protein sequence | MRVAPKIAFI RKGRWPLANV RTAEALRAQF PEYELREIDL IPIIRRKPAL VALNGWWTLR QYAGDLAMRR RGPKDAFLIT SYIFRAVKHL VADLLRDDDY LFSFQMQSLF DASVPNIPHF VYTDHTLLAN RQYPGFNPAS LYHPEWMKLE PTIYQNANLV FTRSNHVSRS LVEDYHCDPA KVRCVYAGSN APVISEPPDP ARYASQNIVY VGIDWERKGG PELLQAFAQV RAVYPNATLT IIGANPQTNQ PGVEVIGRIP VEQLPHYYQR AAVFCMPTKL EPFGIVTIEA MNYWLPVVST NLGAMPDFIE HDHNGYLVEP GTVDQLATAL IKLVGDPERC RRFGARSVEI AARYRWESVG SAMREAIIQN IEHRT
|
| |