Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4741 |
Symbol | |
ID | 5736585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6050161 |
End bp | 6051087 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281906 |
Product | glycosyl transferase family protein |
Protein accession | YP_001547500 |
Protein GI | 159901253 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCTCG ATTGTAGTGT CGTGATTATT TCATGGAATG TGGCAGATTT GTTGCGCCAA TGTTTGGACT CGATCAGCCA ATCGCTAGCA GGCAGCAACT ATAGCTACGA AGTGATTGTG GTCGATAATG CTTCGCACGA CGATTCGGTG ATGATGGTGC GCCAAGCATT CCCCAAGGTT CAAATCATCG AAACGGGAGC CAATTTAGGC TATGCGGGCG GGGTCAATAT TGGGGTCGAT GCAGCCCAAG GCCAATGGAT TTTGGTGCTC AACCCTGATA CTGTGATGCA AGCCGAGGCA ATTCCTCAGT TGCTGGATTG GGCGCAACAG CAGCCCACTG CCAGCGTGAT TGGCCCGCAA CTGCGCTACC CCGATGGCTC AATTCAATCG TCGCGGCGAC GACTACCCAC CAAAGCTAGT TATTTTTTTG AAAGTACCAT CCTCGAACGC TGGTGGCCTA ACAATCCTTG GGCGCTGGCC TATCGCTGTG CCGATCAACC CGATGATCAG CCGCAACAGG TCGAATGGCT CATGGGCGCG GCCTTATTAG TGCGTAAATC GGCAATCGAG CAAGCCGGCT TGATGGATCG ACGCTTCTGG ATGTATTCAG AGGAAGTTGA TTGGCAAGCT CGCTTGGGAC GTTTTGGGCC AATTTGGTAT GTGCCGCAAG CAGTGATCAT ACATCACGAA GGCAAATCCA GCGAACAAGC GCCAGCGCGT AAACATTTGG CCTTCCAAGA ATCCAAATTG CGCTATGCTT CGCTATACGA AGGGCCAATA TTCGCCACCT GTTTACACTT TTTCTTGGCC AGCAGCTACC TATATGAGCT AGCAGTTGAG AGTGTTAAAT GGCTGCTTGG GCATAAACGC GAGCTACGTT GGCAGCGCAT GCAGATTTAT TGGCAGGTGT TGCGCCATTT TGGCTAG
|
Protein sequence | MALDCSVVII SWNVADLLRQ CLDSISQSLA GSNYSYEVIV VDNASHDDSV MMVRQAFPKV QIIETGANLG YAGGVNIGVD AAQGQWILVL NPDTVMQAEA IPQLLDWAQQ QPTASVIGPQ LRYPDGSIQS SRRRLPTKAS YFFESTILER WWPNNPWALA YRCADQPDDQ PQQVEWLMGA ALLVRKSAIE QAGLMDRRFW MYSEEVDWQA RLGRFGPIWY VPQAVIIHHE GKSSEQAPAR KHLAFQESKL RYASLYEGPI FATCLHFFLA SSYLYELAVE SVKWLLGHKR ELRWQRMQIY WQVLRHFG
|
| |