Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4604 |
Symbol | |
ID | 5736449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5887572 |
End bp | 5888687 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281766 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001547363 |
Protein GI | 159901116 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000112819 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGATTG CAATTGATGC TAGTCGTTTG GCGGTTGGGC AGCGCACTGG CACCGAATCA TATACAACTG AATTAGTAAG AGCGCTGGCT CAAACTGACC GTACCAACCA CTATTGTTTA TATGTGAATC AACTCCCCGC AGCGTTGCCC CCACTAGGCC GCAATTGGCG GATCAAGCCG ATTCCTGCGC CGCGTTTATG GACGCACTTG CGGCTTGGCC CAACATGGCA GATCGATCGG CCAGATGTGG CATTTGTGCC AGCCCATGTT TTGCCAAGTT TGCCGCCGCG TCGGAGCGTT GTGACGATTC ACGATTTGGG CTACGAACAC CACCCTGAAT CGCACCCCGC CCGCCAACGG CTGTATTTGC GCTACTCAAC CTTGTGGAGC GCTCGTATGG CTAGCCAAAT TATTGCGATC TCCGAAGCCA CCAAGCGCGA TCTGTTGCAC TACACAGGCA TTGCCGCCGA AAAAATTAGC GTAATCTACC ACGGCGTGCA CGAGCGTTTT TATCCCCATT CGAGCGAGCA AACCAAGGCG ACTGCCGCCA AATATGGCTT ACACGGCGAG TATTTATTAT TTATCAGCAC AATTCAGCCA CGCAAAAATT TGGTGCGCTT GATCGAAGCC TATGCCCAAG CCCGCCAACG CTGCCCTGAT TTGCCGATTT TGGCCTTGGG CGGCAAAACT GGCTGGCTAA CCGAACAAAT TACCCAACAA GCTCAACAGT TAGGAATCAG CGAGCATGTG GCCTTTTTAG GCTATGTGGC CGACGACGAT TTACCAGCGC TGCTCAGTGG TGCGACAATC TATCTATTGC CATCGCTCTA CGAAGGCTTT GGCATGACCG TCTTGGAAGC CATGTCCAGT GGCGTTCCAG TGATTACCAG CAATGTAAGC AGCCTACCTG AGGTTGCTGG CGATGCAGCC TTGTTGGTTG AGCCAAGCCA AACCGCTACA ATTGCCGCAG CGATTGTTGA GCTTTGGCAA AACCCACAGC AACGCCACGA TTTTGCTCAA CGCGGCTTAG CATGGGCCAA ACAATGGACA TGGCAGCGCT GTGCTGAACA AACTTTAGCA GTTCTTACAA CGGTTGGGCA TCATGGCTCA TTCTAA
|
Protein sequence | MQIAIDASRL AVGQRTGTES YTTELVRALA QTDRTNHYCL YVNQLPAALP PLGRNWRIKP IPAPRLWTHL RLGPTWQIDR PDVAFVPAHV LPSLPPRRSV VTIHDLGYEH HPESHPARQR LYLRYSTLWS ARMASQIIAI SEATKRDLLH YTGIAAEKIS VIYHGVHERF YPHSSEQTKA TAAKYGLHGE YLLFISTIQP RKNLVRLIEA YAQARQRCPD LPILALGGKT GWLTEQITQQ AQQLGISEHV AFLGYVADDD LPALLSGATI YLLPSLYEGF GMTVLEAMSS GVPVITSNVS SLPEVAGDAA LLVEPSQTAT IAAAIVELWQ NPQQRHDFAQ RGLAWAKQWT WQRCAEQTLA VLTTVGHHGS F
|
| |