Gene Information |
Locus tag | Haur_4009 |
Symbol | |
ID | 5735870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5115669 |
End bp | 5116754 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281159 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001546769 |
Protein GI | 159900522 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATAG CTTTAGTTCA TGATTATTTG AATCAATATG GTGGCGCGGA ACGGGTGCTC GAAGTGTTGC ACGCCATGTT TCCGCAAGCC CCAATTTATA CATCAATTTA CGATGCCGAG GCCATGCCCA GCCACTATCG CAGTTGGGAT ATTCGCACCT CGTTTATGCA AAAGCTGCCA GGTTGGCGCA AGCATTTTCG TAAATATTTT TTGCTCTATC CCAGTGCCTT CGAGCATTTC GACCTGAGCG CCTATGATTT GGTGATCAGC TCATCGAGCG CTTATGCCAA GGGCGTAATA ACCAAACCTG GCGCTCGCCA TGTGTGCTAT TGCCACACCC CAATGCGCTT TGCTTGGCGC ACCGACGATT ACGTTAAACG CGAGCAGATC AGTGGGATCT TCGGGGCGAT TCTGCCCTTC TTTTTGACCT ACCTGCGCAT GTGGGATGTC CAATCGTCAG GCCGCGTAGA TCGCTTTATT GCCAACTCGC GCACGGTTGC TGATCGGATT GACCATTTCT ACAAACGCCC TTCAACAATC ATCACGCCGC CAGTTGAATT GCAGCCATTC GAGCCACAAC CAGCCGAAGA TTTTTATTTG GCGGGCGGGC GGCTTGTGCC CTACAAACGG CTTGATTTGG CGATCAAAGC GTGTACCAAA CTTGGTTTGC CCTTGGTGAT TTTTGGCGAT GGCCGCGATC GCGCCGAGCT TGAAAAAGTG GCAGGGCCAA GCGTGCGCTT CGTTGGCAAA GTTGATGACG CGACCTTGCG CAGTTTATAT GCCCGTTGTC GCGCCTACCT CATGCCAGGC GAAGAAGATG CAGGCATTCA GCCGCTCGAA GCCATGGGTG CAGGCCGCCC TGTGATCGCC TACCAAGCAG GTGGGGCACT CGATAGCGTG ATCGAAGGCC AAACTGGGCG CTTTTTCAGC CAACAAACCG TCGAAGATCT GGCGGCGGCC ATCCTTGCCA GCCAAAACGA TCACTACGAG CCAACGGCGA TTCGCGCTCA TGCCGAGCAA TTTGCCCGCC CCGCCTTCGA GGCGCGGATT CGGGCCGAAG TCGAAGCGGT GTTAAACGAA GGATGA
|
Protein sequence | MQIALVHDYL NQYGGAERVL EVLHAMFPQA PIYTSIYDAE AMPSHYRSWD IRTSFMQKLP GWRKHFRKYF LLYPSAFEHF DLSAYDLVIS SSSAYAKGVI TKPGARHVCY CHTPMRFAWR TDDYVKREQI SGIFGAILPF FLTYLRMWDV QSSGRVDRFI ANSRTVADRI DHFYKRPSTI ITPPVELQPF EPQPAEDFYL AGGRLVPYKR LDLAIKACTK LGLPLVIFGD GRDRAELEKV AGPSVRFVGK VDDATLRSLY ARCRAYLMPG EEDAGIQPLE AMGAGRPVIA YQAGGALDSV IEGQTGRFFS QQTVEDLAAA ILASQNDHYE PTAIRAHAEQ FARPAFEARI RAEVEAVLNE G
|
| |
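The coordinate and length fields in the Gene Information table are related by simple arithmetic, which the following minimal sketch checks. It assumes inclusive 1-based coordinates and a CDS that ends in a single untranslated stop codon (the gene sequence above ends in TGA). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS.

    Assumption: the CDS length is a multiple of three and the last
    codon is a stop codon that is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


# Values copied from the record for Haur_4009.
length_bp = gene_length_bp(5115669, 5116754)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)
```

With the Start bp and End bp values from the record, this reproduces the listed Gene Length (1086 bp) and Protein Length (361 aa). The strand does not affect either length.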