Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2083 |
Symbol | |
ID | 5733971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2593709 |
End bp | 2594971 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641279224 |
Product | sterol 3-beta-glucosyltransferase |
Protein accession | YP_001544851 |
Protein GI | 159898604 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAACTA TCACAATCAT GACGGGGGGA ACCCGGGGCG ATGTCCAACC CTATGTTGCC CTTGGGGCAG GCTTACAGGC AGCGGGCTAT ACCGTTCGGA TTGCGGCCAG TGACAATTTC GCTGGCCTCG TCACGGATGC AGGTTTAACC TTCGTCTCAA CCGGCGAGAG CATTGAAGCG CTGCTGAATA GTCCCCCATG GCGTGCCACC CTCGAGCGCG GCAACTTCGT GACGATTCTT CGGCAGATGC AGCGCGAGAT GCGCAGCCGT GCCGCCCAGC AAGCCCGCCA GATTCCGACT ATCATTCACG GGAGCGATCT GTTGATTGCG GGCATGGCGG GCTTTGGTGG CGCGTTTACC GCCGCATTAG CAGCACAGAT CCCGATTCTG ATCGCCCATC TCTTTCCCTT TACCCCAACG CGTCGGTTTC CTAGTCCGCT TATCCCCGTT GCGACCTTGG GGGGCATGCT AAATCGCCTC TCTTTCCGCG TGATGCAGCT CGTGTTTTGG CAAACCTTGC ACGCCGCCGA TGCAGCAACG CGGACGACCC TTGGATTGCC CGCTGCACCG TTGGGTGGCC CCTTTGGCCA GTATGAGCGC CAGCAGATCC CTGTCATGTA TGGCTATAGT CCGCATGTCC TGCCACGCCC CAATGATTGG CCCCCGCAGC ATGTCGTCAC GGGCTATTGG TTTCTTGACC CACCGCTGGG GTGGATACCC CCTGCCGATC TCGTGGCGTT CCTTGCGGCA GGTCCCCCCC CGATCTATCT TGGGTTTGGC AGTATGGGTG GCCGCAATCC CGAAGCTGCG GGGCGCATGG CATTGGAGGC ACTGGCCCAA ACGGGGCAGC GTGGGATTCT TGCGGCGGGC TGGGGCGGAT TGACGGTGCG TGATGTTCCC CCGACCGTCC ATCTGCTGGA GGCCATCCCA CATGCGTGGT TATTTCCACA CCTAGCAGGG ATTGTCCATC ACGGGGGAGC GGGCACGACC GCCGCTGCAT TGCGGGCAGG TGTCCCATCA ATTGTCGTCC CGTTTATGGG CGATCAGGCA TTTTGGGGGA AGCGGGTAGC AGAGTTAGGG GTCGGGCCGC CACCGATCGC GCGAACATCT CTCCGCAGTG TGCAGTTAGG CCATGCGATT GAGCGCGTGG TACGTGATGC TGCCATGCAG CAGCGGGCGG CCGTATTGGG ACAGCAGATC GATGGCGATA GGGGGATTCC GGCGGCAGTG GCGATCGTGC AGCGACTCGT TCCAGTTCCA TAA
|
Protein sequence | MPTITIMTGG TRGDVQPYVA LGAGLQAAGY TVRIAASDNF AGLVTDAGLT FVSTGESIEA LLNSPPWRAT LERGNFVTIL RQMQREMRSR AAQQARQIPT IIHGSDLLIA GMAGFGGAFT AALAAQIPIL IAHLFPFTPT RRFPSPLIPV ATLGGMLNRL SFRVMQLVFW QTLHAADAAT RTTLGLPAAP LGGPFGQYER QQIPVMYGYS PHVLPRPNDW PPQHVVTGYW FLDPPLGWIP PADLVAFLAA GPPPIYLGFG SMGGRNPEAA GRMALEALAQ TGQRGILAAG WGGLTVRDVP PTVHLLEAIP HAWLFPHLAG IVHHGGAGTT AAALRAGVPS IVVPFMGDQA FWGKRVAELG VGPPPIARTS LRSVQLGHAI ERVVRDAAMQ QRAAVLGQQI DGDRGIPAAV AIVQRLVPVP
|
| |