Gene Information |
Locus tag | Haur_2418 |
Symbol | |
ID | 5734299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3100238 |
End bp | 3101431 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279559 |
Product | glycosyl transferase family protein |
Protein accession | YP_001545186 |
Protein GI | 159898939 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
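The coordinate and length fields above can be checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3100238
END_BP = 3101431
GENE_LENGTH_BP = 1194
PROTEIN_LENGTH_AA = 397


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Both checks agree with the record: 3101431 - 3100238 + 1 = 1194 bp, and 1194 / 3 = 398 codons, of which 397 encode amino acids.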
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCGTGCAT TATTTGTTCT CAACCCTGGA ATAGGTCATT TAAACCCCAT GTTGCCAGTC GCGCAAATTT TGCAAGAAGC TGGCCATGAT CTTGCCTTTG CTACATCACC GAAGATGTTA CCAAGCATCA ATGCCAAAGG GTTTAACACC TTCGCTGCTG GCTTAAACTG GCTTTCCTCC GAAATGGATC AAACGTTTCC TGAAATTTTC GAGCTACCAT TTGCCGAGCA AGGCCAAGCA ATTTTGGGCA GCATTTTTGC CGATGCCGCC GCCCACCCCA TGGTTGCTGA TTTGCTGAAT ATCTGCCAAA CATGGCAACC AGATCTGATT GTGCGCAACG ATTTTGAGTT TGGCAGTTGT GTGGCTGCTG AGATCTTGGG CATTCCTCAA GCGACGATCA GCATCAGCTA TTTTCTTTCG GCCAATGCAC TCGAATCGTT GATTGGCGAA GAATTAGCTT ATTTGCGCAG CACCTATGGC CTTGCACCCT ACCCAACCAT GGATATGCTC TATCCATCAT TATATTTAGC CTTTGCCCCG CCATCATTCC AGCCCAAGGA AATTCCAACT ATGGAATCGT TGCGGCCACT GCGTTTTACG CGGTTTAGCG ATGGTGATTT ACCAGCCTGG GTCAGCCAAT TACCTGATCG ACCAACCGTT TATGCCTCAA TGAGTTCAGT TTTCGATACC CCAACGATCT TCCCGATGAT TCTTGAGGCC TTGCGCGACG AGCCGATTAA CCTGATTTTG ACCGTGGGAA CCAAGCAAGA TCCAGCACAG TTTGGCCCCC AACCTGCGAA TGTTTATATC GAGCAATACA TTCCTCAAGC ATTGCTGTTT CCCTATTGCG ATTTGTTTAT AACGCATTGT CCATTCGCCA CGATCATGGC GGCAATCAGC CATGGCATGC CATTGCTCAT GATTCCAGTT GCCGGCGAAG AGCCTGCTGG AGCCATGCGA GCCGCCGAGC TTGGTTTAGG CAAAGTTTTA CGCCTGCCCA ACCAACCCAA GGAGTTTTTC GACCAATGGG TTCCAGAATT TTCGGTTGAG TCGATTCGCG CCAGCGTGCG TGAGCTGCTG CAAACCACGC GTTATCGCAA CAATGCCCAA CGTTTTCAAG CCGAAATTCA AGCCCTACCT GGCCCTGAAC GGGTGATTGA GCTATTAACC AATTTGGCAC ACAAAAAAAG GTAG
Protein sequence | MRALFVLNPG IGHLNPMLPV AQILQEAGHD LAFATSPKML PSINAKGFNT FAAGLNWLSS EMDQTFPEIF ELPFAEQGQA ILGSIFADAA AHPMVADLLN ICQTWQPDLI VRNDFEFGSC VAAEILGIPQ ATISISYFLS ANALESLIGE ELAYLRSTYG LAPYPTMDML YPSLYLAFAP PSFQPKEIPT MESLRPLRFT RFSDGDLPAW VSQLPDRPTV YASMSSVFDT PTIFPMILEA LRDEPINLIL TVGTKQDPAQ FGPQPANVYI EQYIPQALLF PYCDLFITHC PFATIMAAIS HGMPLLMIPV AGEEPAGAMR AAELGLGKVL RLPNQPKEFF DQWVPEFSVE SIRASVRELL QTTRYRNNAQ RFQAEIQALP GPERVIELLT NLAHKKR
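The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical. The start-codon check is a simplification, since translation table 11 also permits alternative start codons.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove display spacing and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """Rough check: length divisible by 3, ATG start (simplified), stop codon at end."""
    seq = clean_sequence(raw)
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Demonstration on the first two blocks of the gene sequence only.
first_blocks = "ATGCGTGCAT TATTTGTTCT"
print(clean_sequence(first_blocks))  # ATGCGTGCATTATTTGTTCT
print(gc_fraction(first_blocks))     # 0.35
```

Passing the full gene sequence string to `gc_fraction` and `looks_like_complete_cds` applies the same checks to the whole record.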