Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3294 |
Symbol | |
ID | 5735164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4159883 |
End bp | 4161091 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280442 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001546059 |
Protein GI | 159899812 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAATTC TGTTTCTGAC ACCATACCCA GCCTACCCAG CCAATAGCGG CGGTACACAA CGCATGTTTC AGTTGCTGCG TGAGCTAAGC CGCCAGCATG AATTATGGTG TTTAAGCCTC TCGCCGAGCC ACAGCGCTAG CGCCGCTGCT CAACCGCTCA ATCGCTATTG CCATTTAACG TTAATTCCTG CACCGCGCCG CAATTTAACC CAACGCGCTT GGACGACCCT GACCTCGCCG TTGCCCGATA TGGCTTTGCG TGCACCATCA GAAACATTTC GTTCGGCCTT GGCTAGTTTG CTCAACACCA ATCGTTTTGA TTTAGTGCAA GCTGAAAGCA TCGAGATGGC GCAATATGCC TTGCAAGCGC AACGTTGGGG CATTCCGGCG ACGCTTGATC AATGGAATGC TGAATATGTG TTGCAACAAC GGGCTGCTCA AACCGATCGT CAACAGCTCA AACGTTGGCA TGCTGCGCTC TATTCGGCAA TTCAATGGCG CAAATTGGCG CGTTATGAAC GCTTCGTTTG CAACCAACTT GATCAAACGT ATGTGGTTTC GCCCGAAGAT CGAACCATGC TGCAAAAAAT TGAGGTCAAA AAAACGCTGC ATGTTGTACC GAATGGGGTT GATAGTCAGC AATTTAGCCC AAACATGCCA CGTGAGTATG ATCCAAATAC GCTGCTTTTT ACTGGCAGCT TAGATTTTCG CCCGAATATC GATGCCCTGC GCTGGTTTAT TCAAGAAGTA TTGCCTTTGA TTCGGGCTGA ACGACCTGAG ACGAAGTTGA TGATTGTGGG GCGTGCGCCA ACTCCGGCGG TGTTGCAATT AGCCCAGCCT AGCGTGATCG ACATTATCCC AAATGTGCCG AGTGTGCAGC CCTATTTCAA CCAAGCCGCC GTGTTCGTCT TGCCCATGCG CATGGGCGGC GGCGTGCGAC TCAAACTACT CGAAGCTTTA GCCACCGAAA CTCCACTGGT CTCAACTACG ATGGGCGCTG ATGGCGTAAC AGGTTTAGTG CCCAACCAGC ATTGTTTGCT GGCCGATACT CCTGCCGAAT TTGCCCAAGC CACCCTCAAA CTACTGCATG AACGCCAATT TGCCCAAGGC TTGGCGATGG CTGCTCGCCG TTTCGTGGCC GCCAACTACG ATTGGCAGGC AATTACCGAG CGCTTGCAAC AAGCGTGGGG CGAAATAAGT CATGGGTAA
|
Protein sequence | MRILFLTPYP AYPANSGGTQ RMFQLLRELS RQHELWCLSL SPSHSASAAA QPLNRYCHLT LIPAPRRNLT QRAWTTLTSP LPDMALRAPS ETFRSALASL LNTNRFDLVQ AESIEMAQYA LQAQRWGIPA TLDQWNAEYV LQQRAAQTDR QQLKRWHAAL YSAIQWRKLA RYERFVCNQL DQTYVVSPED RTMLQKIEVK KTLHVVPNGV DSQQFSPNMP REYDPNTLLF TGSLDFRPNI DALRWFIQEV LPLIRAERPE TKLMIVGRAP TPAVLQLAQP SVIDIIPNVP SVQPYFNQAA VFVLPMRMGG GVRLKLLEAL ATETPLVSTT MGADGVTGLV PNQHCLLADT PAEFAQATLK LLHERQFAQG LAMAARRFVA ANYDWQAITE RLQQAWGEIS HG
|
| |