Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0042 |
Symbol | |
ID | 5731914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 51606 |
End bp | 52706 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277163 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001542822 |
Protein GI | 159896575 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAAG CCACAATTGC CCTCGATTTA CGTGTGCTCG ACGACCACTT TCCGGGAATT GGGCGCTTTT GCTATGAATT AAGTTTAGCT TTATTGCAAA CGCCAATCGC TGAACAATTG CATCTCATCT TGCCGCAGCA ACCGCGTACC CAATTTGATC TCAGGCCGTT ACTGCAACAC CCAAAAGTCT CGCGGGTTTC CCATAGCTTG TTTAGCGTCG GTCAGCATCG CGAATGGCGC AGCTTGCTGA GTGAGATTCA GGCTGATCTG GCGATTTTCC CCTACTATAT TCGCCCATTG TTCCAGCCCT GCCCCAGCCT GACATTAATT TACGACACAA TTTCGTGGCG CGTGCCAGCG ACATTTAGCC GCCGCAAACG TTGGCAAATC GCGGCCTTGC ATCATGTGGC GATTCAGCAA TCGGCGGCGA TTGGCACTAT TTCGCATTCA GCAGCTAGCG ATATTGCCCA GTTTTATGGG GTGAATCCAC AGCGTTTGGC CTATTTAGGG GTTGGTATTT CAGCCCAATT TCAGCCCCAG CCTGCTACTA CAATTGCAGC GCTACGGGCA AAATACGGCT TACCAGAGCG TTATATCGTC TATGTAGCCT CGGATAAACC GCATAAGCAA ATCGACTTTT TGCTGGATGC TTGGCAAGCT GCCCAAACTG CTGAGGTTGG CTTGGTGCTA GGCGGGCGGT GGCGCAACCC TACGAGCGAG CAATTACTAG AGCATCCCAA ACTTCACGGG CGAGTGTGGC GCATTGCCGA TGTGCCTGAG GATGAACTCG CTGCGTTGTA TAGCGGAGCG TTAGCGCTGG CATTTCCTTC GTTATATGAA GGCTTTGGGT TGCCAGCGCT TGAAGCGATT GCCTGTGGCA CACCAGTTTT GGCCCAAAAC AGCTCGTCGT TGCCTGAGGC GGTTGGGGCG GCAGGGTGTT TGTTGCCCAA TGAGCAAGCA ACATGGATTA AAGCCCTAGA GCGCATGTGT CATGATTCGA CATGGCGCGA AAGCTTGGCC GCTCAAACCA GCGCTCAGGC GGCCAAATTT AGCTGGCAAC AGGTTGCTGA ACGATTGGAG CAGCAGCTAA ATACTCTCTA A
|
Protein sequence | MKQATIALDL RVLDDHFPGI GRFCYELSLA LLQTPIAEQL HLILPQQPRT QFDLRPLLQH PKVSRVSHSL FSVGQHREWR SLLSEIQADL AIFPYYIRPL FQPCPSLTLI YDTISWRVPA TFSRRKRWQI AALHHVAIQQ SAAIGTISHS AASDIAQFYG VNPQRLAYLG VGISAQFQPQ PATTIAALRA KYGLPERYIV YVASDKPHKQ IDFLLDAWQA AQTAEVGLVL GGRWRNPTSE QLLEHPKLHG RVWRIADVPE DELAALYSGA LALAFPSLYE GFGLPALEAI ACGTPVLAQN SSSLPEAVGA AGCLLPNEQA TWIKALERMC HDSTWRESLA AQTSAQAAKF SWQQVAERLE QQLNTL
|
| |