Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4273 |
Symbol | |
ID | 5736132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5455175 |
End bp | 5456398 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281433 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001547033 |
Protein GI | 159900786 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATTT TGATTGTTGG GCTAGGCGGG GTAACCGCAA CCTTTCGCAA CTGGCCTGAG CGAATTGTGG CGCTCGGTTT GGCTCAACGC GGTCATGCCG TTCGCGCGAT TGGAACCCAC GATCCCAAAC GACCTGCCTT AGCTGCCCGT CATGAAATCA TCGAGGGCGT AACAGTGCAA CGCGTGCATT CGGGCTACGC GCCAAATCGT GAGTTGCAAC AAGCCTTGGA GCATGGTCCA AGGCCAGATT TGATTCATTT TATGCATCCG CGCAATGTGC TGGCCGCCCA AACGAGTGCC TGGGCCAAAC AGCACAAGAT TCCCACGGTT TATACATGGT TGGGGCCGTA TCACGATGCC TATTTGGTTG ATGATCGTGA GCGTCCATTT GAAACAACCA TCCACTATCA GCGGCCAATT TGGACGAAAC AACAATTTTG GCAACGCCTC AAATCGGCCC GTTCGTGGCG CACAATCCGC GACCATCTAC GCAATTGGCG CTTGCATCGC CCATTGTGGG AAGCCGATCA GCTAATTCCA TGCTCGCAAT TCGAGGCCGA TGAACTTAAA CGCATGGGAT TGCAGCAAGA ATCAAGTGTG ATTCCCTTGT GGATTGACGA TTCGGCGATT CAGACTACCC CGGTTGTTTT ACCTGATTTG AAGGTAAGCC GCCCATGGAT TTTATTTGTT GGGCAATTGA CTCCGCGCAA GGGCTACGAT TTGGCCTTGC GGGCCATGCC AGCAATTTTG CAGCAATATC CCAATGCCAA TTTATTGATG GTATCGGGGA TTAACCACGC TGAACGGGCC GAAGTTGACC GAATCGCCCA AGAACTGAAT ATTCAACCAC AGATTCATTT TTTGGGGCGG GTTGATGATG CAACCTTGGT CAACCTTTTT CGCCATTGCG ATGTCTATCT CACGCCGACC CGTTATGAGG GCTTTGGCTT GACCTTGCTC GAAGCCATGG CGGCGGGTGC GCCCTTGGTT GCCAGCGATA TTCCGGTGGT TAACGAAATT GTGCGCCACG GCGAAAATGG CTTGCTAGCA CCCTACAACA ATCCCGAAGC CTTGGCCGCA GCCGCCAATT TGATTCTTGG GCAGCCGCGT TTAGCTGCCA AACTCCGCAG CGGTGGGCAG CAAACATGCG AGGTATGGTA TAATCCGGCC CGATGGACGA CCGCCTTAGA GCAGGTCTAT ACGCGAGTAA TCAATGAGCA CTAA
|
Protein sequence | MQILIVGLGG VTATFRNWPE RIVALGLAQR GHAVRAIGTH DPKRPALAAR HEIIEGVTVQ RVHSGYAPNR ELQQALEHGP RPDLIHFMHP RNVLAAQTSA WAKQHKIPTV YTWLGPYHDA YLVDDRERPF ETTIHYQRPI WTKQQFWQRL KSARSWRTIR DHLRNWRLHR PLWEADQLIP CSQFEADELK RMGLQQESSV IPLWIDDSAI QTTPVVLPDL KVSRPWILFV GQLTPRKGYD LALRAMPAIL QQYPNANLLM VSGINHAERA EVDRIAQELN IQPQIHFLGR VDDATLVNLF RHCDVYLTPT RYEGFGLTLL EAMAAGAPLV ASDIPVVNEI VRHGENGLLA PYNNPEALAA AANLILGQPR LAAKLRSGGQ QTCEVWYNPA RWTTALEQVY TRVINEH
|
| |