Gene Information |
Locus tag | Haur_1212 |
Symbol | |
ID | 5733105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1396331 |
End bp | 1397686 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278352 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001543988 |
Protein GI | 159897741 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
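The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp = 1396331
end_bp = 1397686
reported_gene_length = 1356      # bp
reported_protein_length = 451    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Codon count minus one terminal stop codon (assumes a complete CDS)."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1


length = gene_length(start_bp, end_bp)
print(length, length == reported_gene_length)
print(protein_length(length), protein_length(length) == reported_protein_length)
```

Under these assumptions both comparisons hold for this record: 1397686 - 1396331 + 1 = 1356 bp, and 1356 / 3 - 1 = 451 aa.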
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.360145 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGTG TGGCTTTTTG CACTCCAGTC AATCCAGTTG AATCGGGTAT TTCGGATTAT AGCGAGGAAT TATTGCCCTA TTTGGGGCAG TATGTTGATC TGACGTTGGT GGTTGATGCT GAGGTTCAGC CCACTAATCA ACAATTGCTC GCCAAACTGC CGATTATCCG CATTGGTGAT TTAGCCAAGC AGCATGCACG GCAACCTTTC GATGCAATTA TCTATCATAT GGGCAATAGC CCCGCCCACA GTCGTTTTTG GCAAAGTTTG CAAAGCTTGC CGGGAATTGT GGTGCTGCAC GATTATGTGC TACATCATTT AATGCTGTGG CATGCCGCCA ATCGCTTGAA AAATGTGGCG AGCTACCGCC AATTGATGCA GCACTATTAT GCTGAACAAG GCTCAAGCAT TGCCCAACGT ATGGAGCGTG GCCAGCTTGG CGATGCAGTG TTCGATTTTC CACTTTCTGA GCCAGTGATT GCCCAAGCCA GCAGCCTAAT TGCTCATAGC CAATATGTGC TTGAGCGGGT GCAGCCACAG CGCCCCAACT TGGCGACTTC GCTAGTGCCA ATGGGTGTGC CCTTACTACC AGCCCCTGAT CGTTTGGCGG CTCGGCAAGC GCTCCAATTG CCCGCTGAAA TTCCGATTTG GGCTAGTTTT GGTCATATCA ATCCCTACAA ACGGATTGAA CAGGCGCTGC AAGCTTTTGC CCAATTTCGC CGTACTTATC CCGATGCACG GTATATATTG GTTGGCAGCG TCTCGCCAAG CTACGATCTC AAGGCCTTGC TCCAACGCTT GCAGCTTGGC GAGAGCGTCC AAGTCACGGG CTATGTTGAT CATGCTGATT TTAATCGCTA TGTTGCTGCC GCTGATCTGT GTTTCAATGG GCGCTACCCT TCGGCTGGCG AGACTTCGGC CAGTTTGTTG CGCTTGTTGG GTGCTGGCCG CGCAGTTTTA GTCAGCGATA TTGCTACCTT CAGCGAATTG CCCGCCGATG TGGTGGCTCA TGTGCCCGTT GATCGCGATG AAGTTGCTTT AATTGCGGCC TATGCTCAGC GTTTATGGGC CGATGTGGCG CTACGCGAAG CCATGGAAAC CAATGCCCGC CGCTATGTGA CTGAAAAACA TAGTTTGCCC TTGGCCGCGC GAGGCTATGC CGATCATCTG AGCCGCGTGC AGGGCTGGCC GCGTTTGGAG CCACAACGTG AGCCATTGTG GGATATTAAT GCTGTCACCA TCCAGCATTC AATCGCCCAA ACGATTGGCC GTAAAGCCGC TCAACTTGGC TTAGTTGATG ACGATGCGCC GTTGCTTGAT CGTTTGGCTG CACGGCTACG AAATTTATTG ACATAA
Protein sequence | MQRVAFCTPV NPVESGISDY SEELLPYLGQ YVDLTLVVDA EVQPTNQQLL AKLPIIRIGD LAKQHARQPF DAIIYHMGNS PAHSRFWQSL QSLPGIVVLH DYVLHHLMLW HAANRLKNVA SYRQLMQHYY AEQGSSIAQR MERGQLGDAV FDFPLSEPVI AQASSLIAHS QYVLERVQPQ RPNLATSLVP MGVPLLPAPD RLAARQALQL PAEIPIWASF GHINPYKRIE QALQAFAQFR RTYPDARYIL VGSVSPSYDL KALLQRLQLG ESVQVTGYVD HADFNRYVAA ADLCFNGRYP SAGETSASLL RLLGAGRAVL VSDIATFSEL PADVVAHVPV DRDEVALIAA YAQRLWADVA LREAMETNAR RYVTEKHSLP LAARGYADHL SRVQGWPRLE PQREPLWDIN AVTIQHSIAQ TIGRKAAQLG LVDDDAPLLD RLAARLRNLL T
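The relationship between the two sequence fields can be illustrated by translating the first three 10-base blocks of the gene sequence. This is a minimal sketch, not the database's own procedure: it builds the standard codon table by hand, relying on the fact that translation table 11 uses the same amino-acid assignments as the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence listed above (30 bases, 10 codons).
first_blocks = "ATGCAGCGTG TGGCTTTTTG CACTCCAGTC"
print(translate(first_blocks))
```

The ten residues produced this way match the first block of the protein sequence, MQRVAFCTPV. Passing the full gene sequence to `translate` would also emit a trailing `*` for the final TAA stop codon, which is not part of the 451-residue protein.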