Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Lferr_0443 |
Symbol | |
ID | 6876395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidithiobacillus ferrooxidans ATCC 53993 |
Kingdom | Bacteria |
Replicon accession | NC_011206 |
Strand | + |
Start bp | 418601 |
End bp | 419749 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642788316 |
Product | hopene-associated glycosyltransferase HpnB |
Protein accession | YP_002218904 |
Protein GI | 198282583 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | [TIGR03469] hopene-associated glycosyltransferase HpnB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0000679717 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTGGCTGT TACCGGCCGC GTGGGCCGTT CTGGCGGCCT GGCTGGTGCT GTTCCTCGGG CGGGGCGCTT TCTGGCGGGC AGATCAGCGT CTTCCACCCC GCGAAAGCAA CCCACGGCGA AGCTGGCCGG AAGTCACCGC CGTAGTGCCT GCGCGTAATG AGGCAGAGGG TATTGGTGCC TGCGTAACGG CCCTGCTGAG GCAGGATTAC CCCGGCGTCT TGCGGGTCAT CGTGGTGGAT GACCAGAGTA CGGATGGCAC TGCAAAGTGC GCACGGGAGG CGGCGCGGCA GATGGATGCC TCCTCCCGTC TGGAGGTGCT GACCGGCACG CCGCTGCCGG AAAACTGGGC GGGTAAGGTC TGGGCCATGG ACCAGGGGGT GACCGCGGCC GGCCGGGTGC CGCTTCTCTG GTTCACCGAC GGTGACATCG TGCATGGGCC GGATGTCCTG CAGCGCCTCG TCACCCGGAT AGAGGATCGG GACCTGTCAT TGGTCTCGCT GATGGTGATG CTCTCCTGCC GCGGTTTTTG GGAACGTCTT CTGATCCCGC CCTTCATCTT CTTCTTCCAG ATGCTGTACC CCTTTCCCTG GGTCAACGAC CCACACCGTA CTCTGGCCGG TGCGGCGGGG GGGTGCATGC TGCTGCGTCG GGAGGCGCTC GAAAAGGCCG GCGGGTTCGC CGCCATGCGT GCCGCACGCA TAGATGACTG CACGCTGGCA GCGTTACTGA AGCCCCATGG CGCGATCTGG CTGGGTTTAG GGACGGAGAG CCATAGTCTG CGGGATTACC GCCAACTGGG CGAAATATGG CGGATGGTAG CACGTTCCGC CTACGTGCAA TTGCGGTTCA GCCCCTGGCG GCTGCTCGGA GCGACGTTGG TGATGGCTTT TCTCTATGCC GGGCCGGTAG GGGGGCTGCT GTATGGACTT TGGACCGAAA ATGTCTGGCT GGCGATACCC GCGTTGCTCG CCGGTCTGCT GATGCTGACG GCCTATGTGC CCACTTTGCG GCTTTACTCG CTGAACCCCC TCCGGGCGTT CAGTCTGCCC ATTGCGGCTC TGTTCTATAC ACTCATGACG CTGGATTCCG CCCGCCGTCA TTACGGCGGC CGGGGCGGAG CGTGGAAAGG GCGGCATTAT AGTGCATAG
|
Protein sequence | MWLLPAAWAV LAAWLVLFLG RGAFWRADQR LPPRESNPRR SWPEVTAVVP ARNEAEGIGA CVTALLRQDY PGVLRVIVVD DQSTDGTAKC AREAARQMDA SSRLEVLTGT PLPENWAGKV WAMDQGVTAA GRVPLLWFTD GDIVHGPDVL QRLVTRIEDR DLSLVSLMVM LSCRGFWERL LIPPFIFFFQ MLYPFPWVND PHRTLAGAAG GCMLLRREAL EKAGGFAAMR AARIDDCTLA ALLKPHGAIW LGLGTESHSL RDYRQLGEIW RMVARSAYVQ LRFSPWRLLG ATLVMAFLYA GPVGGLLYGL WTENVWLAIP ALLAGLLMLT AYVPTLRLYS LNPLRAFSLP IAALFYTLMT LDSARRHYGG RGGAWKGRHY SA
|
| |