Gene Lferr_0443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_0443 
Symbol 
ID6876395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp418601 
End bp419749 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content64% 
IMG OID642788316 
Producthopene-associated glycosyltransferase HpnB 
Protein accessionYP_002218904 
Protein GI198282583 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03469] hopene-associated glycosyltransferase HpnB 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000679717 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTGGCTGT TACCGGCCGC GTGGGCCGTT CTGGCGGCCT GGCTGGTGCT GTTCCTCGGG 
CGGGGCGCTT TCTGGCGGGC AGATCAGCGT CTTCCACCCC GCGAAAGCAA CCCACGGCGA
AGCTGGCCGG AAGTCACCGC CGTAGTGCCT GCGCGTAATG AGGCAGAGGG TATTGGTGCC
TGCGTAACGG CCCTGCTGAG GCAGGATTAC CCCGGCGTCT TGCGGGTCAT CGTGGTGGAT
GACCAGAGTA CGGATGGCAC TGCAAAGTGC GCACGGGAGG CGGCGCGGCA GATGGATGCC
TCCTCCCGTC TGGAGGTGCT GACCGGCACG CCGCTGCCGG AAAACTGGGC GGGTAAGGTC
TGGGCCATGG ACCAGGGGGT GACCGCGGCC GGCCGGGTGC CGCTTCTCTG GTTCACCGAC
GGTGACATCG TGCATGGGCC GGATGTCCTG CAGCGCCTCG TCACCCGGAT AGAGGATCGG
GACCTGTCAT TGGTCTCGCT GATGGTGATG CTCTCCTGCC GCGGTTTTTG GGAACGTCTT
CTGATCCCGC CCTTCATCTT CTTCTTCCAG ATGCTGTACC CCTTTCCCTG GGTCAACGAC
CCACACCGTA CTCTGGCCGG TGCGGCGGGG GGGTGCATGC TGCTGCGTCG GGAGGCGCTC
GAAAAGGCCG GCGGGTTCGC CGCCATGCGT GCCGCACGCA TAGATGACTG CACGCTGGCA
GCGTTACTGA AGCCCCATGG CGCGATCTGG CTGGGTTTAG GGACGGAGAG CCATAGTCTG
CGGGATTACC GCCAACTGGG CGAAATATGG CGGATGGTAG CACGTTCCGC CTACGTGCAA
TTGCGGTTCA GCCCCTGGCG GCTGCTCGGA GCGACGTTGG TGATGGCTTT TCTCTATGCC
GGGCCGGTAG GGGGGCTGCT GTATGGACTT TGGACCGAAA ATGTCTGGCT GGCGATACCC
GCGTTGCTCG CCGGTCTGCT GATGCTGACG GCCTATGTGC CCACTTTGCG GCTTTACTCG
CTGAACCCCC TCCGGGCGTT CAGTCTGCCC ATTGCGGCTC TGTTCTATAC ACTCATGACG
CTGGATTCCG CCCGCCGTCA TTACGGCGGC CGGGGCGGAG CGTGGAAAGG GCGGCATTAT
AGTGCATAG
 
Protein sequence
MWLLPAAWAV LAAWLVLFLG RGAFWRADQR LPPRESNPRR SWPEVTAVVP ARNEAEGIGA 
CVTALLRQDY PGVLRVIVVD DQSTDGTAKC AREAARQMDA SSRLEVLTGT PLPENWAGKV
WAMDQGVTAA GRVPLLWFTD GDIVHGPDVL QRLVTRIEDR DLSLVSLMVM LSCRGFWERL
LIPPFIFFFQ MLYPFPWVND PHRTLAGAAG GCMLLRREAL EKAGGFAAMR AARIDDCTLA
ALLKPHGAIW LGLGTESHSL RDYRQLGEIW RMVARSAYVQ LRFSPWRLLG ATLVMAFLYA
GPVGGLLYGL WTENVWLAIP ALLAGLLMLT AYVPTLRLYS LNPLRAFSLP IAALFYTLMT
LDSARRHYGG RGGAWKGRHY SA