Gene Bind_3158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3158 
Symbol 
ID6201553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp3603126 
End bp3604274 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content58% 
IMG OID641707106 
Producthopanoid biosynthesis associated radical SAM protein HpnH 
Protein accessionYP_001834208 
Protein GI182680062 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID[TIGR03470] hopanoid biosynthesis associated radical SAM protein HpnH 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGAATTC CGCTTCTGCA GATGGCGCAG ATCGGCGCTT ATGTGGCGCG CCAGCAATTG 
ATGGGGCGCA AACGCTATCC GCTCGTCCTC ATGCTCGAGC CGCTGTTTCG TTGCAATCTC
GCCTGCGCTG GCTGTGGCAA GATCGATTAT CCGGATGAAA TCCTCAACCA GCGCCTGTCG
CTCGAGGACA GTCTCGCGGC CGTGGATGAA TGCGGCGCGC CCGTCGTCGT CATCGCCGGT
GGCGAGCCCT TGCTGCATCG TGATCTGCCG GCCATTGTCG AAGGCGCGAT GGCCAAGGGC
AAATATGTCA CGGTCTGCAC CAATGCATTG CTGCTCGAGA AGAATCTCGA TCGGTACAAG
CCGAACCGTT ATTTCAACTG GTCGATCCAT CTCGATGGCG ATGCCGGCAT GCATGACCAT
TCGGTCTGTC AGGATGGTGT CTACGAGCGT GCCGTCGCCG CCATGAAACT CGCGCAGAAG
CGCGGTTTCC GGGTCACGAT CAATTGCACT TTGTTCAATA ATGCCGACCC TGACCGTGTC
GCGGCCTTTT TCGACGAAAT GAAAAAACAG GGTATCGAAG GCATTACCGT TTCGCCGGGC
TATGCCTATG AGCGCGCGCC CGACCAGCAG CATTTCCTCA ATCGCGAGAA AACCAAGCAA
TTGTTCCGCG CAATCTTCTC GCGTGGCAAG AATGGCAAGG CCTGGCCTTT CTTCCAATCC
ATGCTATTCC TGGACTTCCT GGCCGGTAAT CGCACCTATC AATGCACGCC TTGGGGCAAT
CCGACGCGGA CTGTCTTCGG CTGGCAGCGC CCCTGTTATC TTTTGGGCGA AGGCTATGCC
CCCACGTTCA AGGCCTTGAT GGAAGAAACC GATTGGGATG CCTATGGCAC CGGCCGCTAT
GAGAAATGCG CCGATTGCAT GGTCCATTGC GGTTTCGAAG CGAGCGCCGT GCGGGAAGCT
TTTCAGCGTC CCTGGGAAAT GCTGGGCATT CTCCTGAAGG GCTTCCGGAC CTCCGGGCCG
ATGGTGCCGG ATCTTCCGCT CGCCTCACAA CGTCCCGCCA CTTACGTTTT CAACCAGCAG
GTCGAGGAAA AACTTTCCGA GCTGCATCAT CACAAGGCGG CCCGCGATCA TCTTTCGGCC
GCGGAATAA
 
Protein sequence
MGIPLLQMAQ IGAYVARQQL MGRKRYPLVL MLEPLFRCNL ACAGCGKIDY PDEILNQRLS 
LEDSLAAVDE CGAPVVVIAG GEPLLHRDLP AIVEGAMAKG KYVTVCTNAL LLEKNLDRYK
PNRYFNWSIH LDGDAGMHDH SVCQDGVYER AVAAMKLAQK RGFRVTINCT LFNNADPDRV
AAFFDEMKKQ GIEGITVSPG YAYERAPDQQ HFLNREKTKQ LFRAIFSRGK NGKAWPFFQS
MLFLDFLAGN RTYQCTPWGN PTRTVFGWQR PCYLLGEGYA PTFKALMEET DWDAYGTGRY
EKCADCMVHC GFEASAVREA FQRPWEMLGI LLKGFRTSGP MVPDLPLASQ RPATYVFNQQ
VEEKLSELHH HKAARDHLSA AE