Gene Msil_3245 details

Gene Information

Locus tag: Msil_3245
Symbol:
ID: 7090660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 3560267
End bp: 3561415
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 62%
IMG OID: 643466553
Product: hopanoid biosynthesis associated radical SAM protein HpnH
Protein accession: YP_002363514
Protein GI: 217979367
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID: [TIGR03470] hopanoid biosynthesis associated radical SAM protein HpnH
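
The coordinates and lengths listed above are mutually consistent. A minimal sketch of that check follows. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3560267
end_bp = 3561415

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not add a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1149 382
```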


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.207112
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGGAATTT CAGTCCTTCA GATGGCTCAG ATCGGCGGCT ATGTCGTCCG CCAGCGCATC 
GCCGGCCGCA AACGCTTTCC GCTGGTGCTG ATGCTTGAGC CGCTGTTTCG GTGCAATCTC
GCTTGCGCTG GCTGTGGCAA GATCGACTAT CCCGACGAAA TCCTGAACCA GCGCCTACCC
CTCGCCGATT GCCTCGGCGC GATCGACGAA TGCGGCGCCC CTGTCGTCGT CATCGCCGGC
GGCGAGCCGC TCCTGCATAA GGAGCTGCCG CAGATGGTTG AAGGCGCCAT CGCCCGGCGC
AAATTCGTGA CGGTGTGCAC CAATGCCCTA CTGCTCGAGA AGAATCTGCA CAAATACAAG
CCGAACCGCT ACTTCAACTG GTCGATCCAT CTCGATGGCG ACAAGGAGAT GCACGACCAT
TCGGTCTGCC AGACCGGAGT TTACGACCGC GCGCTGAAGG CGCTGAAACT CGCCAAGGAG
GCGGGATTTC GCGTCACCAT CAATTGCACG CTGTTCAACA ACGCCGAGCC CGCGCGCGTC
GCCGCCTTCT TCGACGAGGT CGCCGCGATC GGCATCGAGG GCATCACCGT CTCGCCCGGC
TATGCCTATG AGCGCGCGCC CGACCAGCAG CATTTTCTCA ACCGCGAGAA GACCAAGACG
CTGTTTCGCG ACATCTTTTC ACGCGGCAAA GGGGGAAAGG CCTGGCCCTT CTTCCAGTCG
ACGCTTTTCC TCGATTTCCT CGCCGGCAAC CGCACCTATC ATTGCACGCC CTGGGGCAAT
CCGACCCGGA CCGTGTTCGG CTGGCAGCGG CCCTGCTATC TTCTCGGCGA AGGCTATGTC
TCGACCTTCA AGGAATTGAT GGAGGGGACC GACTGGGACC GCTACGGCAC CGGTAATTAT
GAAAAATGCG CCGACTGCAT GGTCCATTGC GGCTTTGAGG CGACCGCGGT CGACGAAGCG
TTCAAGAATC CGCTCGCGGC GCTAAAAGTC GCCATCGGCG GCATCCGGAC CACGGGGCCG
ATGTCGCCCG ACATCGCGCT AGACCGGCAG CGGCCCGCCG AGTTCGTCTT CAGCCGGCAT
GTTCAGCACA AGCTCGACGA GATCCGGCAG GGCAAAGCCG AGGGCGAGCA TCTTTCGGCC
GCGGAGTAG
 
Protein sequence
MGISVLQMAQ IGGYVVRQRI AGRKRFPLVL MLEPLFRCNL ACAGCGKIDY PDEILNQRLP 
LADCLGAIDE CGAPVVVIAG GEPLLHKELP QMVEGAIARR KFVTVCTNAL LLEKNLHKYK
PNRYFNWSIH LDGDKEMHDH SVCQTGVYDR ALKALKLAKE AGFRVTINCT LFNNAEPARV
AAFFDEVAAI GIEGITVSPG YAYERAPDQQ HFLNREKTKT LFRDIFSRGK GGKAWPFFQS
TLFLDFLAGN RTYHCTPWGN PTRTVFGWQR PCYLLGEGYV STFKELMEGT DWDRYGTGNY
EKCADCMVHC GFEATAVDEA FKNPLAALKV AIGGIRTTGP MSPDIALDRQ RPAEFVFSRH
VQHKLDEIRQ GKAEGEHLSA AE
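
The gene sequence begins with TTG rather than ATG, yet the protein begins with M. Under translation table 11 an alternative initiator codon such as TTG is read as methionine when it starts the coding sequence. The sketch below illustrates this on the first 30 bases of the gene. It is a simplified illustration, not the database's own pipeline: the `translate_cds` helper is hypothetical, and the initiator set is the one given for NCBI table 11 as best recalled here.

```python
# Codon table in NCBI order (bases T, C, A, G); table 11 uses the same
# amino acid assignments as the standard code for internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Assumption: initiator codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Hypothetical helper: whitespace is ignored, and a recognized initiator
    codon in first position is written as M.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("TTGGGAATTT CAGTCCTTCA GATGGCTCAG"))  # MGISVLQMAQ
```

The result matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function should reproduce the complete 382-residue protein, since the final codon TAG is a stop.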