Gene Msil_0149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0149 
Symbol 
ID7090465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp145035 
End bp146027 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content64% 
IMG OID643463482 
Productbiotin synthase 
Protein accessionYP_002360492 
Protein GI217976345 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0502] Biotin synthase and related enzymes 
TIGRFAM ID[TIGR00433] biotin synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.464836 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACATC AGGCCGCCGA CCCGTTCACC CGGCATGATT GGACTCTCAG CGAAATTATC 
GAGATCTATA AATCGCCGCT GCTGGAGCTG ATCGCGCGGG CGAATGCGAT CCATCGCGAA
TTTCATGACG TCAACGACGT GCAGAAGGCG AGCCTCCTCA GCATCAAGAC CGGCGGCTGC
CCGGAAAATT GCGGCTATTG TCCGCAATCG GCGCATCACA AGGAAGTGCA TCTCGACCGC
ATCCATATGA TGAAGCCGGA TGAAGTCCTT GAGATCGCGG CGCGCGCGAA GGCGGCGGGC
GCAGACCGTT TCTGCATGGG CGCGGCCTGG CGCAGCGTGC GCGACGGCCG CGAATTCGAT
TCGGTGATCG AGATGGTGCG CGGCGTGCGC GAACTCGGGC TGGAGGCCTG CGTCACGCTC
GGCATGCTCA ATGCGGCGCA GGCCGGGCGG CTTAAGGAAG CCGGCCTCAC GGCCTATAAT
CACAACCTCG ACACCGGCCC CGACTATTAC GACAAGATCG TAACGACCCG CAGCTATGAC
GACCGGCTCG ATACGCTGAA GGCCGTGCGC GGCGCCGGCA TCGAAATGTG CTGCGGCGGC
ATTGTCGGCA TGGGCGAGAG CGTCGCCGAC CGCGCCGCGA TGCTGCAGAC GCTGGCGCGC
TTTGATCCGC ATCCCGAAAG CGTGCCGATC AATGCGCTGG TGCCCGTCGA AGGCACGCCG
CTGGGCGAGC GCGAACGGAT CGATCCGCTG GAATTCGTGC GCATGATTGC GGTCACGCGC
ATCGTGCTGC CGGCCTCCCG CGTGCGGCTT TCGGCCGGCC GCTCCGTGCT GAACCGCGAG
GCGCAGGTTC TGTGCATGGT GGCCGGCGCG AATTCGATTT TTTATGGCGA GCAGCTGTTG
ACGACGCCGA ATGTCGGCGA AACCGATGAC GACGCCCTCT TCGCCGCGCT GGCGCCGCAA
TGCCAGCTGA AAGACGCCCT GCCGACGGAT TAA
 
Protein sequence
MTHQAADPFT RHDWTLSEII EIYKSPLLEL IARANAIHRE FHDVNDVQKA SLLSIKTGGC 
PENCGYCPQS AHHKEVHLDR IHMMKPDEVL EIAARAKAAG ADRFCMGAAW RSVRDGREFD
SVIEMVRGVR ELGLEACVTL GMLNAAQAGR LKEAGLTAYN HNLDTGPDYY DKIVTTRSYD
DRLDTLKAVR GAGIEMCCGG IVGMGESVAD RAAMLQTLAR FDPHPESVPI NALVPVEGTP
LGERERIDPL EFVRMIAVTR IVLPASRVRL SAGRSVLNRE AQVLCMVAGA NSIFYGEQLL
TTPNVGETDD DALFAALAPQ CQLKDALPTD