Gene Msil_2384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2384 
Symbol 
ID7093936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2598665 
End bp2599684 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content67% 
IMG OID643465706 
Productbeta-ribofuranosylaminobenzene 5'-phosphate synthase family 
Protein accessionYP_002362676 
Protein GI217978529 
COG category[R] General function prediction only 
COG ID[COG1907] Predicted archaeal sugar kinases 
TIGRFAM ID[TIGR00144] beta-RFAP synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGCAA GCGTCACCGT CGTCGCCCCC GCGCGGCTGC ATCTCGGCTT TCTCGATCTT 
CACGGCGGGC TCAAGCGCCG TTTTGGCAGC ATCGGGCTGG CGATCGACCG GCCGGCGACG
CGGCTTCATA TTTCTCGCGC GGCGAGGAAC GCTGCGGCCG GGCCCGACTC CGAGCGCGCT
TTGGCGCATG TCGAGGCGCT GCAGCGGCGC CATGGCCTGT CCTCTTTCTA TGACGTCGCC
ATCGAGACAG CGATCCCCGA TCATGTCGGG CTTGGCTCGG GCACGCAGCT GGCGCTGGCG
CTGGGCGCCG GACTTCGCCT GCTCGAAGGG CTGCCGGCCG ACCCCGCCGA CGACGCGCTG
TTGCTGCAGC GCACGATGCG CTCCGGGATC GGCGCGGCGA TTTTTGAGCG CGGCGGCGTC
ATCGTCGACG GCGGCCGCGG CGAGCGGACG ATCACGCCCC CGGTGATCGC GCGGCTCGAC
TTTCCGCCGG CGTGGCGCGT CATCCTTGTG CTGGACCCCA CCCTGAAAGG CATCCACGGA
CCGGAGGAAA TTCGATCCTT CGCGGCGCTA AAGCCTTTTG AAGCCTCCGC ATCGGGCGAA
ATCTGCCGTC TCGTGCTGAT CCAGGCGCTT CCTGCTTTGG TCGAAGCCGA CATTGACGCC
TTCGGGGCGG CGATCACGCG GATTCAGGAG ATCGTCGGCG ACTATTTCGC GCCGGCGCAG
GGCGGCGGCG CCTTTACGAG CCCGCGCGTG GCGCAGGCGA TGGCCGAACT CGCCCGCCAT
GGCGCCAAAG GAATTGGCCA ATCCTCCTGG GGTCCGACCG GCTTCGCCTT CGCCGCCGAC
GCCGCCGACG CTGCGCGCAT TTGCGCGCTT TCGCGCGAAA AATCTAATGC CTTGGGAGTA
GACATTGCGA TATGCAAAGG ACTTAATCAC GGCGCGATCG TAAGGGGCGA TCTTCCTGAC
GGTCCGGCGC CGGCGCGGGC GACTGCGACC ATCGGGACAG ACCGGCCGAC AAGATCATGA
 
Protein sequence
MPASVTVVAP ARLHLGFLDL HGGLKRRFGS IGLAIDRPAT RLHISRAARN AAAGPDSERA 
LAHVEALQRR HGLSSFYDVA IETAIPDHVG LGSGTQLALA LGAGLRLLEG LPADPADDAL
LLQRTMRSGI GAAIFERGGV IVDGGRGERT ITPPVIARLD FPPAWRVILV LDPTLKGIHG
PEEIRSFAAL KPFEASASGE ICRLVLIQAL PALVEADIDA FGAAITRIQE IVGDYFAPAQ
GGGAFTSPRV AQAMAELARH GAKGIGQSSW GPTGFAFAAD AADAARICAL SREKSNALGV
DIAICKGLNH GAIVRGDLPD GPAPARATAT IGTDRPTRS