Gene Msil_0072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0072 
Symbol 
ID7090387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp63468 
End bp64799 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content70% 
IMG OID643463405 
ProductThree-deoxy-D-manno-octulosonic-acid transferase domain protein 
Protein accessionYP_002360417 
Protein GI217976270 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1519] 3-deoxy-D-manno-octulosonic-acid transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.238755 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTTT CCCTCTATCG GGCCTGCAGC GCCGCCTTCG CACCGCTTGC TCCGCTTGTC 
CTGTGGTGGC GGGTTCGGCT TCGCGGCGGG CGGTCGCGCC AGGACGAGCG CAGGATCGCA
GCCGAGCGTC TGGGCAGCCC GTCGGCGGCT CGTCCGCAAG GCCGGCTTGT CTGGGTCGCT
GCGGCGACGG CAATTGACGC CACGCGCCTC CTGCCGCTGA TCGACAGGCT GGCCGCGGCC
GGATTCCATG TGCTGGTGAC GACGCGGGAT GACGAGGCCG CGCCGCCGCG GTTGCCCCCC
TTCGCGCTGC ATCAATATGC GCCGCTCGAC GTTCCGAAAT TCGCCGCCCG CTTTCTTGCG
TCCTGGCGTC CCGACGTCGC TCTGCTGGAC GGCGCCGAAT TTTGGCCGAA TCTGACGCGG
CAAATGCGCC GGCGCGGCGT TCCGGTCGCG CTCGTCGACG CGCATCTGTC GGCTCGCGCG
TTTGCCCTTT TAAGCCGAGC GCCAAAACTG GCGCGCGCTC TTCTTTCCGG ATTTGAGGCC
TGTCTCGCGC GCAGCGCCGC CGACATGGAG CGGCTGCGGC ATTTGGGCGC CGGCTATGCG
CAAATTGTCG GCGACCCGGC CTATGATCTT TCGCCGGAAC CTGCCGACAG CGCGGCGCTC
GCGCTGCTCT CGGCGCGCAT TGGCGCGCGG CCGGTGTGGG CGGCTTTCAC GGCCGATCAG
GCGGAGGCTG ACGTCGTTCT CGACGCGCAT CGCAAGATCG CGGCGAAACT GCCGGGCGTC
CTGACCATCA TCGCGCCGCG GCGGGCGAAA AGCGCCATCG AGATCGCCCT GCGTGCAAGC
AAGCTTGGAT TGGACGCGCG GGCTGCGACG GCCAGCGCCG GCGACGAAGC TTTGCCCGCG
ATCTTGATCC TCGCGGGGGC GGACGCCGGG ACGCTGTATC GCGCGGCGGG AGTCGTCTTC
CTCGGTCGAT CGCTGGGCGA CGCGATAAGC CGGTCGCTCG GCGTCGCCTC GGGGGGAGGC
GGCCTCAATC CGATCGAGGC GGCGAAGCTC GGCTGCGCGA TCCTGCGCGG GCCCGAGGTT
TCCGATTTTG CGGACAGCTA TGAGACGCTC GATCGGGCCG GCGGCTGCGC GCTGGTTCAT
GACGCGGAAT CGCTCGCGGC GGAGGTTACC CTGCTGCTCT TCGACGCCGC CGAACTTCGG
GCGATGGGCC GCGCCGCGGC CGAAGAGGTC GAGCGCCTGT CAGGCGCCTC GACACGGATC
ATGCAGGCGC TGTCGCCATT CCTGGCGCAG GTTTTCCTGA GGCCGGGCGT GGAGGACGAG
GCCGAGAGCT GA
 
Protein sequence
MLLSLYRACS AAFAPLAPLV LWWRVRLRGG RSRQDERRIA AERLGSPSAA RPQGRLVWVA 
AATAIDATRL LPLIDRLAAA GFHVLVTTRD DEAAPPRLPP FALHQYAPLD VPKFAARFLA
SWRPDVALLD GAEFWPNLTR QMRRRGVPVA LVDAHLSARA FALLSRAPKL ARALLSGFEA
CLARSAADME RLRHLGAGYA QIVGDPAYDL SPEPADSAAL ALLSARIGAR PVWAAFTADQ
AEADVVLDAH RKIAAKLPGV LTIIAPRRAK SAIEIALRAS KLGLDARAAT ASAGDEALPA
ILILAGADAG TLYRAAGVVF LGRSLGDAIS RSLGVASGGG GLNPIEAAKL GCAILRGPEV
SDFADSYETL DRAGGCALVH DAESLAAEVT LLLFDAAELR AMGRAAAEEV ERLSGASTRI
MQALSPFLAQ VFLRPGVEDE AES