Gene Msil_1006 details


Gene Information

Locus tag: Msil_1006
Symbol:
ID: 7091834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1091108
End bp: 1093012
Gene Length: 1905 bp
Protein Length: 634 aa
Translation table: 11
GC content: 70%
IMG OID: 643464345
Product: hypothetical protein
Protein accession: YP_002361337
Protein GI: 217977190
COG category:
COG ID:
TIGRFAM ID:
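
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming one untranslated stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(1091108, 1093012)
print(length_bp)                           # 1905
print(expected_protein_length(length_bp))  # 634
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields listed in the record.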


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.0103184
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGATT ACTATCCGCT GCTCGCCAGG GCCGTCGCAG GGCTCGCCGA TCCGACCCCG 
CAAGCGCGCA GCGCCATCTA TGACCGCGCC CGCAATGCGC TGCTCGGGCA GTTGCGCCGT
CTCGACCCGC CGATCGCCGA TGAAGAAATC GATCGCGAAA GCGTCGCGCT CGAAGACGCC
GTCGCGCGGC TCGAGGCCGA TTTCACTCCG AAGCCGCCTG AAGCTGCGTC GCCCGAACCC
CCGCCGCCGG AGTTCCACCC GCCGCAGCCT CAGCCCGAGC GCGCGGACAT CGCCGCGGCG
CCGATCGAGC CGACGGTCGC CGAGCGCCCC AATGAGCCAT CCGCGCCGCA TGAGATCGCG
GGCGAACCTT CTTTTGCCGG GACCCCTGAC GCGCCCCATG CGCCGGAAGC GGAACCGCAG
GCGGAGCCTC AGGAGCGCCC GCCTGCCGCA GCGGCGGAGA CGCCGGCGCC GGCTGCTTCC
CCTCCCGGGG AGGGCGCTGA TGACAACGCG GCGCAAAAGA TCAAGGACGA TTCGGCAAAG
CCCGACTTCA GAGCCGGGCG GCCGTCGAGC CTGCAGACGC GCCCGCCGAT GTTCTTCAAA
GCGCGGCGCG ACAAGACGAC CCCGGAGCCG GCGCCCGGCG CGATCCTTCC GCCGCGCCGG
CCGCTGGGAA CGAGTTTGCT GCGCATGCCG CCGCGAGTCC CGGCGGCGCC GGTGGCCGCT
CCCGCGCCAG GATTCACGCC TGAGCCGCCG CCCAAAGCTG AGCCTGCTCC CGAGCCTCTC
CCGGACCAGC CGGCGCACGG GGACGAAAAC GGGTGGCGGG AGCCCGTATT CGAGCCGCGC
GAGGCGGAGC TGCGGCGCTG GGAGATGGAG GGCGAGCCGC GATGGGACGC CCCCGCGCCG
GCTCCCCTCG GTGAGGCGCC GGCGGATGAA TGGGCGCGGT CCGATTTTGC GCCGCCCGCG
CCTGAGGCCG ACGCCCCCGC GCGCGACGAA GGCGGGTCCG ATCCGGCGTC GGCGGCTCCG
ATCCGGCCGC GCGCGGAGGC GCAGCGGCCC TTCGCGCCGC AGCCGCGCCG CGAGGACGCC
GCGCCGCGCC GACTGTGGCT CGTCGGCTTT ATCCTTGGAC TGCTGGTGTT TCTCGTCGCG
ATCGCGGCCT ATCAGCTGCG CGACCGGCCT GAAGAATTGC GGCAGCGCGC CGCCGCGCCG
CTCATCATCC CGGAGCCCGC GCCCAGCGGC AAGATTGTCG ATCGTATTGG CGGCGGCCAG
AACCAGCCGC AAGATTCCGC GGCCGGATCG CCGCCTGCGA CGTCCCCGGC GCCGTCTTCT
GCGGCGCGCG GCGCCCCGAC TGCGGTTGAG CCCGAAACCC AGAGCGCGCG CCGGGCGGCG
CTGCTCGTCG AAGCGCCTGA GGAGCCGAAC AAGGTCAGAA CCTTCCTCGG CGCGGTAAAC
TGGAAAGTCG ATAATGTCAC CAGCGGGCCG AATGACCCGC TGAGCATGGC GGTGCATGCG
ACGGTCGAGA TTCCGGAAGA AAAGCTCGAA ATCGTCATGA CGCTGCAGAA AAACTTCGAC
AGCAGCCTGC CGGCCTCGCA CACAATGAAG ATCCAGTTCA TCGAGGGCGC GGACAGCCCG
CTCGGCTCCG TGCAGCAGAT CAGCGTGCCG CAGATGCGCC TCGAGGACAC GGCGACCGGC
GACGCGCTGA ACGGTGTCCC CGTTCAGATC ACCGACAATA CATTTCTCGT CGGGCTCACA
AGCGGGAGCC CGGAGGCGGG CAATCTCGAT CTGTTGAAGT CGCGCGGATG GATCGACGTG
CCGATCCTCT TGAGCAACGG CAAGATCGCC AAGCTGACGT TCGAGAAGGG TCCCGCCGGC
GACCGCGCGA TCGACGATGC GATCGCCGCC TGGAAGGGGC AATAG
 
Protein sequence
MADYYPLLAR AVAGLADPTP QARSAIYDRA RNALLGQLRR LDPPIADEEI DRESVALEDA 
VARLEADFTP KPPEAASPEP PPPEFHPPQP QPERADIAAA PIEPTVAERP NEPSAPHEIA
GEPSFAGTPD APHAPEAEPQ AEPQERPPAA AAETPAPAAS PPGEGADDNA AQKIKDDSAK
PDFRAGRPSS LQTRPPMFFK ARRDKTTPEP APGAILPPRR PLGTSLLRMP PRVPAAPVAA
PAPGFTPEPP PKAEPAPEPL PDQPAHGDEN GWREPVFEPR EAELRRWEME GEPRWDAPAP
APLGEAPADE WARSDFAPPA PEADAPARDE GGSDPASAAP IRPRAEAQRP FAPQPRREDA
APRRLWLVGF ILGLLVFLVA IAAYQLRDRP EELRQRAAAP LIIPEPAPSG KIVDRIGGGQ
NQPQDSAAGS PPATSPAPSS AARGAPTAVE PETQSARRAA LLVEAPEEPN KVRTFLGAVN
WKVDNVTSGP NDPLSMAVHA TVEIPEEKLE IVMTLQKNFD SSLPASHTMK IQFIEGADSP
LGSVQQISVP QMRLEDTATG DALNGVPVQI TDNTFLVGLT SGSPEAGNLD LLKSRGWIDV
PILLSNGKIA KLTFEKGPAG DRAIDDAIAA WKGQ
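
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch, assuming plain Python with no external libraries. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCTGATT ACTATCCGCT GCTCGCCAGG GCCGTCGCAG GGCTCGCCGA TCCGACCCCG"
last_line = "GACCGCGCGA TCGACGATGC GATCGCCGCC TGGAAGGGGC AATAG"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))            # 60
print(head[:3])             # ATG (start codon)
print(tail[-3:])            # TAG (stop codon)
print(gc_fraction("ATGC"))  # 0.5
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field in the record.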