Gene Msil_0147 details


Gene Information

Locus tag: Msil_0147
Symbol:
ID: 7090463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 142460
End bp: 143713
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 63%
IMG OID: 643463480
Product: tryptophan synthase subunit beta
Protein accession: YP_002360490
Protein GI: 217976343
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0133] Tryptophan synthase beta chain
TIGRFAM ID: [TIGR00263] tryptophan synthase, beta subunit
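
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the database record: it assumes the Start bp and End bp values are 1-based inclusive coordinates, and that the gene length includes a terminal stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 142460
END_BP = 143713
GENE_LENGTH_BP = 1254
PROTEIN_LENGTH_AA = 417


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both comparisons agree with the listed values: 143713 - 142460 + 1 = 1254 bp, and 1254 / 3 - 1 = 417 aa.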


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.77335
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAGG CTTTTGTTGC GCAAAACCTC AATTCCTATC GCACCGGCCC CGACGAGCGC 
GGCCATTTTG GCGTCTTTGG CGGCCGCTTC GTCGCCGAAA CCCTGATGCC GCTGATTCTC
GACCTCGAGC GTCACTATGA GGCCGCACGA TCCGATCCCG CCTTCAAGGC CGAACTCGAT
AATCTCCTTA CCCATTACGT CGGCCGGCCG AGCCCGCTCT ATTATGCCGA GCGCATGACC
GAGTTTTTTC GACGCAAAGC GGCCGACGCC GGAAGCGAGG GCGGCGCAAA AATCTACTTC
AAGCGCGAGG ATCTGAACCA CACCGGCGCG CATAAGATCA ACAATGTGCT CGGGCAGATT
CTGCTGGCGC GGCGCATGGG CAAGACGCGC ATTATCGCCG AGACGGGCGC CGGCCAGCAT
GGCGTCGCGA CCGCCACCGC CTGCGCGCGC TTCGGGCTCG ATTGCGTCGT CTATATGGGA
TCGGTCGACG TCGAGCGGCA AAAGCCCAAT GTGTTTCGCA TGAAACTGCT CGGCGCCAAG
GTCGTGCCGG TGGAATCGGG CGCGAGGACG CTGAAGGACG CCATGAACGA GGCGCTGCGC
GACTGGGTCA CCAATGTCGC CGATACTTTT TATTGCATCG GCACCGCGGC GGGGCCGCAT
CCCTATCCCG CGATGGTGCG CGATTTCCAA TGCATCATCG GCGATGAGAC GAGGCGTCAG
ATGCGCGAGG CCGAGGGCCG CCTGCCGGAT TCCTTGCTGG CGTGCATCGG CGGCGGCTCG
AACGCCATCG GCCTGTTTCA CCCTTTCCTC GACGATCCGT CGGTTGAGAT TTACGGGGTG
GAGGCGGCGG GTTTTGGCCT TGATGACAAG CACGCCGCCT CGCTCGCGGG CGGGCGCCCC
GGCGTGCTGC ACGGCAATCG CACCTATCTC CTGATGAATG CGGACGGCCA GATCGAGGAA
GGCCATTCGA TCTCGGCCGG CCTCGACTAT CCGGGCATCG GCCCCGAACA TTCCTGGCTG
AAGGAATCCG GCAGAGTGAC CTATCTGTCC GCGACCGACG CGGAAGCGCT GGCCGCCTTC
GAGCAATGCT CGAAGCTTGA GGGCATCATC CCGGCGCTGG AGCCGGCCCA TGCGCTGGCC
AAGGTCGGCG ACATCGCGCC GTTGAAGCCG CAGGATCATC TGATGGTGGT GAATATTTCC
GGCCGCGGGG ACAAGGATAT TTTCACCGTC GCCGAACACC TCGGCGGGAT GTAA
 
Protein sequence
MSQAFVAQNL NSYRTGPDER GHFGVFGGRF VAETLMPLIL DLERHYEAAR SDPAFKAELD 
NLLTHYVGRP SPLYYAERMT EFFRRKAADA GSEGGAKIYF KREDLNHTGA HKINNVLGQI
LLARRMGKTR IIAETGAGQH GVATATACAR FGLDCVVYMG SVDVERQKPN VFRMKLLGAK
VVPVESGART LKDAMNEALR DWVTNVADTF YCIGTAAGPH PYPAMVRDFQ CIIGDETRRQ
MREAEGRLPD SLLACIGGGS NAIGLFHPFL DDPSVEIYGV EAAGFGLDDK HAASLAGGRP
GVLHGNRTYL LMNADGQIEE GHSISAGLDY PGIGPEHSWL KESGRVTYLS ATDAEALAAF
EQCSKLEGII PALEPAHALA KVGDIAPLKP QDHLMVVNIS GRGDKDIFTV AEHLGGM
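
As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the following sketch strips the block spacing, translates codons, and computes a GC fraction. It is a minimal example and not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table (the two differ in permitted start codons), and the function names are hypothetical. Only the first 30 bases of the gene sequence are used in the example call.

```python
# Codon-to-amino-acid assignments in TCAG order. Table 11 shares these
# assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    prefix = clean_sequence("ATGAGCCAGG CTTTTGTTGC GCAAAACCTC")
    print(translate(prefix))  # MSQAFVAQNL
```

The printed residues match the first ten residues of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.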