Gene Msil_1061 details

Gene Information

Locus tag: Msil_1061
Symbol:
ID: 7091890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1148957
End bp: 1150258
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 58%
IMG OID: 643464401
Product: glycosidase PH1107-related
Protein accession: YP_002361392
Protein GI: 217977245
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2152] Predicted glycosylase
TIGRFAM ID:
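
The length fields above follow from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Coordinates copied from the record above; assumed 1-based and inclusive.
START_BP = 1148957
END_BP = 1150258


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1302 433
```

Both values agree with the Gene Length and Protein Length fields in the record.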


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.0491073
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCACAAG CCAAATTTCT CAACCGGCAG GCGCTGTATC TACGCCCTGA TCCCGCACGG 
GTGGTCGTGC GGCCGTTCAA GCCGGCGACC GAGCCGCGCG ATCTCAATCC CACCGACAAG
ACGCGCGCCA ATCATATCGT AGAGCGGGTG CTCGCGCTCG ATGTCGACAC GGCCGCCCAC
CAGCTCGACG ACGTTCTTGA GAATTTCAAC GGCCGCCATC GCAACCTGCT GGAGACTTTC
GAGGCGCGGG CCGACGAAAT GGAAGACGCC TTCAGCGCGC ATTCCACATT TAGCAAGATA
CAGCGCCAAC TGGTTGGAGC CTATTTTCTG CACGAATATT CCTTCGAAGC GGCCGCCTTG
TTCAATCCCA GCATTGTGTT GCATCCGGAT CAATCCGGGG CGCCGGAGGG CGGCAGCCGC
TTTATCCTCA GTCTTCGCGG CGTAGGCGAA GGGCATATTT CCTCGCTGAC GTTTCGCTCG
GGAGCCATTG CCGCCGATGG GGCCGTGAGC GTCGATCCGC CGGCGCGGCT CGCATCAATT
CCAAAAGTGG CGAAGCGGAT CCCCGGACCC TATGGCGATT GCGTTGACGT GATTTTCAAG
CCGAACGAAG ACATCAGCGA ACGGGTGATC TTTCCGATCA CCGAAACGCA GACGAACGGC
ATCGAAGACG CCCGTTTCGT CGAGTTTTCC GACGGCGGAA AGAAGACATT TTATGCGACC
TATACAGCCT ATAGCGGCGC GGCGATAAGA TCTGAATTGT TGCAGACCTC GGACTTTGTA
TCGTTCCGGT TGTCGCCTCT GAAAGGCTCT GCCGCGCGCA ACAAGGGCAT GGCCCTGTTC
CCCAGGAAGA TCAACGGCAA ATACGCCATG ATCGGCCGGC AGGATAATGA GAATCTTTAC
CTTATCTATT CGGACGATCT ATACGCATGG GACGGCGGCC AGCCCATTCT AAAGCCACGG
TTTCCCTGGG AATTCGTGCA GATCGGCAAT TGCGGATCTC CAATCGAGCT TGACGAGGGC
TGGCTGTTGC TGACCCACGG CGTTGGCCCG GTTCGAAAAT ATTCAATCGG CGCGGTCCTG
CTGGACAAGC GCGACCCCTC CAAAGTGTTG GCGCGCTCGC GCGAGCCGCT GGTCAGGCCC
GACCCTTCCG AACGTGAGGG CTACGTCCCG AATGTCGTCT ATACATGCGG GGCGATACGT
CACAACGATC AGATCATTTT GCCTTATGCG ATCTCCGACA CCTTCTCCAA TTTTGCAACG
ATGAAGATTG ACGCGCTGCT GGCGAGCCTC GACAGGTCGT GA
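
A GC content figure such as the 58% reported for this gene can be computed from a sequence in this 10-base block layout. The following is a minimal sketch, assuming GC content means the percentage of G and C among the A, C, G and T bases. How the database rounds its value is not stated in the record, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Spaces, line breaks and any other characters are ignored, so the
    blocked sequence text can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence above.
fragment = "TTGTCACAAG CCAAATTTCT"
print(gc_content(fragment))  # 35.0
```

Passing the full gene sequence instead of the fragment gives the value for the whole gene.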
 
Protein sequence
MSQAKFLNRQ ALYLRPDPAR VVVRPFKPAT EPRDLNPTDK TRANHIVERV LALDVDTAAH 
QLDDVLENFN GRHRNLLETF EARADEMEDA FSAHSTFSKI QRQLVGAYFL HEYSFEAAAL
FNPSIVLHPD QSGAPEGGSR FILSLRGVGE GHISSLTFRS GAIAADGAVS VDPPARLASI
PKVAKRIPGP YGDCVDVIFK PNEDISERVI FPITETQTNG IEDARFVEFS DGGKKTFYAT
YTAYSGAAIR SELLQTSDFV SFRLSPLKGS AARNKGMALF PRKINGKYAM IGRQDNENLY
LIYSDDLYAW DGGQPILKPR FPWEFVQIGN CGSPIELDEG WLLLTHGVGP VRKYSIGAVL
LDKRDPSKVL ARSREPLVRP DPSEREGYVP NVVYTCGAIR HNDQIILPYA ISDTFSNFAT
MKIDALLASL DRS
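
The protein sequence corresponds to the gene sequence translated with table 11, the bacterial, archaeal and plant plastid code. The gene starts with TTG, which table 11 accepts as an initiator and which is read as M in first position. The sketch below assumes the standard NCBI amino acid assignments and the table 11 initiator set. `translate_cds` is a hypothetical helper written for illustration, not a database function.

```python
from itertools import product

# Codon order follows the NCBI convention: bases in T, C, A, G order,
# first position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiator codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first. If the first codon is a table 11
    initiator it is translated as M; otherwise it is looked up normally.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate_cds("TTGTCACAAG CCAAATTTCT CAACCGGCAG"))  # MSQAKFLNRQ
print(translate_cds("GACAGGTCGT GA"))  # DRS
```

The two fragments reproduce the first ten and last three residues of the protein sequence shown above.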