Gene Msil_1423 details


Gene Information

Locus tag: Msil_1423
Symbol:
ID: 7091763
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1538567
End bp: 1540276
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 69%
IMG OID: 643464761
Product: Heparinase II/III family protein
Protein accession: YP_002361750
Protein GI: 217977603
COG category: [S] Function unknown
COG ID: [COG5360] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
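
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(1538567, 1540276)
print(length_bp)                           # 1710
print(expected_protein_length(length_bp))  # 569
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.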


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.332975
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCGGCG CGATTTTGAA GGATCGGCTG CATGTCGCGG GCCTCGTTCT CGCGCGCGGC 
GCGGCGGCGG CGTGGCGCTT CGCCGCGAGC CCCTGGCGCG CGCTCGCCGG GCTGCGGCGC
CGGCCGCCCG AACGGCTGCT GATCGCGCCG CAGGACATCC GAACCAGCGA TCCGACGCTC
GCCGCCGATA TTTACGCCGG CTATTTCGCT TTCGGCGGCA AGATCGTCAA CGCGCATGGA
CGCTCGCCTT TCGAGCTTGA GCCGCAATCG GACGCCTGGG CGCGCTCGCT CGCCAGTTTC
AGCTGGCTGC GCCATCTGCG CGCGGCCGAT ACGGCGATCT CGAAAGCGAA TGCGCGGGCG
CTGGTCGAGG ATTTTCTCGT CGATTTCAGC AAGCCCGGCG CGAACGCGGC CTGGGAGCCG
CGCGTCGCGG CCAGACGGCT GCTCGCCTTC CTCTCGCAGT CGCCGATCAT CCTGCATGGC
GCCGACCGCG CTTTCTATCG CCGCTTCATG AAGGCGATCG GGCGCACGCA GCAATTTCTC
GAACGCAAGA TGGCCGAGGG GCTCGCCGGC GAGGACCGGC TGCTTGTCGC GATCGCGCTG
GTCGAGCTCA GCCTTTGCGC CGAGGACGCC GGAAAATTGC GCCAGCGCGC CGGCCGGATG
CTCGCCGAGG AGCTGCAGCG CCAGATCCTG CCGGATGGCG GCCACATCAG CCGCAATCCG
CAGATCCTGA TCGACCTGCT GCTCGATCTT CTGCCGCTGC GTCAGGCCTA TGCCGCGCGC
GGCGCGCAGC CGCCGCCGCA GCTTCTCAAC GCGATCGACC GCATGATGCC CATGCTGCGG
CTGTTCCGCC ATGGCGACGG CGCGCTGGCG CTGTTCAACG GAATGGGCGT GACGCCGCCG
GAGCAGCTTG CGACGGTGCT CGCCTATGAC GACAGCCGGG CGCGGGCGCT GACCAATGCG
CCCCATTCCG GCTATCAGCG GCTGGAGGGG CAGGACGCGG TCGTCGTCGT CGACGCCGGC
CGGCCGCCGC CGCCGGTGTT TTCGACGCGC GCCCACGCCG GCTGCGCCTC GTTTGAATTT
TCGGTCGGCG CGCAGCGTCT CGTGTTGAAT TGCGGGGCGC CGGAGGCAAA CCGAGCCGCC
GCGCGCGAGG CGGCGCGCAT GACCGCCGCC CATTCGACGC TTGTCGTCGA CGATCTGTCC
TCATCGCGCT TCGCCTTTCA TCTCGGCTTG CGCAAATGGC TCGGGGACGA AATCGTGTCG
GGGCCGGAAC AGGTCGAGAT CGAACGCCGC GACGAGGCGG CGGGCTCGAC TCTCGTGGTC
CAGCACGACG GTTATGCCTC CCGCTTCGGC CTCATTTGCC AGCGCCGCCT CGTGTTGCAC
AAGGACGGCA AATGGCTCGA CGGCGCTGAC CGCATGGTCG CCGCGACGCC CGGCGACGCC
ATCGAGCGCC GCCCCTTCGC GGTGCGCTTC CACATCCATC CCAATGTGCG GCTGAAGCGG
GTGCGCGAGG GCCATGCGGT GTTGTGCCTG CTTCCGAACG GGCGGCGCTG GCTGTTCGAG
ACGCCCTGGA TCGCCGAGAT CGAGGAGAGC ATTTTCTTCG CCGCCCCGGA TGGACCAAGA
GCCTGCTCTC AGATCGTGCT CGAGGGCGAG ACGCGCGACG GCCTGGAGCT GACTTGGAGC
TTTCGGCAGG CGGAGAAGAA GAAGCGGTAG
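
The GC content field of the record can be recomputed from a sequence formatted like the one above. This is a minimal sketch, assuming the sequence is plain A/C/G/T text with spaces and line breaks as the only formatting. `gc_content` is a hypothetical helper name, and the demonstration uses only the first two 10-base blocks of the gene sequence.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence: 12 of 20 bases are G or C.
print(gc_content("GTGGCCGGCG CGATTTTGAA"))  # 60.0
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content reported in the Gene Information section.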
 
Protein sequence
MAGAILKDRL HVAGLVLARG AAAAWRFAAS PWRALAGLRR RPPERLLIAP QDIRTSDPTL 
AADIYAGYFA FGGKIVNAHG RSPFELEPQS DAWARSLASF SWLRHLRAAD TAISKANARA
LVEDFLVDFS KPGANAAWEP RVAARRLLAF LSQSPIILHG ADRAFYRRFM KAIGRTQQFL
ERKMAEGLAG EDRLLVAIAL VELSLCAEDA GKLRQRAGRM LAEELQRQIL PDGGHISRNP
QILIDLLLDL LPLRQAYAAR GAQPPPQLLN AIDRMMPMLR LFRHGDGALA LFNGMGVTPP
EQLATVLAYD DSRARALTNA PHSGYQRLEG QDAVVVVDAG RPPPPVFSTR AHAGCASFEF
SVGAQRLVLN CGAPEANRAA AREAARMTAA HSTLVVDDLS SSRFAFHLGL RKWLGDEIVS
GPEQVEIERR DEAAGSTLVV QHDGYASRFG LICQRRLVLH KDGKWLDGAD RMVAATPGDA
IERRPFAVRF HIHPNVRLKR VREGHAVLCL LPNGRRWLFE TPWIAEIEES IFFAAPDGPR
ACSQIVLEGE TRDGLELTWS FRQAEKKKR
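
The protein sequence follows from the gene sequence through the translation table given in the record (table 11). The sketch below is an illustration under stated assumptions, not the database's own pipeline. It assumes that sense-codon assignments in table 11 match the standard genetic code, and that an initiating ATG, GTG, or TTG codon is read as methionine (a subset of the start codons table 11 permits). The names `CODON_TABLE`, `ASSUMED_START_CODONS`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only these alternative start codons are handled here.
ASSUMED_START_CODONS = {"ATG", "GTG", "TTG"}


def translate(sequence: str) -> str:
    """Translate a CDS, reading the first codon as Met and stopping at a stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in ASSUMED_START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence
print(translate("GTGGCCGGCG CGATTTTGAA GGATCGGCTG"))  # MAGAILKDRL
```

The output matches the first ten residues of the protein sequence, including the GTG start codon being read as M.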