Gene Msil_2000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2000 
Symbol 
ID7094198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2170217 
End bp2171473 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content59% 
IMG OID643465326 
Producthypothetical protein 
Protein accessionYP_002362304 
Protein GI217978157 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGACC GTTCAGTATC GGGTAAGACG CGGCGCCCGA ACGCGTCCAC GGCAGGCGCC 
CTCGCGCTTT GCGTCGCGCT CGGGTTCAGC GCCATAGATC TTACGAAGGC GGAGGCGGGA
GAGACCATCG ATCTCGGCGA CGGCAGATCG TTCACGATCG GCGCCGGCCT CAGGACGAGC
TTCGGCTCGG TTTCATCCTT TGCGCCTGGC GGCTACGGCA ACGGAACGAC GGCGCAATAT
AATCTCGACA GCTTCCGAAT TTACACCGGC GCGACGCTGA ATGAATATAT CAAGGCGACG
TTCAATACGG AACGCTCCTA TGGCAACGGG CCGATCGGAG TGCTCGACGC CTATGTGCAG
TTCGAGCCGA TGAACGAAGT CAACGTCTGG GTCGGCCAGA TGCTGCCGCC AAGCGATCGA
GCCAATCTCG ACGGCCCCTA TTATCTAAGC GAGTGGTATT ACCCGGGCGT CGTATCGCAA
TATCCCTCGC GCTTCTATGG GCGCGATCTC GGCGGAACCG TGTGGGGCAA ACTGTTCGAC
AAAAAGCTGG TCTATTCCGT TGGCGTCTTC GCGGGCCATA ATCTTGCAAC CTACAATGGC
GTGCCAGGCC CCGGCGTCGA TCCCACGACC TTCGGCTTCT TTGGTCCATC GAATCAGGCG
CATAATCCGC TTTTCGCCGG CCGCGTCGTG TATAATTTCT GGGACCCGGA ACCCGACCCC
GCCTATTACG AAGCCAGCAC CTACTATGGC AAGGTCGACG TCCTCTCGAT CGGCGTCGCC
GGCATGTTCC AGCAGGATGG GGTCGGGACC AGCTTCAACT CCGCAAATTA TGGAGCCTGG
AACGTCGACG GCCTGATGGA GAAGAAGCTT GGCGACTATG GCGTGATCAC GCTGGAAGGC
GCCTATTATA ATTACAACAC TGGCGGCATC GTTGACGTTC CGCCTAACTA CAATAACGCC
GGCCTCACCG CGAATATCGG CGGTGTCACG CAGGGCAACG GCTATCTCGC GAGCGCCGCC
TACCTTATTC CCTATACGTT CGGCTATGGG ATCGTTCAGG GACAGTTTCA GCCCTACGCC
CGTTACCAGC ACTTTGACGC CACCGTTCTC GAGACATGGC AGTCGCAGAT CGATTTCGGC
GTAAACTATG TGATCAAGCC GCATGATCTG GTCGTTACGC TGGATTGCGC GCTGAATTCG
GCGAGCAACA CGCATAGCGG CACGCGGGTG ACGCTCGGCC TGCAGGTGCA GCTCTAA
 
Protein sequence
MKDRSVSGKT RRPNASTAGA LALCVALGFS AIDLTKAEAG ETIDLGDGRS FTIGAGLRTS 
FGSVSSFAPG GYGNGTTAQY NLDSFRIYTG ATLNEYIKAT FNTERSYGNG PIGVLDAYVQ
FEPMNEVNVW VGQMLPPSDR ANLDGPYYLS EWYYPGVVSQ YPSRFYGRDL GGTVWGKLFD
KKLVYSVGVF AGHNLATYNG VPGPGVDPTT FGFFGPSNQA HNPLFAGRVV YNFWDPEPDP
AYYEASTYYG KVDVLSIGVA GMFQQDGVGT SFNSANYGAW NVDGLMEKKL GDYGVITLEG
AYYNYNTGGI VDVPPNYNNA GLTANIGGVT QGNGYLASAA YLIPYTFGYG IVQGQFQPYA
RYQHFDATVL ETWQSQIDFG VNYVIKPHDL VVTLDCALNS ASNTHSGTRV TLGLQVQL