Gene Msil_2456 details


Gene Information

Locus tag: Msil_2456
Symbol:
ID: 7094008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2678842
End bp: 2682072
Gene Length: 3231 bp
Protein Length: 1076 aa
Translation table: 11
GC content: 63%
IMG OID: 643465777
Product: hypothetical protein
Protein accession: YP_002362747
Protein GI: 217978600
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3459] Cellobiose phosphorylase
TIGRFAM ID:
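
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2678842
END_BP = 2682072
GENE_LENGTH_BP = 3231
PROTEIN_LENGTH_AA = 1076


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 2682072 - 2678842 + 1 = 3231 bp, and 3231 / 3 - 1 = 1076 aa.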


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGGCA TTGAGGCGCG CCGCTGGACA ACGCCGCGCG ATGAGAACCT TGGCCTTCGC 
CGAATTTCGA ACGCTTGCGG CCTGTCCGTC GCCGCGCTGC CGAATGGCGC GCTGTTTGCG
ATCGAGCATC AAGGCGCCGG CGCGCCGGTC CTCGTCAATC AGACGCTCGG CTCGCCCGTC
GCAGGCGCGA TCTCCCGCAT CTATCTGCGG ATCGAGGGCG AAGAGCCGAT CGAGATAACC
GGCCCGGGCG CGGCGATGTT TGGGCAAGGC GCCGACCGCT TTGTCTGGGA CAGCCAATCG
CATGGAATTT TTCGCCGCGT CACGCTGCTC TTGCAGCCGA GCGCCAGCGC CTGGATCTGG
CGCCTCGATG TCGAGAACCG CGGCGGCGAG CCGCGCGCGA TGGATGCGAT TTTAATCCAG
GACATCGGCC TTGGGCCGCG CGGTTTCCTC ATGAACAATG AGGCCTATGT CTCGCAATAT
GTCGATCATT TTATCAGCCC GCATGAGCGT TTCGGCCCTG TTGTGATGAG CCGCCAGAAT
CTCGCGCAGG ACGGGTTCTG TCCCTTCGCC ATGCATGGGT GTTTTGACGG CGCCGCCGGC
TTTGCGACGG ACGGCAAGCA GCTGTTTGGC CCCGCCTTCC GCGACTCCGA CTTGTTTCGT
TTTCCTTATG GAACGGACCT GCCGAACCAG CGGCTGCAAC ATGAGATGGC CTGCGTCGCG
ATCCAGTCGA CGCCGCTGCG GCTCGGCGCG GGCGCGCGTG AATCCCGAAC GTTTTTTGGG
CTGTTCGCGC CGGATCAGCC ACAGGCCTCA AGCGACGTGG ATCTTGCGCG CATCGATGCG
ATCGACTGGC GCGCGCCGGA TTTTTCCGAG GCGCCGCTCA TGGCGCGCCC GGCGCGCAGC
ATAGTTGAAG CCGCGCCAGT TGCGATCGCC GACGCCCTCG ACGATGACGA GATCGCGCGC
CTCTACCCAG ACCGCTTCGA GGAGGAGTGG ATTGACGGCC GGCTGATGTC TTTCTTCACG
CCGGACGGGC CGCATAATCG GCATGTCGTG TTGCGCGACA AGGAGCGGAT CGTGACGCGC
CGCCATGGCG CGCTGATGCG CTCCGGCGCC GCCATGCTGC CAGATGATTC GACGCTCTGC
GCGACGGCCT GGATGCATGG CGTATTCGCG GCGCAGCTGA CAATCGGCAA CACCTCATTC
CACAAGCTGT TTTCGGTCTC GCGCGATCCC TATAATGTCA CGCGCGGAAG CGGCCTGCGC
ATGCTCGCCG AGATCGATGG ACGTTGGCGC CTGCTCGCGA CGCCATCGGC TTTCGAGATC
GGGCTCAATG ACTGCCGCTG GATCTACAGG ATCGGCGCGC GAACGATCAT CGTATCTGCG
ATCGTGTCCG GCGCAGATTC GGCGATGGCG TTTGCGCTCG ATTTCGAAGG CGAGCCCTGC
CGCTTCCTTG TTTTTGGCCA TCTCGCTTTA GGCGAGCGCG AATATGATCA TCATAGCCGG
GTCGAAATCG ACCCTGTTTC GATGCGCGTG ACGCTGCGCC CCGATCCGGA CTGGCTGTGG
GGCCAGCGCT ATCCTCAAGC CGCCTATCAT ATCGTCAGCC CAGAGCCCGC TGACATCGCG
GCGATCGGCG GCGACGAATT ATTATATGAG GATGCTCATC AATCCGACGG TTCCTGCGTC
GCGCTCCGCA CGAGGCCGGT CACAGCCTTC TCCTTCGCTG TCGTCGGCTC AATGACGAGC
GGCGCCGAAG CCAAGATCCT CGCCGAAAAA TATGCGCGCC GGATTTTGCG CGAAGAACTT
CTGGAGGCCG CGCAAGAATT CTGGAAACGC ATCACCCGCG GCGCTCGCGT CCAGGGCGAT
CGCGCGCTGG ATACGCTGAT CCCCTGGCTT GCGCATGACG CGATGATCCA CCTTACCGTT
CCGCACGGGC TCGAACAATA TACCGGCGCG GCCTGGGGCG TGCGCGACGT CTGCCAGGGT
CCTGTCGAGC TGTTGCTGGC GCTGCGTCAC GACGAGCCAG TCAAGGAGAT TTTGCGCCTC
GTCTTCGCCC AGCAATATGC AGAGACAGGC GATTGGCCGC AATGGTTCAT GCTCGATCCC
TATGCGATCA TTCAGGATCG CGTCAGCCAT GGCGATGTAA TCATATGGCC GCTGAAAGCC
GTCAACGACT ACATCGAGGC GACGGGCGAT TTCGCATTTC TTGACGAGCA TATCGTGTGG
CGCGGCAGGG AGGATCTCGA GCGCACCCCG CGCAAGGACA GCGTTCTCCG CCATATCGAG
AAACTGATCG AGGCCATTCG CGCGCGCTTC GTGCCGGGAA CGCATCTCCT CTCCTATGGC
CACGGCGACT GGAACGATTC GCTACAACCC GTCGATCCGA CGATGCGCGA CATTATGGCG
TCGAGCTGGA CGGTCAGCCT GCTCTATCAA CAACTGGGCC GCTACGCCGC CATCCTTGAA
AAAACCGCGC GCGCCGGCGA AGCGACCTCG CTCACCGAAC TCGCGGACGC TATGCGGGCC
GACTTCAACG CCTTGTTGGT CGGCGATGGA ATCGTGGCCG GCTATGGCGT CTTCGAGCCC
GGCGCTTCCG GGCCGGAATT GCTGCTGCAT CCGCGCGACG CGCGCACCGG CCTGCATTAT
TCGCTGCTGC CGATGACGCG CTCGATCATC GGCGATCTGT TTACGCCAGA TCAGGCGGAA
CACCATGTTC GCATTATCAA CGACCATCTT CTGTTTGCCG ATGGAGCCCG GCTGATGGAC
CGCCCCGTTC CCTATCACGG CGGTCCGCAG AGCATTTTTC GCCGCGCCGA ATCGGCGAGT
TTTTTTGGCC GCGAAATCGG GCTCATGTAT GTACATTCCC ATCTACGCTT CGGCGAGGCG
CTGGCCCACC GCGGGGATCT CGACGGGCTT TCCGACGCGC TTGGCGCGGT AAATCCCATC
TCGATCGGCG ACAGGCTTGA AAGCGCCCTG CCGCGCCAGC GCAACGCCTT CTTCTCCAGC
AGCGACGCCG CTTTTGCCGA CCGCGCTTCG GCCAGCGCCG ATTGGGATCG TCTCAAGCGC
GGCGAAATCG GCCTCGAAGG CGGCTGGCGG ATCTATTCGA GCGGACCTGG CATCTTCATG
AATCTGTTGA TCCGCCACGG TTTCGGGCGC CAGAGGCTTT GGGGACGCGA GACGCCGCAA
CAAGGCTGGA GCCACGCCAC GCTCGAATGG GATCTGGACA CGACAGCGTA G
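
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch, with hypothetical helper names, of how the blocks could be joined and a GC percentage computed for comparison with the "GC content" field. Only the first line of the sequence is used as sample input here.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10 bases).
first_line = "ATGACCGGCA TTGAGGCGCG CCGCTGGACA ACGCCGCGCG ATGAGAACCT TGGCCTTCGC"

if __name__ == "__main__":
    seq = clean_sequence(first_line)
    print(len(seq))  # 60
    print(f"GC content of sample: {gc_fraction(seq):.0%}")
```

Passing the full sequence text to `clean_sequence` instead of the sample line would give a whole-gene value to compare against the reported figure.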
 
Protein sequence
MTGIEARRWT TPRDENLGLR RISNACGLSV AALPNGALFA IEHQGAGAPV LVNQTLGSPV 
AGAISRIYLR IEGEEPIEIT GPGAAMFGQG ADRFVWDSQS HGIFRRVTLL LQPSASAWIW
RLDVENRGGE PRAMDAILIQ DIGLGPRGFL MNNEAYVSQY VDHFISPHER FGPVVMSRQN
LAQDGFCPFA MHGCFDGAAG FATDGKQLFG PAFRDSDLFR FPYGTDLPNQ RLQHEMACVA
IQSTPLRLGA GARESRTFFG LFAPDQPQAS SDVDLARIDA IDWRAPDFSE APLMARPARS
IVEAAPVAIA DALDDDEIAR LYPDRFEEEW IDGRLMSFFT PDGPHNRHVV LRDKERIVTR
RHGALMRSGA AMLPDDSTLC ATAWMHGVFA AQLTIGNTSF HKLFSVSRDP YNVTRGSGLR
MLAEIDGRWR LLATPSAFEI GLNDCRWIYR IGARTIIVSA IVSGADSAMA FALDFEGEPC
RFLVFGHLAL GEREYDHHSR VEIDPVSMRV TLRPDPDWLW GQRYPQAAYH IVSPEPADIA
AIGGDELLYE DAHQSDGSCV ALRTRPVTAF SFAVVGSMTS GAEAKILAEK YARRILREEL
LEAAQEFWKR ITRGARVQGD RALDTLIPWL AHDAMIHLTV PHGLEQYTGA AWGVRDVCQG
PVELLLALRH DEPVKEILRL VFAQQYAETG DWPQWFMLDP YAIIQDRVSH GDVIIWPLKA
VNDYIEATGD FAFLDEHIVW RGREDLERTP RKDSVLRHIE KLIEAIRARF VPGTHLLSYG
HGDWNDSLQP VDPTMRDIMA SSWTVSLLYQ QLGRYAAILE KTARAGEATS LTELADAMRA
DFNALLVGDG IVAGYGVFEP GASGPELLLH PRDARTGLHY SLLPMTRSII GDLFTPDQAE
HHVRIINDHL LFADGARLMD RPVPYHGGPQ SIFRRAESAS FFGREIGLMY VHSHLRFGEA
LAHRGDLDGL SDALGAVNPI SIGDRLESAL PRQRNAFFSS SDAAFADRAS ASADWDRLKR
GEIGLEGGWR IYSSGPGIFM NLLIRHGFGR QRLWGRETPQ QGWSHATLEW DLDTTA
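
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration under stated assumptions rather than the database's own method: it uses the standard codon assignments, which translation table 11 shares for internal codons, and it does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that case does not arise here). The function and constant names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored; codons with unknown bases become 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as displayed in this record.
first_line = "ATGACCGGCA TTGAGGCGCG CCGCTGGACA ACGCCGCGCG ATGAGAACCT TGGCCTTCGC"

if __name__ == "__main__":
    print(translate(first_line))  # MTGIEARRWTTPRDENLGLR
```

The 20 residues produced from the first 60 bases match the first two blocks of the protein sequence listed above (MTGIEARRWT TPRDENLGLR).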