Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_2456 |
Symbol | |
ID | 7094008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 2678842 |
End bp | 2682072 |
Gene Length | 3231 bp |
Protein Length | 1076 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643465777 |
Product | hypothetical protein |
Protein accession | YP_002362747 |
Protein GI | 217978600 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGCA TTGAGGCGCG CCGCTGGACA ACGCCGCGCG ATGAGAACCT TGGCCTTCGC CGAATTTCGA ACGCTTGCGG CCTGTCCGTC GCCGCGCTGC CGAATGGCGC GCTGTTTGCG ATCGAGCATC AAGGCGCCGG CGCGCCGGTC CTCGTCAATC AGACGCTCGG CTCGCCCGTC GCAGGCGCGA TCTCCCGCAT CTATCTGCGG ATCGAGGGCG AAGAGCCGAT CGAGATAACC GGCCCGGGCG CGGCGATGTT TGGGCAAGGC GCCGACCGCT TTGTCTGGGA CAGCCAATCG CATGGAATTT TTCGCCGCGT CACGCTGCTC TTGCAGCCGA GCGCCAGCGC CTGGATCTGG CGCCTCGATG TCGAGAACCG CGGCGGCGAG CCGCGCGCGA TGGATGCGAT TTTAATCCAG GACATCGGCC TTGGGCCGCG CGGTTTCCTC ATGAACAATG AGGCCTATGT CTCGCAATAT GTCGATCATT TTATCAGCCC GCATGAGCGT TTCGGCCCTG TTGTGATGAG CCGCCAGAAT CTCGCGCAGG ACGGGTTCTG TCCCTTCGCC ATGCATGGGT GTTTTGACGG CGCCGCCGGC TTTGCGACGG ACGGCAAGCA GCTGTTTGGC CCCGCCTTCC GCGACTCCGA CTTGTTTCGT TTTCCTTATG GAACGGACCT GCCGAACCAG CGGCTGCAAC ATGAGATGGC CTGCGTCGCG ATCCAGTCGA CGCCGCTGCG GCTCGGCGCG GGCGCGCGTG AATCCCGAAC GTTTTTTGGG CTGTTCGCGC CGGATCAGCC ACAGGCCTCA AGCGACGTGG ATCTTGCGCG CATCGATGCG ATCGACTGGC GCGCGCCGGA TTTTTCCGAG GCGCCGCTCA TGGCGCGCCC GGCGCGCAGC ATAGTTGAAG CCGCGCCAGT TGCGATCGCC GACGCCCTCG ACGATGACGA GATCGCGCGC CTCTACCCAG ACCGCTTCGA GGAGGAGTGG ATTGACGGCC GGCTGATGTC TTTCTTCACG CCGGACGGGC CGCATAATCG GCATGTCGTG TTGCGCGACA AGGAGCGGAT CGTGACGCGC CGCCATGGCG CGCTGATGCG CTCCGGCGCC GCCATGCTGC CAGATGATTC GACGCTCTGC GCGACGGCCT GGATGCATGG CGTATTCGCG GCGCAGCTGA CAATCGGCAA CACCTCATTC CACAAGCTGT TTTCGGTCTC GCGCGATCCC TATAATGTCA CGCGCGGAAG CGGCCTGCGC ATGCTCGCCG AGATCGATGG ACGTTGGCGC CTGCTCGCGA CGCCATCGGC TTTCGAGATC GGGCTCAATG ACTGCCGCTG GATCTACAGG ATCGGCGCGC GAACGATCAT CGTATCTGCG ATCGTGTCCG GCGCAGATTC GGCGATGGCG TTTGCGCTCG ATTTCGAAGG CGAGCCCTGC CGCTTCCTTG TTTTTGGCCA TCTCGCTTTA GGCGAGCGCG AATATGATCA TCATAGCCGG GTCGAAATCG ACCCTGTTTC GATGCGCGTG ACGCTGCGCC CCGATCCGGA CTGGCTGTGG GGCCAGCGCT ATCCTCAAGC CGCCTATCAT ATCGTCAGCC CAGAGCCCGC TGACATCGCG GCGATCGGCG GCGACGAATT ATTATATGAG GATGCTCATC AATCCGACGG TTCCTGCGTC GCGCTCCGCA CGAGGCCGGT CACAGCCTTC TCCTTCGCTG TCGTCGGCTC AATGACGAGC GGCGCCGAAG CCAAGATCCT CGCCGAAAAA TATGCGCGCC GGATTTTGCG CGAAGAACTT CTGGAGGCCG CGCAAGAATT CTGGAAACGC ATCACCCGCG GCGCTCGCGT CCAGGGCGAT CGCGCGCTGG ATACGCTGAT CCCCTGGCTT GCGCATGACG CGATGATCCA CCTTACCGTT CCGCACGGGC TCGAACAATA TACCGGCGCG GCCTGGGGCG TGCGCGACGT CTGCCAGGGT CCTGTCGAGC TGTTGCTGGC GCTGCGTCAC GACGAGCCAG TCAAGGAGAT TTTGCGCCTC GTCTTCGCCC AGCAATATGC AGAGACAGGC GATTGGCCGC AATGGTTCAT GCTCGATCCC TATGCGATCA TTCAGGATCG CGTCAGCCAT GGCGATGTAA TCATATGGCC GCTGAAAGCC GTCAACGACT ACATCGAGGC GACGGGCGAT TTCGCATTTC TTGACGAGCA TATCGTGTGG CGCGGCAGGG AGGATCTCGA GCGCACCCCG CGCAAGGACA GCGTTCTCCG CCATATCGAG AAACTGATCG AGGCCATTCG CGCGCGCTTC GTGCCGGGAA CGCATCTCCT CTCCTATGGC CACGGCGACT GGAACGATTC GCTACAACCC GTCGATCCGA CGATGCGCGA CATTATGGCG TCGAGCTGGA CGGTCAGCCT GCTCTATCAA CAACTGGGCC GCTACGCCGC CATCCTTGAA AAAACCGCGC GCGCCGGCGA AGCGACCTCG CTCACCGAAC TCGCGGACGC TATGCGGGCC GACTTCAACG CCTTGTTGGT CGGCGATGGA ATCGTGGCCG GCTATGGCGT CTTCGAGCCC GGCGCTTCCG GGCCGGAATT GCTGCTGCAT CCGCGCGACG CGCGCACCGG CCTGCATTAT TCGCTGCTGC CGATGACGCG CTCGATCATC GGCGATCTGT TTACGCCAGA TCAGGCGGAA CACCATGTTC GCATTATCAA CGACCATCTT CTGTTTGCCG ATGGAGCCCG GCTGATGGAC CGCCCCGTTC CCTATCACGG CGGTCCGCAG AGCATTTTTC GCCGCGCCGA ATCGGCGAGT TTTTTTGGCC GCGAAATCGG GCTCATGTAT GTACATTCCC ATCTACGCTT CGGCGAGGCG CTGGCCCACC GCGGGGATCT CGACGGGCTT TCCGACGCGC TTGGCGCGGT AAATCCCATC TCGATCGGCG ACAGGCTTGA AAGCGCCCTG CCGCGCCAGC GCAACGCCTT CTTCTCCAGC AGCGACGCCG CTTTTGCCGA CCGCGCTTCG GCCAGCGCCG ATTGGGATCG TCTCAAGCGC GGCGAAATCG GCCTCGAAGG CGGCTGGCGG ATCTATTCGA GCGGACCTGG CATCTTCATG AATCTGTTGA TCCGCCACGG TTTCGGGCGC CAGAGGCTTT GGGGACGCGA GACGCCGCAA CAAGGCTGGA GCCACGCCAC GCTCGAATGG GATCTGGACA CGACAGCGTA G
|
Protein sequence | MTGIEARRWT TPRDENLGLR RISNACGLSV AALPNGALFA IEHQGAGAPV LVNQTLGSPV AGAISRIYLR IEGEEPIEIT GPGAAMFGQG ADRFVWDSQS HGIFRRVTLL LQPSASAWIW RLDVENRGGE PRAMDAILIQ DIGLGPRGFL MNNEAYVSQY VDHFISPHER FGPVVMSRQN LAQDGFCPFA MHGCFDGAAG FATDGKQLFG PAFRDSDLFR FPYGTDLPNQ RLQHEMACVA IQSTPLRLGA GARESRTFFG LFAPDQPQAS SDVDLARIDA IDWRAPDFSE APLMARPARS IVEAAPVAIA DALDDDEIAR LYPDRFEEEW IDGRLMSFFT PDGPHNRHVV LRDKERIVTR RHGALMRSGA AMLPDDSTLC ATAWMHGVFA AQLTIGNTSF HKLFSVSRDP YNVTRGSGLR MLAEIDGRWR LLATPSAFEI GLNDCRWIYR IGARTIIVSA IVSGADSAMA FALDFEGEPC RFLVFGHLAL GEREYDHHSR VEIDPVSMRV TLRPDPDWLW GQRYPQAAYH IVSPEPADIA AIGGDELLYE DAHQSDGSCV ALRTRPVTAF SFAVVGSMTS GAEAKILAEK YARRILREEL LEAAQEFWKR ITRGARVQGD RALDTLIPWL AHDAMIHLTV PHGLEQYTGA AWGVRDVCQG PVELLLALRH DEPVKEILRL VFAQQYAETG DWPQWFMLDP YAIIQDRVSH GDVIIWPLKA VNDYIEATGD FAFLDEHIVW RGREDLERTP RKDSVLRHIE KLIEAIRARF VPGTHLLSYG HGDWNDSLQP VDPTMRDIMA SSWTVSLLYQ QLGRYAAILE KTARAGEATS LTELADAMRA DFNALLVGDG IVAGYGVFEP GASGPELLLH PRDARTGLHY SLLPMTRSII GDLFTPDQAE HHVRIINDHL LFADGARLMD RPVPYHGGPQ SIFRRAESAS FFGREIGLMY VHSHLRFGEA LAHRGDLDGL SDALGAVNPI SIGDRLESAL PRQRNAFFSS SDAAFADRAS ASADWDRLKR GEIGLEGGWR IYSSGPGIFM NLLIRHGFGR QRLWGRETPQ QGWSHATLEW DLDTTA
|
| |