Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_2725 |
Symbol | |
ID | 4286090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 2990708 |
End bp | 2992543 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638142224 |
Product | cellulose 1,4-beta-cellobiosidase |
Protein accession | YP_757949 |
Protein GI | 114571269 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.451738 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.174242 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGTCA CACTTATCAG CCTGACCCAG GCATTGGCCG CAGCAGGTTG CGACATTCCC GCTGGTGAAA CACCGATCCT GTTCAACCAG ACCGGCTTTG AGTCGGGTGC CGTGAAACTT GCGGTCTTGC GCAATGCCGC GACAGAACCG GCCCGCTGGG AAATCCGTGA CGCGAGCGGC AGCGTCCGGT TGAACGGCGA GACCATCGTC CACGGGTTGG ATGCCAGCTC GGGCGATTTT GTTCACCAGA TCCGCTTTGA TGCCTTGACC GACCAGGGGG AAGATTACCA ACTCACTGTG TGCGGCGAGG AGAGCCGACC GTTCACGATC GCCGACGAGC CCTATGGCCG ATTGGCGGAA GACGCGCTGC GCTATTTCTA TCTCAACCGG CTGGGGACCG AATTGCAGCC GGAATTTACC GGCGGGGCGA TGTGGGCGCG CGCGGCCGGG TTCATTGACA GCCATCCGAC CTGCTTTTTC GGCCCGGACC AAGAGGGAAC CGAGTGGCCG GGCTGTAGCT ACACGCTCGA AACGACCGGT GGCTGGGCCG ATGCTGGGGA TTACGGCCAG TATGTGGTCA ATGGAGGCAT CTCAGTCTGG ACCCTGCAAC ATGCCTACGA ACGCCTGGCG GCACGCGGTC AACTGGCCAC ATTGGGCTGG ACCGGTGAAC GCGTAGCACT GCCCCAAGAG CAGGAGACCA TCAGTGAAAT CCTGCTGGAG GCGCGCTGGC ACCTGGACTG GATGCTGACG ATGCAGATCC CCGACGGCGA GACCGTTTGG GTCGATCAGC GGTTACCCGG GGCCGACACG GCAGACCTTC AGGAGATTGA TGGCTCGGGC CTGGTCCATC ACAAGCTCCA TGAACGGGCC TGGCTGCCTT TGCCCTTATT GCCACAAGAC GCGGATGAAG AACGCTTTCT CATACCCCCA TCTACGGCTG CAAGCCTGAA CCTGGCCGCA GCCGCGGCCC AAGCTTCGCG TCTGTGGGAA GAACTTGATC CTGACTATGC CGTGCGCACG CTCGAGGCCG CGCGTCGCGC CTATACTGCC GCTCAACGCA ATCCGGACCT TCTGGCCCGG AACAATTTCG ATGGCGGTGG CGCCTATGGC GATACCGATC TTCTGGATGA GTTTGCCTGG GCCGCGGCAG AGCTCTTTCT GACGACCGGA GACACAGGCT ACGCAGATGA CCTTGCGGCG GCTCTGACCA ATGACGCCTA TCAGCCCTCG CCCATATTCT GGGCCAATAC AGGCCTGCTG GCGGACCTGT CCCTTGCACT GAGTGACGAG CCCTCGCCTG AGCGGGCAGA CGCGGCTGAC CGCCTGGTCG CGGAGGCCAA TCGATATCTC GAAGAGGCGG ATCGCGAAGG ATACCACTTC CCGATGCCTG CGAGTGAAAT GAACTGGGGG TCGAACGCCA ACCTCCTCAA TCGGGGACTG GTCCTTGCCA GCGCCCACGA CCTCACCGGG GACGAACGAT ACCGCAATGG CGCAGTCCAT GCCCTGGACT ATGTATTGGG CCGCAACGCC CTGGATCAGA GTTATGTGTC CGGCCACGGA GTCCGCAGCA TGCAGACACC GCATCACCGC TTTTGGGCAC GTGGTGCCGA TCCGGAATTC CCGCCAGCCC CGGCCGGGGC CCTGTCGGGC GGCGCCAATC ATCTGAACAT GGCAGATCCG GTTGCGATGG AAATGCGCGG AACCTGCGCG CCCCAACGCT GCTGGGCCGA TCATGTCGAT GCATTCGCTC TCAACGAGGT CGCCATCAAC TGGAATGCGC CCCTGTTTGC GCTGGCGGCT CAACTGGAGA GCACCACCGC GCAAGCCTCT GAATGA
|
Protein sequence | MLVTLISLTQ ALAAAGCDIP AGETPILFNQ TGFESGAVKL AVLRNAATEP ARWEIRDASG SVRLNGETIV HGLDASSGDF VHQIRFDALT DQGEDYQLTV CGEESRPFTI ADEPYGRLAE DALRYFYLNR LGTELQPEFT GGAMWARAAG FIDSHPTCFF GPDQEGTEWP GCSYTLETTG GWADAGDYGQ YVVNGGISVW TLQHAYERLA ARGQLATLGW TGERVALPQE QETISEILLE ARWHLDWMLT MQIPDGETVW VDQRLPGADT ADLQEIDGSG LVHHKLHERA WLPLPLLPQD ADEERFLIPP STAASLNLAA AAAQASRLWE ELDPDYAVRT LEAARRAYTA AQRNPDLLAR NNFDGGGAYG DTDLLDEFAW AAAELFLTTG DTGYADDLAA ALTNDAYQPS PIFWANTGLL ADLSLALSDE PSPERADAAD RLVAEANRYL EEADREGYHF PMPASEMNWG SNANLLNRGL VLASAHDLTG DERYRNGAVH ALDYVLGRNA LDQSYVSGHG VRSMQTPHHR FWARGADPEF PPAPAGALSG GANHLNMADP VAMEMRGTCA PQRCWADHVD AFALNEVAIN WNAPLFALAA QLESTTAQAS E
|
| |