Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA1586 |
Symbol | |
ID | 3104909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 1689904 |
End bp | 1693314 |
Gene Length | 3411 bp |
Protein Length | 1136 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637170754 |
Product | hypothetical protein |
Protein accession | YP_114036 |
Protein GI | 53804074 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.707632 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCCAA AGCATCCGAT GACGGCGAGG CCGGCGATCC GCCTGGAAAG CCCTTCCGGC CTGATCTTCC AGCTCACATC CAAGGGTTCG GTCCGGCGCA TGGATCACCG CGACATTCTC TTGAACCTGT TCCCCGGCTC GGAGGCCGAG GGCGGACCGG CCAACCTCTA TCTGCGCCGC CTCTCGGAGC CTCCCCTGGG CGTGCCCGAA GGTCCCGCGG AGGCCGTGCC GCTGCTCGGT CCGCGCAGTC CCGGCCGCAT ACTGTGCGAC GGACGAGGGC TATCGATCGA GGGCGAATGG GCGGGCATCC GGTTCGGCGT CTTCCTGGCG CTGGCCGAAA CCGCGCCCGC CTGGTTCTGG CACGTGGCCC TGGAGAACAC CGGCGGTACC GGAGAGACGG TGGAGTTGCT CTATGCCCAG GACCTGGGAC TGGCGCACTA TGGCGCGGTG CGGCTCAACG AGTACTACGT CAGTCAGTAC CTGGACCACA CGCCGCTCCC TCACCCCTCA CGGGGTACGG TGCTGGCTAC CCGCCAGAAC CAGGCGATGG GCGGCCGTTT TCCCTGGGTC ATCATCGGTT CGCTGAACCG GGCCAGGAGT TTCGCTACCG ACGCACTGCA GTTCTACGGT CTGGAGCGCC GTGCCGGCCG ACCACCGCGG GGCCTGGTGG AAGGACTCCC CGGCTCGCGC CGCCAGCACG AACACGCCAT GGCCGTGATC CAGGACGCCC CGCTGAAACT GGCGCCGGGC GAGGCAGCGG CGGTGGGTTT CTTCGGCTGG TTCGAACCGG ATCACCCGGA GGCGACATCC GCCGCTGACC TGGCCTTCGT CGACCGGGCC CTGAACCTGC CGGAAGCGGC GCCTCCGCCG GCAAGACGAA ATCGGTCCGA AGGTTTCACG CCCCCCGCCA GTCTGTTCAG CGCCGCACCG CTCCTCGATG CCCGCGACCT CGGCGATGCC GAGGTCACCG GGCTGTTCGG CGGCGAAAGA CGGGAGCCGG AGCTGGAGCA TGGCCGCCTG CTGTCCTTCT TCACCGGCGA CCGCAGCCAC GTCGTGCTCA GGGCCAAGGA ACTCGAAGTG CTGCGTCCCC ACGGCCACAT CCTCCGCACC GGCAACGGCC TGGTGCCCGA CGAGGCCGGC CTGACCTCCA CCGTCTGGAT GGCCGGCGTG TTCCATTCGA TGGTGACCCA GGGTCATGTG AGCATCAACC GCTTCCTGTC CACCACCCAC GGTTACCTGG GCCTGTTCCG GGGGCACGGC CAACGGCTGT TCGTCGAGAT CGACGGTCGC TGGCATCTGT TGGATGTGTC TTCCGCCTTC GAAATGCGGC CGGAGGGTTG CCGCTGGATC TACAAACACG CCGGCGGCAT GCTGCAGGTG CGCAGCGAGG CCGCCACCGG GAGCCATGAA CTCTCGGTAA CCCTGGACGT GCTGGAAGGG CCGCCCGTCC GCTGCCTGCT CAGCAACCAC GTGGCCCTCA ACGGCGACGA CGGCGCCGAG GCCGTGCCGG CGCGGTTCGT CCGCGACGGA ATGGGAGTGT TCGTGCACCC CATCCCTGAG TCCGACCTCG GCCGCCGATT CCCGAATGGC GGCTTCCGCA TCGATCCGCT GCCCGGCACT CCGCTCGAGA CCGTCGGGGG GGACGAGCTG CTGTACGCCG ATGGCCAGTC CCGGGGAGAA CCCTTCCTGT GCCTGGTCAC GGCCCCGGTC TCGTCGGCCG GCTTCCGCAT CACCGGCTGT CTGGTGCCGG CGGCTCCGGC CACGGCCGAA GCCGGCGCGG ACCGGTATTG GCGTGAAATG ACCGCAGCCT TGCGGGTCTG TCCGCCGGCC GGCAGTGCAT TGGGGGGAGA CATCGAGCGC CTGCGGGAGA TTCTGCCATG GTTCGCCCAT AACGCCCTGA TCCATTATCT CAGCCCGCGC GGGCTGGAGC AGTATTCCGG CGGCGGCTGG GGCGTCCGCG ACGTATGCCA GGGGCCGGTG GAAATGCTGC TGGCGCTGGG CCGGTTCGAA CCGGTCCGCG ACCTGCTGTG CCGGGTGTTC AAGAACCAGA ATCCGGACGG CGACTGGCCG CAGTGGTTCA TGTTCTTCGA GCGCGAGCGC AACATCCGCC CCGGCGACTC CCACGGCGAC ATCGTGTTCT GGCCGGTCCT GGCCCTCGCC CAATACCTGC TGGCCTCCGG AGACGGTGCG CTGCTGGATG AGGTTCTGCC CTTCTTCCAT CCGGAAGGCG GCGACAAGGC CGGGCGGGCG ACGCTCTGGG GGCACGTTGA GCGCGCGCTG GGGGTGGTGG CGGCGCGGAC GATTCCCGAC ACCCGGCTGG TCGCCTACGG CCATGGCGAT TGGAACGACT CGCTGCAGCC GGTGGATCCG GCCATGCGCG AACGGCTGTG CAGCGCCTGG ACCGTGACCC TGCATTACCA GACGCTGACC ACGCTGGCCG AAGCGCTGCG CCGCCTGGGG CGCGATGACC AGGCCGGGGC CTTCGAGGCC GCGGCCGCGG GAGTGCGGGA AGACTTCCAG CGCCTGCTGA TGGCCGACGG CGTCCTCGCC GGCTACGCCC ATTTCGGCGA GGACGGACGA ATCGCTCTTC TGGTGCATCC GCGCGACCGC GCCACCGGGC TTTCCTTCAG CCTGCTGCCG ATGATCCACG CCATCAGCAA CGGCCTGCTG ACGCCGGAAC AGGCGTCACG GCACCTGAGA CTCATCGAAA ACCATCTGCT CGGACCCGAC GGTGCCAGGC TGTTCGACCG GCCTCTGACC TACCGCGGCG GGCCACAAAA GTATTTCCAG CGCGCAGAGA GCAGCAGCTT CTTCGGCCGC GAAATCGGAC TGATGTACAC CCACGCCCAT CTCCGCTATG CGGAAGCACT GGCCCGTTAC GGCGATGCAG AGGGCTTCTT CGAAGCGCTG TGCAAAGCCA ATCCCATCGG TCTGCGCGCC CGGGTTCCCT CCGCCACCCC GCGCCAGGCC AACTGCTATC ACTCCAGTTC CGATGCCGCC TTTGCCGACC GCTATCAGGC GCAAGCCGAA TACGAACGGG TCCGGACAGG GGAAATCGCC CTCGACGGCG GCTGGCGGGT TTATTCGAGC GGCGCGGGCA TCGGGCTGGG ACTCATCTTG CGCGGCTTGC TGGGGTTGCG GCTGGAAAGC TCGAAGCTGG TGATCGACCC GGTCATACCG AAGGCATTGG ATGGCCTCCG GGTGGAGCTG GAACTCGCCG GCTCCGCCTT CGAAGTGGTC TACTCCATCC AGAGTTGCGG CTGCGGCCTC CTCTCAGTCA GCCTGAACGG CACCGAACTG CCGTTCCGCC GGACACCGAA TCCGTACCGG GTCGGCGGCG CCGAAGTCGC CCTGGACACA CTGACGACCC TGATGGAATG CCGCAACCGG CTGACCGTGA ACTTGCGTTG A
|
Protein sequence | MNPKHPMTAR PAIRLESPSG LIFQLTSKGS VRRMDHRDIL LNLFPGSEAE GGPANLYLRR LSEPPLGVPE GPAEAVPLLG PRSPGRILCD GRGLSIEGEW AGIRFGVFLA LAETAPAWFW HVALENTGGT GETVELLYAQ DLGLAHYGAV RLNEYYVSQY LDHTPLPHPS RGTVLATRQN QAMGGRFPWV IIGSLNRARS FATDALQFYG LERRAGRPPR GLVEGLPGSR RQHEHAMAVI QDAPLKLAPG EAAAVGFFGW FEPDHPEATS AADLAFVDRA LNLPEAAPPP ARRNRSEGFT PPASLFSAAP LLDARDLGDA EVTGLFGGER REPELEHGRL LSFFTGDRSH VVLRAKELEV LRPHGHILRT GNGLVPDEAG LTSTVWMAGV FHSMVTQGHV SINRFLSTTH GYLGLFRGHG QRLFVEIDGR WHLLDVSSAF EMRPEGCRWI YKHAGGMLQV RSEAATGSHE LSVTLDVLEG PPVRCLLSNH VALNGDDGAE AVPARFVRDG MGVFVHPIPE SDLGRRFPNG GFRIDPLPGT PLETVGGDEL LYADGQSRGE PFLCLVTAPV SSAGFRITGC LVPAAPATAE AGADRYWREM TAALRVCPPA GSALGGDIER LREILPWFAH NALIHYLSPR GLEQYSGGGW GVRDVCQGPV EMLLALGRFE PVRDLLCRVF KNQNPDGDWP QWFMFFERER NIRPGDSHGD IVFWPVLALA QYLLASGDGA LLDEVLPFFH PEGGDKAGRA TLWGHVERAL GVVAARTIPD TRLVAYGHGD WNDSLQPVDP AMRERLCSAW TVTLHYQTLT TLAEALRRLG RDDQAGAFEA AAAGVREDFQ RLLMADGVLA GYAHFGEDGR IALLVHPRDR ATGLSFSLLP MIHAISNGLL TPEQASRHLR LIENHLLGPD GARLFDRPLT YRGGPQKYFQ RAESSSFFGR EIGLMYTHAH LRYAEALARY GDAEGFFEAL CKANPIGLRA RVPSATPRQA NCYHSSSDAA FADRYQAQAE YERVRTGEIA LDGGWRVYSS GAGIGLGLIL RGLLGLRLES SKLVIDPVIP KALDGLRVEL ELAGSAFEVV YSIQSCGCGL LSVSLNGTEL PFRRTPNPYR VGGAEVALDT LTTLMECRNR LTVNLR
|
| |