Gene Information |
Locus tag | Mchl_3417 |
Symbol | |
ID | 7116064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | - |
Start bp | 3605316 |
End bp | 3607202 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643526152 |
Product | protein of unknown function DUF839 |
Protein accession | YP_002422165 |
Protein GI | 218531349 |
COG category | [R] General function prediction only |
COG ID | [COG3211] Predicted phosphatase |
TIGRFAM ID | |
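The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3605316, 3607202)
print(length)                  # 1887
print(protein_length(length))  # 628
```

Both values agree with the Gene Length and Protein Length fields of this record.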
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.556034 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCCGACA CGCCCGACGC TCCGACCCTC GATGCGGTGA TCGCGCAGCG CTTCTCCCGG CGCGACCTGA TGCGTGGCTC GCTGGCCGCC GCACTCGCCG CCGGAATCGC GCCGCAGACC GTCGCGGCCC CCGCCGATTC CTCCGCCTTC GACTTCCCCG AACTCGCTGC CGGAATCGAT GAGCACCTGC ACGTAGCCGA GGGCTACGAG GCGAAGCCGC TGCTGCGCTG GGGCGACCCG TTGTTTCCCG ACGCGCCGGC CTTCGATCCG CAAAAGCAGA GCGCCCGGGC TCAGGAGCGC CAGTTCGGCT ACAACAACGA CTTCGTCGGC TTCGTCCCGC TCGACAAAGA CGGCCAGCGT GGTCTCCTCG TGGTCAACCA CGAATATACC AATGCCGAGC TGATGTTTCC CGGCCTCGGG GCCGCCGACC GCAAGGCTGT GATCGCCGCC CTGAGCCCGG AACAGGTCGC GACCGAGATG GCCGCCCATG GCGGATCGGT GGTCGAGATC GTCCGCGAGG CGGAAGGCTG GCGCCCGGTG ATCGGCTCGC CCTACACGCG GCGCATCACG GCGGAGACCC CGATCGCCCT CACCGGCCCG GCGGCGGGTC ATCCCCGCCT CATGACCGAG GCCGACCCGA GCGGCCGGCG CGTGCTCGGC ATGATCAACA ACTGCTCGGG CGGTGTCACG CCCTGGGGCA CGTGGCTCTC CGGCGAGGAG AACATCAACT ACTACTTCTC CGGAACGCTC CCTGCCGGAC ATGCGGAGGC CGGCAACGCG AAAGCGATCG GCCTGAACAG CCCGCAATAT GCCTGGAGCC GTTTCCATCC GCGCTTCGAT CTCGCACAGG CGCCGAACGA GCCGAACCGG TTCGGCTGGG TGGTCGAGAT CGATCCGTTC GATCCGGCCT CGACGCCGAA GAAGCGCACG GCGCTCGGCC GGTTCAAGCA TGAGGGTGCG GCGGGCGCGC GTTCGCTCGA CGGGCGCTAC GTCGTCTATC TCGGCGACGA CGAGCGCTTC CAGCACGTCT ACCGCTTCGT CAGCGAGGGC CGGGTGCAGG CCGAGCGCGG GGACAACGCC GACCTCCTCG ATTCCGGCAC GCTCAGCGTC GCCCGGTTCG AGTCCGACGG CACCGGCCGC TGGCTGCCGC TGGTGCACGG CGCGAACGGA CTCGACGCGG GCAACGGTTT CGCCTCCCAG GCCGACGTGC TGATCGAGGC CCGCCGCGCC GCCAAGAGCC TCGGGGCCAC ACCGATGGAC CGGCCCGAGG ATATCGAGGC GAACCCGCGC AACGGCCGCG TCTACGTGAT GCTGACCAAC AACGGGAAGC GCACCGCCGA TCAAGAGGAG CCCGCCAACC CGCGCGGCCC CAACACCTTC GGCCACGTCA TCGAGATCAC CCCCGACGGC ACCGACCACG CCGCCGAGAC CTTCCGCTGG GAGGTGCTGG TGCGCTGTGG CGACCCGGCC AAGCCCGAGG TGAAGGCGAG CTTTTCCGCG CTCACCACGG AGAACGGGTG GTTCGGCATG CCCGACAACT GCACCGTCGA CGGGCGCGGC CGCCTCTGGA TCGCCACTGA CGGCAACAAC CGCGGCGCCA CCGGCCGGGC CGACGGCATC TGGGCGGTGG AAACGGAAGG CCCGCGCCGG GGCACCGCGC GCCACTTCCT GCGGGTGCCG GTGGGCGCCG AGATGTGCGG CCCCTGCTTC ACCCCCGACG ACGAGACCTT CTTCGTCGCC GTCCAGCATC CCGGCGAGCC CGACGAGGAG GGCGCCCTCG GCTCTTACGA GAAGCCCTCG 
ACCCGCTGGC CGGATTTTTC ACCCGACCTG CCGCCGCGGC CCTCCGTTGT GGCGGTGCGG CGGACGGGGG GCGGACGGAT CGGCTGA
Protein sequence | MSDTPDAPTL DAVIAQRFSR RDLMRGSLAA ALAAGIAPQT VAAPADSSAF DFPELAAGID EHLHVAEGYE AKPLLRWGDP LFPDAPAFDP QKQSARAQER QFGYNNDFVG FVPLDKDGQR GLLVVNHEYT NAELMFPGLG AADRKAVIAA LSPEQVATEM AAHGGSVVEI VREAEGWRPV IGSPYTRRIT AETPIALTGP AAGHPRLMTE ADPSGRRVLG MINNCSGGVT PWGTWLSGEE NINYYFSGTL PAGHAEAGNA KAIGLNSPQY AWSRFHPRFD LAQAPNEPNR FGWVVEIDPF DPASTPKKRT ALGRFKHEGA AGARSLDGRY VVYLGDDERF QHVYRFVSEG RVQAERGDNA DLLDSGTLSV ARFESDGTGR WLPLVHGANG LDAGNGFASQ ADVLIEARRA AKSLGATPMD RPEDIEANPR NGRVYVMLTN NGKRTADQEE PANPRGPNTF GHVIEITPDG TDHAAETFRW EVLVRCGDPA KPEVKASFSA LTTENGWFGM PDNCTVDGRG RLWIATDGNN RGATGRADGI WAVETEGPRR GTARHFLRVP VGAEMCGPCF TPDDETFFVA VQHPGEPDEE GALGSYEKPS TRWPDFSPDL PPRPSVVAVR RTGGGRIG
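The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that translation table 11 uses the same amino-acid assignments as the standard code, differing only in permitted start codons. The `translate` helper is hypothetical and written for illustration only.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments of the standard
# code, which translation table 11 is assumed to share.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTCCGACA CGCCCGACGC TCCGACCCTC"))  # MSDTPDAPTL
```

The output matches the first ten residues of the protein sequence, and translating the final twelve bases (`CGGATCGGCTGA`) gives `RIG` followed by the stop codon, matching its last three residues.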