Gene Information |
Locus tag | Mchl_0878 |
Symbol | |
ID | 7115955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 894758 |
End bp | 895747 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643523681 |
Product | Extensin family protein |
Protein accession | YP_002419724 |
Protein GI | 218528908 |
COG category | [S] Function unknown |
COG ID | [COG3921] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
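The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 894758
END_BP = 895747


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 990 329
```

Under these assumptions the computed values agree with the Gene Length (990 bp) and Protein Length (329 aa) fields.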
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.268141 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.17078 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGCGTA AAGCGTTAGC GTTCTCGGCT CTGGTGCTGT TCGGCGCGGG GCTCACGGGC TGTGCGATCA ACCGGTTCGA GCGCCGGGAA GCATGGCGCG ACCAAGCCGA ACAGATGTGC ATCGCGCGCA AGCTCGTGCA GCCGACGGCC TATGTCTCGC TCGCCAAGGA GATCGACGGC CCCGGCCCCT GCGGCATGCA GCAGCCGTTC AAGGTCACCC GGCTCGGTGG CGGCACGGTG GCGCTCAAGC AGCGCATGAC CCTGGCTTGC CCGGCGCTCG CCGAGGCCGA GGCGTGGCTC GCCGACACGA TCCAACCCGC CGCCAACCTC TATTTCGGCG TGCCGGTGGC CGAGATCAAC GCGGGCACCT ATTCCTGCCG CGGCCGCAAC AACCAAGCCG GTGCCAAACT CTCCGAGCAT TCGTTCGGCA ACGCGCTCGA CATCATGTCC TTCACGCTCG CCGACGGGCA CGTCATCACC GTCAAGGGAG GCTGGCGCGG CACCGAGGCC GAGCAGGCCT TCCTGCGCGA GGTCTTCGTG GGGGCCTGTG CCCGGTTCTC GACCGTGCTG GCGCCGGGTT CCAACGTGTT CCACTACGAC CACATCCACG TCGATCTGGC GATGCACGAC CCGCGCGGCC TGAAGCGCAT CTGCAAGCCG CTGCTGAAGT TCGAGTCGCA GCTCAACCTT GCCGACGGCT CGCCGCGGCC GCTGGCCTCG CCCCGCCCGC CCGCGCGTCA GACCGTCCCG ACCCAGGCCC CGATCGACGT CGAAGAGGAC GATCCCTACG GCGTCGCGCC GACCTCCTCG CGCACGACCG GCACGCGCGT CGCCCGCGCC CCGGCCGCCC CGGCGCCGAC GGCCTATGCC GCCGCTCCGG CCCCAAGCCG GCCACGCTCC CCGGTTCCGG CGCATGACGC GGCCTACGCG CCGCTGTCGC TGGCCGCGCC CCATGCCTCG GACCACGCTT CGGACGAGCC GATCTATTAA
|
Protein sequence | MWRKALAFSA LVLFGAGLTG CAINRFERRE AWRDQAEQMC IARKLVQPTA YVSLAKEIDG PGPCGMQQPF KVTRLGGGTV ALKQRMTLAC PALAEAEAWL ADTIQPAANL YFGVPVAEIN AGTYSCRGRN NQAGAKLSEH SFGNALDIMS FTLADGHVIT VKGGWRGTEA EQAFLREVFV GACARFSTVL APGSNVFHYD HIHVDLAMHD PRGLKRICKP LLKFESQLNL ADGSPRPLAS PRPPARQTVP TQAPIDVEED DPYGVAPTSS RTTGTRVARA PAAPAPTAYA AAPAPSRPRS PVPAHDAAYA PLSLAAPHAS DHASDEPIY
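The relationship between the gene sequence, the translation table, and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the pipeline that generated the record: it uses only the Python standard library, builds the codon table from the standard NCBI amino-acid string (translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as starts), and does not handle alternative start codons. The helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon -> amino acid map in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence above.
prefix = "ATGTGGCGTA AAGCGTTAGC GTTCTCGGCT"
print(translate(prefix))  # MWRKALAFSA
```

The translated prefix matches the first group of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` gives values to compare against the Protein sequence and GC content fields.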