Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_4370 |
Symbol | |
ID | 6133073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 4816158 |
End bp | 4817192 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641644509 |
Product | triple helix repeat-containing collagen |
Protein accession | YP_001771147 |
Protein GI | 170742492 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.324657 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCTCC GCCGTGCCGT CCTGGTCCTG TCCACGCTCG TCCTGATCGG ATCGGGGCCG GCCTTCGCCC AGGGCGAGGC CGCCCCTGCC CCCGCCGCGC CGAAACCCGA GGCGCCGCGG GGAGCGGCCC GAAAGCCCGC CTCCTCGGCG ATCCAGATCT GGGACGCGCG GATCGAGGGC GGCGACCTGC GCATCTCCGG CAATGTCGGC AAGGCGGGCG TGACCGTCTC GCTCGACGAC GAGGTCGCGG TCCAGAGCGA CCGGCGCGGC CGCTTCGCGA TCAAGGTCCC GTACGTTCCG CAGACCTGCG TGGCGACCCT GACGGCGGGC GAGGAGTCGC GCGAGGTCGC GGTGGCGAAT TGCGCCCCGC AGGGCCAGCC CGGCCCCGCC GGCCAGCCCG GGCCGACCGG CCCGCAGGGC GTGGCCGGCC TGCCCGGCCC GAAGGGCGAC CCAGGCCCGC AGGGACCGGC GGGTCCCAAG GGGGAGCCCG GGCCCAAGGG GGAGCCCGGG CCCAAGGGGG AGCCCGGGCC CAAGGGGGAG CCCGGGCCCA AGGGTGAGCC CGGGCCCAAG GGTGAGCCCG GGCCCAAGGG TGAGCCCGGA CCCAAGGGGG AGCCGGGCCC GCGCGGAGAG GCCGGACCTC AGGGCGCGCT GGGGCCCAAG GGCGAAGCTG GATCAAGGGG CGAACCCGGA CCAAGGGGCG AACCCGGCCC GAAGGGAGAG GCGGGGCTGG CTGGCGCGCC CGGCCCGAAG GGCGAGGCCG GTCCGCGCGG ACCGCAGGGC GAGCGCGGAC CCCCGGGCGC GCCCGGCGCG GCGGCCCCCG TCGCCGCGGC GACGGCGCTG CCGATGCGGG TCCTGCGCAG CGAGACCTGC GCCACCGGCT CCTGCGAACT CGCCTGCGAG GGCGGCGAGA CGCTGCTCTC GGCCTATTGC GTGCGGGCCG GCGCGCCGAC CTTCACGCGG CGGGAGGGCG GGCAGGCCGC GGCCTTCTGC CCGTCCGAGA GCGCCGGCAT CGTCGCGGTC TGCGCGAAGC TCTGA
|
Protein sequence | MRLRRAVLVL STLVLIGSGP AFAQGEAAPA PAAPKPEAPR GAARKPASSA IQIWDARIEG GDLRISGNVG KAGVTVSLDD EVAVQSDRRG RFAIKVPYVP QTCVATLTAG EESREVAVAN CAPQGQPGPA GQPGPTGPQG VAGLPGPKGD PGPQGPAGPK GEPGPKGEPG PKGEPGPKGE PGPKGEPGPK GEPGPKGEPG PKGEPGPRGE AGPQGALGPK GEAGSRGEPG PRGEPGPKGE AGLAGAPGPK GEAGPRGPQG ERGPPGAPGA AAPVAAATAL PMRVLRSETC ATGSCELACE GGETLLSAYC VRAGAPTFTR REGGQAAAFC PSESAGIVAV CAKL
|
| |