Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3122 |
Symbol | |
ID | 6132556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 3457049 |
End bp | 3458836 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641643312 |
Product | capsular polysaccharide biosynthesis protein-like protein |
Protein accession | YP_001769965 |
Protein GI | 170741310 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4421] Capsular polysaccharide biosynthesis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.043835 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGATG TGAACCTTGC CGCGTCTCGA CGCGAGTCGC TCGATCAGAT CGGACTTCGT CTCGGCACTC TTCGGGCGTC CAACGTTCAC GACTATCTGC GGCGCTACGA AGCATTATTG CGCCCGAAAC TTTCGCGCCC GATCAAGATT CTCGATCTCT CGTTGTTGGA GATCACCGGC GCGCGCGCGC TGGCGGAGTT CATCGAAACA GGATTGATCG TCGTCAGCGT CGGTCCGGAG CGCGAGATCC CGGACCTCGA GCTCGCCGAT CATCCGCGCC TCCTCCTGAC GCGCGGCGAT TGCCGCGATC CGGCTTACCT CTGCAGCCTT CACGCGTACG GCCCGTTCGA CCTCATTCTG GAGAACGACC GGCACGTGGT CGAGGATCAG CTGATCGCAC TGGAATACCT GTTCCCCGCG TTGGCGCCGG GCGGATCCTT CGTCTTCGAG AGCGCGTTCG CCTCGACAGC GGAAACGCCG CGCCTGAGCG AGGCGGGGAT CACGGGCATC ATCGACGTCG CGCGAGACCT CGGGACCAGC TTGACGGCCA GAGAGCCGAG GATCCGGCAG TTCGACGACG AGATCGTCAC GAAGGCGCTC GATGCCGTGA TCTTCGAGAG GTCGAACATC ACCCTGAGGC GCACCGACAA GCCGAAGAAC GCGCCGGTGA TCCTGCACGC CAAACCGTTC GCGGAGATCG GCGACGGCGT CGTGGAAGCG CTCGAGAGCA AGCCGTACAC GCGCACGGAC CCGGTTGTCC AAACCCGGCT GGCTTGGATG ACCGAGCGGC TGCTGGAGCG CGTCGGAAAG GTCGAGCATC CCCCGGCGGG CCAGATCGGC ACCGTGAGCA ACGCCATCAT CTTCGGCGAA GGCATCATCG TCGATCGCGC CGGCCGCCTG GTGGTCGAGA GCTTGATGAA CGAGCGGGAC GTTCCGCGTC TTCCCTACAT CAAGAAGCTG CACGGCGACC GCTACGCGAT GCTGGACCAC GATCAGGTCG AGCATCTGAG CGGCGAGAAC ATCGTTGCCG TCAAGCAGCG CTGGGATACC AATTACGGCC ATTGGCTTGT CGAAACCCTG CCCAGGGTCG GCCTGCTGGC GGAGCGCATG CCGCTGGACA GCTGCAAGCT GCTCATCACC GCTTGGTCGG ACGCAATGGC GAGCGTGATG CGGCAGTCGC TCGTCCTATG TGGCGCGACG AACGAAAACA TCCTCCAGGT GTCTGGCGCC CCCCTGTCGG TCGACAAACT GATCTATGTG ACGCCGATCT CCAGCCATCC ATTTGTTTTC CACCCTTACT CGGCGAGATT TTTGCGGTCG CTCGCCGTCA AGCACATGGA GCGCATCGGG GTGTCCGGCG CCCAGCCGAC CAAGGTGTAC GTCTCTCGAA ACAAGGGAAA TTCGCGCCGC ATCGTCAACG AGGACGAGAT CGCTGCGATC CTCACGTCGC GTGGATACAG GATCGTTTAT CCTGAGAATC ACACTTTCTA TGAGCAACTC GAGATTTTCG CAAATTGTAC GCATATCGTG GGAAACCTCG GTGCCGCGTT GACGAACGTT GCCTTCGCAC GCGACGGAGT CGGCCTTCTT GCGCTCGCCT CGGAGTTCAT GCCGGACGAT TTCTTCTGGG ATCTCACCAG TCAGCGCGGG GGGAGGTATT TCTCCATTCA CGGCACAGCG GTGGAGAAAC ACGATGAAGG TCATCGATCG GAAATGAATG CTTGGTTCGT GCTGGATATC CCGAATTTTG TCGAGATGCT CGACCGGTTC GAAGCCGAAG TTGCTTGA
|
Protein sequence | MEDVNLAASR RESLDQIGLR LGTLRASNVH DYLRRYEALL RPKLSRPIKI LDLSLLEITG ARALAEFIET GLIVVSVGPE REIPDLELAD HPRLLLTRGD CRDPAYLCSL HAYGPFDLIL ENDRHVVEDQ LIALEYLFPA LAPGGSFVFE SAFASTAETP RLSEAGITGI IDVARDLGTS LTAREPRIRQ FDDEIVTKAL DAVIFERSNI TLRRTDKPKN APVILHAKPF AEIGDGVVEA LESKPYTRTD PVVQTRLAWM TERLLERVGK VEHPPAGQIG TVSNAIIFGE GIIVDRAGRL VVESLMNERD VPRLPYIKKL HGDRYAMLDH DQVEHLSGEN IVAVKQRWDT NYGHWLVETL PRVGLLAERM PLDSCKLLIT AWSDAMASVM RQSLVLCGAT NENILQVSGA PLSVDKLIYV TPISSHPFVF HPYSARFLRS LAVKHMERIG VSGAQPTKVY VSRNKGNSRR IVNEDEIAAI LTSRGYRIVY PENHTFYEQL EIFANCTHIV GNLGAALTNV AFARDGVGLL ALASEFMPDD FFWDLTSQRG GRYFSIHGTA VEKHDEGHRS EMNAWFVLDI PNFVEMLDRF EAEVA
|
| |