Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3487 |
Symbol | |
ID | 6129324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 3891684 |
End bp | 3894230 |
Gene Length | 2547 bp |
Protein Length | 848 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641643658 |
Product | triple helix repeat-containing collagen |
Protein accession | YP_001770306 |
Protein GI | 170741651 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.18885 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTACG GAACGATCAC GGTAACGGCA AATTTCGTAC AGCCCCTAAC AAATGCAGTT TCGAATTCAG GCATCGTACT TCTTTATGAC ACGACCGGCA CATCGACCCT GTCTGCATCT AATTTTATTG GATCTTCAGG ATATTACTTT ACACCCTCAG GCAGCACCAC AGGAACTTCC GCATATTTCT CGCTGACAAC CACGGCGACC GTCGGAAGCG GCCGCAGCTT CACCGCCATC ATCGAAAATC CAACCTACTT TAATTCACCA GCACCGACGA GCCTCACCAG TTCAGCAGTC TACAATTCGA ATGGCCAGGA TACCGGCGCG TCCGCAACAC TTAGTGCTGG CGGCGTCGAC AAGGCAAACG GTTTGGGTGG AGCGAACTAT GACGTGAATT ACGCGAAGCT CGTCTTCTCG AATGTCACCG TCACGGCCGG AGGGGGCGCA CCCGGGCCGA CAGGTGCCAG CGGCGCAACG GGCGCAACGG GCGCCACTGG CGCAACGGGG AGCGGAGCCA CAGGCGCGAC AGGGGCCACC GGCGCCACGG GCACGGGCGC GACCGGGCCG ACCGGGGCCG CGGGCTCCAC CGGCGCGACG GGCACGGCCG GTGCGACGGG CGCCACGGGA GCGGGCGCCA CAGGCGCCAC AGGCGCGACC GGTGCGACGG GAGACACGGG CCCCACGGGC GCGGGGGCGA CCGGAGCCAC AGGCGCGACG GGCGCGGGGG CGACCGGCGC CACGGGATCG GATGGCGCGA CCGGCGCCGC GGGCGCGACC GGTGCGACGG GATCCGCCGG TGCGACGGGG GCGACGGGCA CGGCGGGGAC CACGGGCGCG ACCGGCGCCG CGGGCGCGGA TGGCGCCGGG GGCGCGACCG GCAGCACCGG CGCCGCGGGC GCGACGGGTG CCACCGGCGC GGACGGCGCG AGGGGCGCCA CGGGCAGGGA CGGCGCCGCG GGGGCGACCG GCGGCACAGG GGCCGCAGGC GCAACCGGCG GCACGGGAGC GGCCGGCAGC ACCGGCGCCG CGGGCGCAAC GGGTGCCACC GGCGCAGATG GCGCGAGGGG CGCCACGGGC AGGGACGGCG TTGCCGGGGC GACCGGCGGC ACGGGGGCTG CGGGTGCGAC CGGCGCCACG GGCGCGGACG GCGCGAGGGG TGCCACGGGC AGGGACGGCG CCACCGGGGC GACCGGCAGC ATGGGGGCTG CGGGTGCGAC CGGCGCGACC GGAATCGGTG CGACGGGGTC GGCCGGCAGC ACCGGCGCCG CGGGCGCGAC GGGTGCCACC GGCGCGGACG GCGCGAGGGG CGCCACAGGC AGGGACGGCG CCGCGGGGGC GACCGGCAGC ACGGGGGCTG CGGGTGCGAC CGGCGCGACC GGAATCGGCG CGACGGGGTC GGCTGGCAGC ACCGGCGCCG CGGGCGCAGA TGGCGCGAGG GGCGCCACGG GCAGGGACGG CGCTGCCGGG GCGACCGGCA GCACGGGGGC CGCAGGCGCA ACCGGCGCCA CGGGCGCGGA CGGCGCGAGG GGTGCCACGG GCAGGGACGG CGCCACCGGG GCGACCGGCA GCATGGGGGC TGCGGGTGCG ACCGGCGCGA CCGGAATCGG TGCGACGGGG TCGGCCGGCA GCACCGGCGC CGCGGGCGCG ACGGGTGCCA CCGGCGCGGA CGGCGCGAGG GGTGCCACGG GCAGGGACGG CGCTGCCGGG GCGACCGGCA GCACGGGCGC CGCGGGCGCG ACCGGTCCCG CTGGGGCCAC GGGCGCCGGC GGCACGACCG GCGCGACGGG GGCGGCCGGG GCGACGGGCA CCGCCGGAGC CACCGGCGCG ACCGGGAGCA CCGGACCGGC CGGGGCCACC GGCGCCACCG GCCCCGAATG CTTCACCCAC GGCACCCGCC TGCTCACGCT CACGGGCGCG CGCCGCGTCG AGGATCTCGC GGTCGGCGAC CGCCTCCTCA CCGCCGCCGG CGAGGCCCGG CCGGTGGTCT GGATCGGGCG CCGCCGCCTG CGCCCCGACG CCCATCCCCG CCCCGACCGC GTCCGCCCGG TGCGGATCCG GGCCGGGGCC CTGGCGCCGG GCCTGCCGGA GCGCGACCTC CTGCTCTCGC CCGGCCACGG GGTGCTGTTC GCCGGCCACC TGATCCCGGC CGGCCTGCTG GTCGACGGGC GCGGCGTGGC GGTGGAGGCC GTGGCCGAGG TGGAGTACCT GCACGTGGAA CTCGACCTGC ACGACGTGGT CCTGGCCGAG GGCGTGCCCT GCGAGAGCTA CCTCGATGCC GGGCAGCGGG CGGATTTCGA GGAGGCGGGC GGGGTGACGC GCCTGCACCC GGTCTTCCTG CCGCTGACCT ACGAGGCGGC CTGCGCGCCG CTGGCGGTGG CCGGCCCGGT GCTGGCGGCG GCGCGGGCGC AGATCGCGGC CCGGGCGGAG GCCCAAGCCG AGGCCGCAGC CCAAGCCGAG GCCCAAGTCG GGGCCCAAGT CGAGGCCCAA GCCGAGGCCG CGGCGCGGGA GGGCGAGGCC GCGCCGCGCC GGCGGCCGGC CGGGTGA
|
Protein sequence | MAYGTITVTA NFVQPLTNAV SNSGIVLLYD TTGTSTLSAS NFIGSSGYYF TPSGSTTGTS AYFSLTTTAT VGSGRSFTAI IENPTYFNSP APTSLTSSAV YNSNGQDTGA SATLSAGGVD KANGLGGANY DVNYAKLVFS NVTVTAGGGA PGPTGASGAT GATGATGATG SGATGATGAT GATGTGATGP TGAAGSTGAT GTAGATGATG AGATGATGAT GATGDTGPTG AGATGATGAT GAGATGATGS DGATGAAGAT GATGSAGATG ATGTAGTTGA TGAAGADGAG GATGSTGAAG ATGATGADGA RGATGRDGAA GATGGTGAAG ATGGTGAAGS TGAAGATGAT GADGARGATG RDGVAGATGG TGAAGATGAT GADGARGATG RDGATGATGS MGAAGATGAT GIGATGSAGS TGAAGATGAT GADGARGATG RDGAAGATGS TGAAGATGAT GIGATGSAGS TGAAGADGAR GATGRDGAAG ATGSTGAAGA TGATGADGAR GATGRDGATG ATGSMGAAGA TGATGIGATG SAGSTGAAGA TGATGADGAR GATGRDGAAG ATGSTGAAGA TGPAGATGAG GTTGATGAAG ATGTAGATGA TGSTGPAGAT GATGPECFTH GTRLLTLTGA RRVEDLAVGD RLLTAAGEAR PVVWIGRRRL RPDAHPRPDR VRPVRIRAGA LAPGLPERDL LLSPGHGVLF AGHLIPAGLL VDGRGVAVEA VAEVEYLHVE LDLHDVVLAE GVPCESYLDA GQRADFEEAG GVTRLHPVFL PLTYEAACAP LAVAGPVLAA ARAQIAARAE AQAEAAAQAE AQVGAQVEAQ AEAAAREGEA APRRRPAG
|
| |