Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mnod_4285 |
Symbol | |
ID | 7301184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium nodulans ORS 2060 |
Kingdom | Bacteria |
Replicon accession | NC_011894 |
Strand | + |
Start bp | 4325554 |
End bp | 4327500 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643601938 |
Product | Collagen triple helix repeat protein |
Protein accession | YP_002499464 |
Protein GI | 220924162 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.275713 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGAGG CTGACCGGCT CGCGGAGGCG GTGTTTCGCG CGGTGGAGGG CTATCTCGCC CGGACGGTGG GGCCGCTGCT CGCCCGGCTC GAGGCGCTGG AGAGGCGGGA GCCTCTGCGC GGCGAGCGGG GTCACAATGG CCAGCCGGGC CGCGGCCTCG CGAAGGCCAT CGTCACCGCG GATGGCCTGC TCGTCCTGAC GATGACGACG GGCGAGGAGC TCAGCGTCGG CCGCGTGACA GGCAAGGACG GCAAGGACGG GGCCGATGGC GCGCCCGGCC GGGACGGCCG CGATGGGGTC GACGGCGCGC CGGGCGAGCC GGGCCGCGAT GGCACGGATG GGGCGGATGG TCGGGACGGC CGCGACTTCG ATCCGGAGCT TCTCCGCACC GCGGTGGTCG AGGAGGTATC CAAGGCGGTC GACGCTATCC CGAGGCCGCG GGACGGTGCG CCTGGGCGCG ACGGCACGGA CGGGAAGGAC GGACGGGATG GCAAGGACTT CGATCCCGAG GTGCTCGCGG CCGCTGTCGA GCAGGCCGTC ACGAAGGCCC TCAGCCAGAT CCCTGTTCCG AAGGACGGGA CGCCGGGCCG GGACGGCAGG GATGGCGTCG GCGTTGCGGG CGCCCTGATC GACCGGAGCG GCAAGCTCAT CCTCACTCTG TCGAATGGCG AAACGCGGGA TCTCGGCTTG GTGGTTGGCC GAGATGGCAA GGACGGTGCC GACGGCCGCG ACGGCCGCGA CGGCGCTCCC GGTGAGCGTG GCGAACAAGG AGAACCAGGT CGCGATGGCA AGGATGGCAC CGACGGTCGG GATGGCCAGG ACGTCGAACC GGAGGCCCTG CGGGTCGCTG TCGAGGAGGC GGTCACGCAG GCCGTCAGCC AGATCCCGCT CCCGAAGGAC GGGACGCCGG GCCGGGACGG TCGAGACGGC GTGGACGGCA AGGACGGCGT CGGCCTCGCC GATGCTCTGA TCGATCGAGC TGGCAATCTC GTGGTCACGC TGTCGAACGG CGACACCAAG CAGCTTGGCC TCGTGGTCGG CCGGGACGGC AAGGATGGCC GGGCCGGCAG CGCCGGCAAG GATGGTGCGC CGGGCGAGCG GGGGGAGCCG GGCCCGCGCG GCGAGCAGGG CGAGAAGGGC GACCCCGGAG AGCGGGGCGA GCGAGGTCCC GCCGGCGAGC GGGGACCGGC CGGCGAGCCG GGTCCGCCGG GCGAGCCGGG TCCGCCGGGC GAGCCGGGTC CGCTGGGCGA GCGCGGGCCA CAAGGTGAGC GCGGCCTGCC GGGCGAGCCC GGCGCTGCGG GTGAGCAGGG TCCGCCCGGT GAGCGCGGTG AACAGGGACC GCCCGGCGAT CGAGGAGAGC GTGGCGAGCC CGGCGAGCAG GGGCCTCCAG GCGAGCGGGG CGAACGGGGG CCGCAGGGCG AGCCGGGTCC TCCTGGTGAG ACGGGCGAGC GCGGTGAGCC GGGTGCTCCT GGTGAGCGCG GCGAGCGCGG GATACCTGGC GGGCGCGGTG AGAAGGGCGA CCCGGGCCGC GACGGAAAGG ACGGCGCTCC TGGGGCCGCC GGAGAGCGCG GCGAACGGGG CGAGAAGGGC GAAACCGGTG AGCGCGGCGC CGACGGCTTT GGCTTCGAGG ACCTGGAGGA GGAGCTCGCC GAGGACGGCC GCACGCTTGT GCGGCGCTAC CGCCGCGGCG AGGAGGTGAA GGAGTTCCGC CACCGCGTCC CGACGCTGAT CGATCGCGGC GTCTACAAGG CGGGCACGAT CTACCAGCCC GGCGACGGCG TCACCTGGGC CGGCTCGTTC TGGATCGCCC AGACGGAGAC CGACGCGAAG CCGGATGGCG GCGAGGGCTG GCGCCTCGCG GTCAAGCGCG GCCGCGATGG CAAGGACGGC AAGCCGGGCG AGCGCGGGCC CGAGGGCAAG GCCGGCCCTG ACGGCCGGAG GTGGTGA
|
Protein sequence | MDEADRLAEA VFRAVEGYLA RTVGPLLARL EALERREPLR GERGHNGQPG RGLAKAIVTA DGLLVLTMTT GEELSVGRVT GKDGKDGADG APGRDGRDGV DGAPGEPGRD GTDGADGRDG RDFDPELLRT AVVEEVSKAV DAIPRPRDGA PGRDGTDGKD GRDGKDFDPE VLAAAVEQAV TKALSQIPVP KDGTPGRDGR DGVGVAGALI DRSGKLILTL SNGETRDLGL VVGRDGKDGA DGRDGRDGAP GERGEQGEPG RDGKDGTDGR DGQDVEPEAL RVAVEEAVTQ AVSQIPLPKD GTPGRDGRDG VDGKDGVGLA DALIDRAGNL VVTLSNGDTK QLGLVVGRDG KDGRAGSAGK DGAPGERGEP GPRGEQGEKG DPGERGERGP AGERGPAGEP GPPGEPGPPG EPGPLGERGP QGERGLPGEP GAAGEQGPPG ERGEQGPPGD RGERGEPGEQ GPPGERGERG PQGEPGPPGE TGERGEPGAP GERGERGIPG GRGEKGDPGR DGKDGAPGAA GERGERGEKG ETGERGADGF GFEDLEEELA EDGRTLVRRY RRGEEVKEFR HRVPTLIDRG VYKAGTIYQP GDGVTWAGSF WIAQTETDAK PDGGEGWRLA VKRGRDGKDG KPGERGPEGK AGPDGRRW
|
| |