Gene Information |
Locus tag | Mnod_2610 |
Symbol | |
ID | 7305144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium nodulans ORS 2060 |
Kingdom | Bacteria |
Replicon accession | NC_011894 |
Strand | - |
Start bp | 2695657 |
End bp | 2697960 |
Gene Length | 2304 bp |
Protein Length | 767 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643600329 |
Product | Collagen triple helix repeat protein |
Protein accession | YP_002497876 |
Protein GI | 220922574 |
COG category | |
COG ID | |
TIGRFAM ID | |
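The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only a consistency check on the values in this record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 2695657
end_bp = 2697960
gene_length_bp = 2304
protein_length_aa = 767

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A non-spliced CDS is read in triplets; the last triplet is assumed to be
# the stop codon, which does not encode a residue.
codons, remainder = divmod(span, 3)
encoded_residues = codons - 1

print(span, codons, remainder, encoded_residues)  # prints: 2304 768 0 767
```

Under these assumptions the span matches the reported gene length, and 768 codons minus one stop codon gives the reported 767 aa.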
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.965964 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGATGAGG CTGACCGGCT CGCGGAGGCG GTGTTTCGCG CGGTGGAGGG CTATCTCGCC CGGACGGTGG GGCCGCTGCT CGCCCGGCTC GAGGCGCTGG AGAGGCGGGA GCCTCGGCGC GGCGAGCGGG GTCACAATGG CCAGCCGGGC CGCGGCCTCG CGAAGGCCAT CGTCACCGCG GATGGCCTGC TCGTCCTGAC GATGACGACG GGCGAGGAGC TCAGCGTCGG CCGCGTGACA GGCAAGGACG GCAAGGACGG GGCCGATGGC GCGCCCGGCC GGGACGGCCG CGATGGGGTC GACGGCGCGC CGGGCGAGCC GGGCCGCGAT GGCACGGATG GGGCGGATGG TCGGGACGGC CGCGACTTCG ATCCGGAGCT TCTCCGCACC GCGGTGGTCG AGGAGGTATC CAAGGCGGTC GACGCTATCC CGAGGCCGCG GGACGGTGCG CCTGGGCGCG ACGGCACGGA CGGGAAGGAC GGACGGGATG GCAAGGACTT CGATCCCGCG GCCATGACGG CCGCTGTCGA GCAGGCCGTC ACGAAGGCCC TCAGCCAGAT CCCTGTTCCG AAGGACGGGA CGCCGGGCCG GGACGGCAGG GATGGCGTCG GCGTTGCGGG CGCCCTGATC GACCGCAGCG GCCATCTCGT CCTCACCCTG TCGAACGGCG AGACCCGCGA TCTCGGCAGC GTGGTCGGCC GGGACGGCAA AGATGGCGAG CCTGGACGTG ATGGCGCTCG TGGCGAGCGA GGAGAGCCGG GTCGAGACGG TAAGGACGGG GCCGATGGTC GGGATGGCCA GGACTTCGAT CCGGACCTGC TGCGTTCCGC GGTTGCCGAG GAGGTCTCCA AGGCGGTTGC AGCCATCCCG CAGCCGAAGG ACGGCGCGCC CGGGCGAGAT GGGAGGGACG GAAAGGACTT CGATCCCGAG GTGCTCGCGG CCGCTGTCGA GCGTGCCGTC ACGCAGGCGG TCGGTCGGAT CCCCGTTCCG AAGGACGGCG CGCCGGGCCG GGACGGCGCA GATGGGGTCG GTGTTGCTGA CGCCCTCATC GACCGGACTG GCAAGCTTGT CGTCACGCTG ACGAATGGCG AGACCCGTGA TCTTGGCTTG GTGGTTGGCC GAGATGGCAA GGACGGTGCC GACGGCCGCG ATGGCCGGGA CGGCGCTCCC GGTGAGCGTG GCGAACAAGG AGAACCAGGT CGCGATGGCA AGGATGGCAC CGACGGTCGG GATGGCCAGG ACGTCGAACC GGAGGCCCTG CGGATCGCTG TCGAGGAGGC GGTCACGCAG GCCGTCAGCC AGATCCCGCT CCCGAAGGAC GGGGCGCCCG GCCGGGACGG TCGCGACGGC GTGGACGGCA AGGACGGCGT CGGCCTCGCC GATGCTCTGA TCGATCGCTC CGGCAACCTC GTGGTCACGC TGTCGAACGG CGACACCAAG CAGCTTGGCC TCGTGGTCGG CCGGGACGGC AAGGATGGCC GGGCCGGCAG CGCCGGCAAG GATGGTGCGC CGGGCCCGCG CGGTGAGCAG GGACCGCCCG GGGAGAAGGG CGAGCCGGGT CCTTCTGGCG AGCGCGGAGA GCGTGGCGAC CAGGGCCGGC CGGGCGAGCG CGGGCCACAA GGTGAGCGCG GCCTGCCGGG CGAGCCCGGC GCTGCGGGTG AGCAGGGTCC GCCCGGTGAG CGCGGTGAAC AGGGACCGCC CGGCGATCAA GGAGAGCGTG GCGAGCCCGG CGAGCAGGGG CCTCCAGGCG AGCGGGGCGG ACGGGGGCCG CAGGGCGAGC CGGGTCCTCC CGGTGAGCGG 
GGCGAGCGCG GTGAGCCCGG AGAGCGTGGC GAGCGCGGTG AACCTGGCCC GGCCGGCGAG CGCGGCGAGC CGGGCCCGCC CGGAGAGAAG GGCGAGCAGG GGCCTCCTGG TGAGACGGGC GAGCGCGGTG AGCCGGGGGC TCCTGGTGAG CGCGGCGCCG ATGGCTTCGG CTTCGAGGAT ATGGAGGAGG AGCTCGCCGG CGATGGGCGG ACCCTCATCC GGCGCTACCG CCGCGGCGAG GAGGTGAAGG AGTTCCGCCA CCGCGTCCCG ACGGTGATCG ATCGCGGCGT CTACAAGGCG GGCACGACCT ACCAGCCCGG CGACGGCGTC ACCTGGGCCG GCTCGTTCTG GATCGCTCAG GCCGAGACGA GTTCGAAGCC GGATGGCGGC GAGGGCTGGC GCTTGGCGGT CAAGCGCGGC CGCGATGGCA AGGACGGCAA GCCGGGCGAG CGCGGGCCCG AGGGCAAGGC CGGCCCGCCC GGCCGCGACC CAACTCGGCC CTGA
|
Protein sequence | MDEADRLAEA VFRAVEGYLA RTVGPLLARL EALERREPRR GERGHNGQPG RGLAKAIVTA DGLLVLTMTT GEELSVGRVT GKDGKDGADG APGRDGRDGV DGAPGEPGRD GTDGADGRDG RDFDPELLRT AVVEEVSKAV DAIPRPRDGA PGRDGTDGKD GRDGKDFDPA AMTAAVEQAV TKALSQIPVP KDGTPGRDGR DGVGVAGALI DRSGHLVLTL SNGETRDLGS VVGRDGKDGE PGRDGARGER GEPGRDGKDG ADGRDGQDFD PDLLRSAVAE EVSKAVAAIP QPKDGAPGRD GRDGKDFDPE VLAAAVERAV TQAVGRIPVP KDGAPGRDGA DGVGVADALI DRTGKLVVTL TNGETRDLGL VVGRDGKDGA DGRDGRDGAP GERGEQGEPG RDGKDGTDGR DGQDVEPEAL RIAVEEAVTQ AVSQIPLPKD GAPGRDGRDG VDGKDGVGLA DALIDRSGNL VVTLSNGDTK QLGLVVGRDG KDGRAGSAGK DGAPGPRGEQ GPPGEKGEPG PSGERGERGD QGRPGERGPQ GERGLPGEPG AAGEQGPPGE RGEQGPPGDQ GERGEPGEQG PPGERGGRGP QGEPGPPGER GERGEPGERG ERGEPGPAGE RGEPGPPGEK GEQGPPGETG ERGEPGAPGE RGADGFGFED MEEELAGDGR TLIRRYRRGE EVKEFRHRVP TVIDRGVYKA GTTYQPGDGV TWAGSFWIAQ AETSSKPDGG EGWRLAVKRG RDGKDGKPGE RGPEGKAGPP GRDPTRP
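The sequences above are shown in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned and checked against the GC content and CDS fields of the record follows. The helper names (`clean_sequence`, `gc_percent`, `looks_like_complete_cds`) are hypothetical and not part of any database tool. The start-codon test is deliberately simplified to `ATG`, although translation table 11 also permits alternative start codons. The toy input is not taken from the record.

```python
# Stop codons under translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: ATG start, stop codon at the end, length divisible by 3."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Toy input in the same blocked layout as the record (hypothetical data).
toy = clean_sequence("ATGGCC TGA\n")
print(toy, round(gc_percent(toy), 1), looks_like_complete_cds(toy))
```

Applied to the full gene sequence field, `gc_percent` would be expected to come out close to the GC content reported in the record, provided that value was computed over the same span.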