Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mnod_4881 |
Symbol | |
ID | 7303821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium nodulans ORS 2060 |
Kingdom | Bacteria |
Replicon accession | NC_011894 |
Strand | + |
Start bp | 4968569 |
End bp | 4970800 |
Gene Length | 2232 bp |
Protein Length | 743 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643602523 |
Product | Collagen triple helix repeat protein |
Protein accession | YP_002500043 |
Protein GI | 220924741 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCCAACG GAACGATTAC CGTCACAGGT TCATTTTCTG TTCCACTCGT CAGCGGATAT ACCGCGCCAG GAGTTGTTTA TCTCTACAAT ACGACCAGCG GCGGGTTTTC GGCAGGTAAT TATCTCGGAA CTACACAATA TGTTTACCGT CCAACGGATC CGGATAACGC CACAACCGCC TTTTTCTCGC TCACGACGAC GTTGCCCGTC GGCTCTAGCT ACCAGTTAAC GGCGGTTCTG GAAGCGGTGA GTGGATCTCC ACCAGCGACC GCTTTGACGA GTGCGAGCGT CAATAACTCG AACGGGACGG ACGCCATTAC GACGGCGTTC ACCCAGAACG CGGGCACCGA CATCGCCAAT GCCGTCGCAA ACAAGTACTC CACCATTCTG TACAATGGCG TGGACGTGAC GGGATCCCCT GGCCCGACAG GTCCGGCTGG CCCGGCTGGG GCGACCGGTG CTGCGGGTGC TACCGGGGCA ACCGGTCCAG CTGGCGCTAC GGGTGCGACT GGCGCAGACG GAGCGGCCGG GGCGACCGGC GCCACGGGTG CAACCGGTGC TGCCGGCGCG ACCGGGGCGA CGGGTCCGGC TGGCGCCACG GGTGCGACCG GCGCAGACGG AGCAACCGGC GCGACCGGTG CAGATGGAGC AACCGGAGCG ACCGGTGCAA CCGGCGTCAC GGGTCCGGCT GGCGCGACAG GCGCCGCGGG TGCGACTGGT GCTACAGGCC CGACGGGCGC TGCGGGTGTG ACCGGCCCGA CCGGTGCTGC GGGTGCGACC GGGGCAACCG GTTCAGCTGG CGCTACAGGT GCGACCGGCG CGGCTGGCGC TACAGGTGCG ACCGGCGCTG TGGGTGCTAC TGGGGCGACG GGCCCGGCCG GGGCAGATGG AGCAACCGGA GCGACCGGTC CGGCTGGCGC GACCGGGGCA ACGGGCGCTG CGGGTGCAAC CGGCGCGACG GGCCCGGCCG GGGCAGATGG AGCAACCGGA GCGACCGGTC CGGCTGGTGC GACAGGCGCC GCGGGTGCGA CTGGTGCTAC AGGCCCGACG GGCGCTGCGG GTGTGACCGG CCCGACCGGT GCTGCGGGTG CGACCGGGGC AACCGGTTCG GCTGGCGCCA CAGGTGCGAC CGGCGCAGAT GGAGCAACCG GCGCGACCGG TGCAGAGGGA GCAACCGGAG CGACCGGTCC GGCTGGCGCG ACAGGCGCCG CGGGTGCGAC TGGTGCTACA GGCCCGACGG GCGCTGCGGG TGTGACCGGC CCGACCGGTG CTGCGGGTGC GACCGGGGCA ACCGGTTCAG CTGGCGCTAC AGGTGCGACC GGCGCGGCTG GCGCTACAGG TGCGACCGGC GCTGTGGGTG CTACTGGGGC GACGGGCCCG GCCGGCGCAG AGGGAGCAAC CGGAGCGACC GGTCCGGCTG GCGCGACAGG CGCCGCGGGT GCGACTGGTG CTACAGGCCC GACCGGTGCT GCGGGTGTGA CCGGGGCGAC CGGTCCGGCT GGCGCCACGG GCGCGACCGG CGCCACCGGA CCGATTGGTC CAACGGGCGC CACCGGAGCC ACCGGAACCG TCGATTGCTT CGTTGTCGGG ACCCGTCTGC TTACCCCGCA GGGCGAGCGG CTCATCGAGG ATCTCGCGGT CGGCGATCTC GTCACCACGG CGGACGGCGC GCCGCGGCCG ATCATCTGGA TCGGGCGCCG CCGTGTCCGG ATCGACACCC ATCCGCAAGC CGATCTGGTG CGGCCGGTCT GGATCCAGGC CGAAGCCGTG GCGCCCGGGA TCCCGCAGCG CGACATGGTG CTGTCGCCGG GCCACGGCGT GTTCTTCGAC GGTCACCTGA TCCCGATCGG CTGCCTCGTG AACGGGCAGA CGATCCGCAC CGTGCCCTGC GCGGAGGTGG AGTACATGCA TGTCGAGCTC GACCTGCATG ACATCGTGCT GGCCGAAGGT CTCCCCTGCG AGAGCTACCT GGAGTCCGGT CGCCGGTCCG ACTTCGCCGA TCAGGGCGGG GTGACGACCC TGCATCCGAC GTTCATGCCG CTGAATTACG AGGCGGCGTG CGCGCCCTTC GCCATCGCAG GTCCTGCCCT GGACGCCGCC CGGGCGCAGA TCGAGGCACG CGCCGTCGCC TGCGAGGCGG AAGCGGAGGC CCTACCAGCA CGCGACGGGG AGCGGGCGGA CCAGCGCGTC GAGGAAGTGC ACGAGCGGAA CCTTCCGCTC GTGGCCCGAT AG
|
Protein sequence | MANGTITVTG SFSVPLVSGY TAPGVVYLYN TTSGGFSAGN YLGTTQYVYR PTDPDNATTA FFSLTTTLPV GSSYQLTAVL EAVSGSPPAT ALTSASVNNS NGTDAITTAF TQNAGTDIAN AVANKYSTIL YNGVDVTGSP GPTGPAGPAG ATGAAGATGA TGPAGATGAT GADGAAGATG ATGATGAAGA TGATGPAGAT GATGADGATG ATGADGATGA TGATGVTGPA GATGAAGATG ATGPTGAAGV TGPTGAAGAT GATGSAGATG ATGAAGATGA TGAVGATGAT GPAGADGATG ATGPAGATGA TGAAGATGAT GPAGADGATG ATGPAGATGA AGATGATGPT GAAGVTGPTG AAGATGATGS AGATGATGAD GATGATGAEG ATGATGPAGA TGAAGATGAT GPTGAAGVTG PTGAAGATGA TGSAGATGAT GAAGATGATG AVGATGATGP AGAEGATGAT GPAGATGAAG ATGATGPTGA AGVTGATGPA GATGATGATG PIGPTGATGA TGTVDCFVVG TRLLTPQGER LIEDLAVGDL VTTADGAPRP IIWIGRRRVR IDTHPQADLV RPVWIQAEAV APGIPQRDMV LSPGHGVFFD GHLIPIGCLV NGQTIRTVPC AEVEYMHVEL DLHDIVLAEG LPCESYLESG RRSDFADQGG VTTLHPTFMP LNYEAACAPF AIAGPALDAA RAQIEARAVA CEAEAEALPA RDGERADQRV EEVHERNLPL VAR
|
| |