Gene Information |
Locus tag | Mchl_5071 |
Symbol | |
ID | 7118945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 5428098 |
End bp | 5429495 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643527765 |
Product | triple helix repeat-containing collagen |
Protein accession | YP_002423764 |
Protein GI | 218532948 |
COG category | |
COG ID | |
TIGRFAM ID | |
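The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed values) and that the CDS ends with a single stop codon. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 5428098, 5429495
gene_len = gene_length_bp(start_bp, end_bp)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1398 465
```

Both results agree with the Gene Length and Protein Length fields in the record.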
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0278582 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.141411 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCTGC CGATCAATTT TAATAATACG GTCTATGCTC CTGGATTCCT TGGTACCGAT GACGGCGGGG CATCTGGTAA TTTCCAAGTC GATGCTGCCT CCACCTACTC TGTCAGCGTC ACGGGGACGG TCAACGGACC GGGTAGCACG GTCAGCTTGA CTTACGGGGC CGACGCGCCA GCCGCATTTG CCGGAAAATC GGTGCAGCTC ACTGCAACCC AATTCGACAA TTCGAATATG ATATTGTTTA CCAGCAATGC CATTCCTCCC GGAGAGACGG ACCAGGGGAA TTATCGATAC ATTCTGTCGA ATACCCAGTT GTACGGATCA AATCCTCCGG CCGGGACAAC TCGGACGCGC TTTACAGCGG ACACCAATAA TGCCCTCGGT GATTACGCAG TAGCGGCGAC GGGCGGCACG GGAGCAACGG GCGGCACCGG AGCCACGGGC GGCACGGGAG CAACGGGCGG CACCGGAGCC ACGGGCGGCA CGGGAGCAAC GGGCGGCACC GGAGCCACGG GCGGCACGGG AGCAACGGGC GGCACCGGAG CCACGGGCGG CACCGGAGCA ACGGGCGGCA CCGGAGCCAC GGGCGGCACC GGAGCCACGG GCGGCACGGG AGCAACGGGC GGCACGGGAG CAACGGGCGG CACGGGAGCA ACGGGCGGCA CGGGAGCAAC GGGCGGCACG GGAGCCACCG GCGGAACCGG AGCGACCGGT GGCACAGGAG CCACGGGCAG CACCGGGGCC ACCGGCGGTA CCGGTCCCGT CATCTGCTTC ACGCCCGGCA CCCGCATCGC GACGCCGGAT GGCGAGCGCG CGATCGAGCA CCTGCAGCCC GGCGATGTCG TGAGCCTCGC CGACGGCGCC GTCGCCACCG TACGCTGGAT CGGCCGTCGC TTCCTCGATC TGCGGACGCA TCCGCAGCCC ACCACCGCTC ACCCCGTGCG GATCGCCGCC GGCGCCTTCG GTCAGAGCCT GCCAGTGCGG GACCTCATCG TCTCGCCCGG CCACGGCCTC TACTGCGACG GCGTTCTCAT CCCCGCGATC TGCCTCGTCA ACGACCGCAC GATCACACGG GTTGAGGTCA CGTCGGTCGA ATACCTGCAC GTCGAGTTGG AGCGGCATGC ACTCCTACTG GCCGAGGGGC TGCCGACGGA AAGCTATCTC GACGTGGACA ACCGCGGCTT CTTCGAGAAC GGCGGAGCGC CGCTGATCCT GCACCCGACC TTCGCGGCGA TGGCGCATGA GGGGGGCTGT GCGCCCTACG TGATTGCCGG GGCCAAGCTG CGAACGGTGC GGGCGCAACT GGAGCGTCAG GCCGACATCT GGGAGGCGCA GCGGCAGCCG GGTACCGGCT GGCGGGCACG TCTCGGCCTC AGCCGCCGCA CCGCGTGA
Protein sequence | MALPINFNNT VYAPGFLGTD DGGASGNFQV DAASTYSVSV TGTVNGPGST VSLTYGADAP AAFAGKSVQL TATQFDNSNM ILFTSNAIPP GETDQGNYRY ILSNTQLYGS NPPAGTTRTR FTADTNNALG DYAVAATGGT GATGGTGATG GTGATGGTGA TGGTGATGGT GATGGTGATG GTGATGGTGA TGGTGATGGT GATGGTGATG GTGATGGTGA TGGTGATGGT GATGGTGATG GTGATGSTGA TGGTGPVICF TPGTRIATPD GERAIEHLQP GDVVSLADGA VATVRWIGRR FLDLRTHPQP TTAHPVRIAA GAFGQSLPVR DLIVSPGHGL YCDGVLIPAI CLVNDRTITR VEVTSVEYLH VELERHALLL AEGLPTESYL DVDNRGFFEN GGAPLILHPT FAAMAHEGGC APYVIAGAKL RTVRAQLERQ ADIWEAQRQP GTGWRARLGL SRRTA
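The sequences above are printed in space-separated groups of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate a coding sequence. It rests on these assumptions: the codon assignments are those of the standard code, which translation table 11 shares (table 11 differs in its set of permitted start codons); alternative start codons are not handled, which does not matter here because the gene begins with ATG; and all function names are hypothetical helpers rather than any database API.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; table 11
# uses the same codon-to-amino-acid assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def ungroup(seq: str) -> str:
    """Remove the whitespace used to print the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = ungroup(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = ungroup(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three printed groups of the gene sequence in this record.
first_groups = "ATGGCGCTGC CGATCAATTT TAATAATACG"
print(translate(first_groups))  # MALPINFNNT
```

The translated prefix matches the first ten residues of the protein sequence listed in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.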