Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_3190 |
Symbol | |
ID | 7114189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | - |
Start bp | 3371033 |
End bp | 3372343 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643525941 |
Product | thymidine phosphorylase |
Protein accession | YP_002421956 |
Protein GI | 218531140 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.728246 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.268471 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACTTC CCCAGGAGAT CATCCGCGAC AAGCGCGACG GACACGCGCT GTCGGAGGCC GACATCGCCG GCTTCATCGA GGGGCTGACG CAGGATCAGG TCACCGAGGG GCAGGCCGCG GCGTTTGCCA TGGCAGTGTT CTTCCGCGGC CTCTCGCTCG ACGAGCGCGT GGCGCTGACC CGCGCCATGA CGCATTCGGG CACGGTGCTC GCGTGGGATC TGCCCGGCCC CGTCCTCGAC AAGCACTCCA CCGGCGGCGT CGGTGACACC GTAAGCCTTC CGCTCGCGGC GATGGTCTCC GCCTGCGGCG GCTACGTGCC GATGATCTCC GGGCGCGGCC TGGGGCATAC CGGCGGCACG CTGGACAAGC TCGCGAGCAT CCCCGGCTAC GACGTGACGC CGGGCCTCGA TCGCTTCCGC AAGGTCACGG CCGAGGCCGG CTGCGCCATC ATCGGCCAGA CCGCCGAGCT CGCGCCGGCC GACCGGCGCC TGTACGCGAT CCGCGACGTC ACCGGGACGG TGGAATCGCT CGATCTTATC ACCGCATCGA TCCTGTCGAA GAAGCTCGCG GCGGGGCTCC AGGGCCTCGT CATGGATGTG AAGACCGGCT CCGGCGCCTT CATGGCCCGC CGCGAGGATG CGCGGGCGCT TGCCGACAGC ATCGTAACGG TCGCGAACGC GGCGGGCTTG CGCACGCGCG CGCTCATCAC CGACATGGAT GCCCCGCTGG CCTCCTGCGC CGGCAACGCG GTCGAGGTGG CCTATGCCGT CGATTACCTC ACCGGTCGCG CCCGCGAACC GCGCTTCCAC GCCGTGACGA TGGCGCTGGC CGTCGAGATG CTGGTGATCG GGGGGCTGGC TGCAAACATC GCGGAGGCAG AGGGGCGGCT GACGGCGGCG CTCGAATCGG GCCGGGCCTG CGAGGCCTTC GCACGGATGG TGGCGGCGCT GGGCGGCCCC GGTGACTTCA TCGAGGCAGC GGACCGGCAC CTGCCCCGGG CGCCGGTCGT GCGGCCGGTG CCGGCGCAGG AAGCGGGCGT GGTGGAGGCG GTGGAGACCC GCGCCATCGG CCTCGCGGTG ATCGGCCTCG GCGGCGGGCG CACCCGGCCG CAGGACGCCA TCGACGCCCG CGTCGGTTTC ACCGGCCTCG CGCGGCCGGG CGACCAACTG TCACCGAACG ATCCGATCGG CATCGTCCAC GCCGCGGACG AGGCCGCCGC CGAGCGGGCA GCGGCATCGC TCCGGGCGGC CTATCGGATC GGTGCAACGC CGCCGGAGCC ACGCGCGGCC GTAGGGGAGC GTTTGGGCTG A
|
Protein sequence | MRLPQEIIRD KRDGHALSEA DIAGFIEGLT QDQVTEGQAA AFAMAVFFRG LSLDERVALT RAMTHSGTVL AWDLPGPVLD KHSTGGVGDT VSLPLAAMVS ACGGYVPMIS GRGLGHTGGT LDKLASIPGY DVTPGLDRFR KVTAEAGCAI IGQTAELAPA DRRLYAIRDV TGTVESLDLI TASILSKKLA AGLQGLVMDV KTGSGAFMAR REDARALADS IVTVANAAGL RTRALITDMD APLASCAGNA VEVAYAVDYL TGRAREPRFH AVTMALAVEM LVIGGLAANI AEAEGRLTAA LESGRACEAF ARMVAALGGP GDFIEAADRH LPRAPVVRPV PAQEAGVVEA VETRAIGLAV IGLGGGRTRP QDAIDARVGF TGLARPGDQL SPNDPIGIVH AADEAAAERA AASLRAAYRI GATPPEPRAA VGERLG
|
| |