Gene Information |
Locus tag | Mext_2964 |
Symbol | |
ID | 5830996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 3311400 |
End bp | 3312710 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641368764 |
Product | thymidine phosphorylase |
Protein accession | YP_001640424 |
Protein GI | 163852381 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase; [TIGR02644] pyrimidine-nucleoside phosphorylase |
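
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(3311400, 3312710)
print(length_bp, protein_length(length_bp))  # prints: 1311 436
```

Both results agree with the Gene Length (1311 bp) and Protein Length (436 aa) fields.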
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.277231 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCTCC CCCAGGAGAT CATCCGCGAC AAGCGCGACG GACACGCCCT GTCCGAGGCC GACATCGCCG GCTTCATCGA GGGGCTGACG CGGGATCAGG TTACCGAGGG GCAGGCCGCG GCGTTTGCCA TGGCGGTGTT CTTCCGCGGC CTCTCGCTCG ACGAGCGCGT GGCGCTGACC CGCGCCATGA CGCATTCGGG CACGGTGCTC GCGTGGGATC TGCCCGGCCC CGTCCTCGAC AAGCACTCCA CCGGCGGCGT CGGCGACACC GTGAGCCTTC CGCTCGCCGC GATGGTCGCG GCCTGCGGCG GCTACGTGCC GATGATCTCC GGGCGCGGCC TGGGGCATAC CGGCGGCACG CTGGACAAGC TCGCGAGCAT TCGCGGCTAC GACGTGACGC CGGGCCTCGA CCGGTTCCGC AAGGTCACGG CCGAGGCCGG TTGCGCCATC ATCGGCCAGA CCGCCGAGCT CGCGCCGGCC GACCGGCGCC TCTACGCGAT CCGCGACGTC ACCGGGACGG TGGAATCGCT CGATCTCATC ACCGCGTCGA TCCTGTCGAA GAAGCTCGCG GCGGGGCTCC AGGGCCTCGT CATGGATGTG AAGACCGGCT CCGGCGCCTT CATGGCCCGT CGCGAGGATG CGCGGGCGCT TGCCGAGAGC ATCGTAACGG TCGCCAACGC GGCGGGACTG CGCACGCGCG CGCTCATCAC CGACATGGAC GCCCCGCTGG CCTCCTGCGC CGGCAACGCG GTCGAGGTGG CCTATGCCGT CGATTACCTC ACCGGTCGCG CCCGCGAACC GCGCTTCCAC GCCGTGACGA TGGCGCTGGC CGTCGAGATG CTGGTGATCG GGGGGCTGGC TGCAAACATC GCGGAGGCGG AAGGGAAGCT GACGGCGGCG CTCGATTCGG GCCGGGCCTG CGAGGCCTTC GCGCGGATGG TGGCGGCGCT GGGCGGCCCC GGCGATTTCG TCGAGGCGGC ATACCGGCAC CTGCCCCCGG CGCCGGTCGT GCGGCCGGTG CCGGCGCAGG AAGCGGGCGT GGTGGAGGCG GTGGAGACCC GCGCCATCGG CCTCGCGGTG ATTGGCCTCG GCGGAGGGCG CACCCGGCCG CAGGATGCCA TCGACGCCCG CGTCGGTTTC ACCGGCCTCG CGCGGCCGGG CGACCGACTG TCACCGAACG ATCCGATCGG CATCGTCCAC GCCGCGGACG AGGCCGCCGC CGAGCGGGCA GCGGCATCGC TGCGGGCGGC CTATCGGATC GGCGCAACGC CGCCGGAGCC ACGCGCGGCC GTACTGGAGC GCTTGGGCTG A
|
Protein sequence | MRLPQEIIRD KRDGHALSEA DIAGFIEGLT RDQVTEGQAA AFAMAVFFRG LSLDERVALT RAMTHSGTVL AWDLPGPVLD KHSTGGVGDT VSLPLAAMVA ACGGYVPMIS GRGLGHTGGT LDKLASIRGY DVTPGLDRFR KVTAEAGCAI IGQTAELAPA DRRLYAIRDV TGTVESLDLI TASILSKKLA AGLQGLVMDV KTGSGAFMAR REDARALAES IVTVANAAGL RTRALITDMD APLASCAGNA VEVAYAVDYL TGRAREPRFH AVTMALAVEM LVIGGLAANI AEAEGKLTAA LDSGRACEAF ARMVAALGGP GDFVEAAYRH LPPAPVVRPV PAQEAGVVEA VETRAIGLAV IGLGGGRTRP QDAIDARVGF TGLARPGDRL SPNDPIGIVH AADEAAAERA AASLRAAYRI GATPPEPRAA VLERLG
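
The sequences above are printed in blocks of ten characters, so the spaces have to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction and check for a terminal stop codon. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand); the helper names are hypothetical, and the stop codon set is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons ending in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy inputs: the first two blocks and the last two blocks of the gene sequence.
head = clean_sequence("ATGAGGCTCC CCCAGGAGAT")
tail = clean_sequence("GCTTGGGCTG A")
print(gc_fraction(head))          # GC fraction of the first 20 bases only
print(tail[-3:] in STOP_CODONS)   # the record's sequence ends in TGA
```

Passing the complete gene sequence to `gc_fraction` and `ends_with_stop` would allow a comparison against the GC content and length fields of the record.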
|
| |