Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA1717 |
Symbol | |
ID | 3102436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 1833082 |
End bp | 1834632 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637170878 |
Product | thymidine phosphorylase |
Protein accession | YP_114156 |
Protein GI | 53803970 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02645] putative thymidine phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.124603 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGACG AAGAGTCGAT CAAGACTCGC CTGAAGCTCC GGCCGGTGGC GATCGATACG TACCGGGAAA ACGTGGCGTA CCTGCACCGC GAATGTTCGG TGTATCGGGC GGAGGGCTTT CAGGCGCTCG CCAAGATCCG GGTGGGCTGC AACGGCAAGC AGATCGAGGC GGTGCTGAAC GTGGTGGACG ATGTCTGCAT CGTGGCCCCG GACGAGCTGG GCCTGTCGGA GCAGGCATTC CAGCGTTTCG GCGAACCCGC GGGCCAGCTC GTGAACGTGG CGCAGGCCGA GCCTCCACTA TCCATGGACG GCGTCCGGCG CAAGATCGGC GGGGAACGGC TGGATTACGG CGATTATCAG GCCATTACCA GCGACATTGC CAAAGGGCGC TATTCCAAGA TGGAGATGGC CGCGTTCCTG GTGGCGACCG GCCAGAACGG GTTGGATCGT GACGAAGTGC TGTCGCTGAC GCGTGCGATG CTGGAAACCG GGGTGAGATT GAGCTGGAAT GAGCCGCTGG TGGCCGATAA ACACTGCATC GGTGGCATTC CGGGCAACCG GACGTCGTTG TTGATCGTGC CCATCGTGGC GGCGCACGGG ATGCTGATCC CCAAGACCTC CAGCCGCGCC ATCACTTCGC CTGCCGGGAC GGCGGACACC ATGGAAGTGC TCGCGCGCAC CGATCTGGCG CCGGAATCAC TCGACCGGCT GGTGCGGATG GAACGGGGTT GTCTGGCGTG GGGTGGCACC ACTCGCCTGG CTCCGGTCGA CGATATGCTC ATCTCGGTCG AACGTCCCCT CGGCATCGAT TCTCAAGGCC AGATGGTCGC TTCTATTCTG TCGAAGAAGT TGGCGGCCGG CGCCACCCAT CTGCTGTTGG ACATTCCGGT CGGCCCTACC GCCAAGGTGC GGCAGATGCG CGATGCCATG AGTCTCAGGA AACTGTTCGA ATACGTCGGT GACCGCGTCG GCCTGCATCT GGAAGCCGTG ATCACCGACG GTGCCCAGCC GGTCGGCCGC GGCATCGGTC CGGTGCTGGA GGTGCGGGAC GTCATGCAGG TGCTGGAGAA CGATCCGGAA GCGCCGGTCG ATCTGCGCGA AAAATCCTTG CGCCTGGCCG GTCGCATCCT CGAGTTCGAT CCCGACGTCC GGGGCGGCTT CGGCTATTCG ATCGCTCGAG ACATCCTGGA ATCGGGCCGG GCGCTCGCCA AGATGCACCG GATCATCGAT GCCCAGGGCC GGCAGGAGCG GCGCCTGGAG CCCGGCAGGC TGGTCTTCGA GGTGCGGGCG GAGCGGGCCG GCGTGGTGGT CGGGATCGAC AATTTTTTCT TGGCGCAGAC GGCCCGCCTC GCCGGCGCAC CGATGAGCCG AGGGGCAGGC GTGGATCTGT TGAACAAGCT GGGCGATACG GTAGAGGAGG GGCAGCCGCT CTACCGGGTC TACGCGGAAT TCCCTGCCAA TTTCGAATTC GCCCGGGAGT TCACGCGCAC GAGAAGCGGT TACAACATCG GAGATGCCGC TTTCCTGACC AAGACTCACA TGGAGTTCTG A
|
Protein sequence | MSDEESIKTR LKLRPVAIDT YRENVAYLHR ECSVYRAEGF QALAKIRVGC NGKQIEAVLN VVDDVCIVAP DELGLSEQAF QRFGEPAGQL VNVAQAEPPL SMDGVRRKIG GERLDYGDYQ AITSDIAKGR YSKMEMAAFL VATGQNGLDR DEVLSLTRAM LETGVRLSWN EPLVADKHCI GGIPGNRTSL LIVPIVAAHG MLIPKTSSRA ITSPAGTADT MEVLARTDLA PESLDRLVRM ERGCLAWGGT TRLAPVDDML ISVERPLGID SQGQMVASIL SKKLAAGATH LLLDIPVGPT AKVRQMRDAM SLRKLFEYVG DRVGLHLEAV ITDGAQPVGR GIGPVLEVRD VMQVLENDPE APVDLREKSL RLAGRILEFD PDVRGGFGYS IARDILESGR ALAKMHRIID AQGRQERRLE PGRLVFEVRA ERAGVVVGID NFFLAQTARL AGAPMSRGAG VDLLNKLGDT VEEGQPLYRV YAEFPANFEF AREFTRTRSG YNIGDAAFLT KTHMEF
|
| |