Gene MCA1717 details


Gene Information

Locus tag: MCA1717
Symbol:
ID: 3102436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1833082
End bp: 1834632
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 64%
IMG OID: 637170878
Product: thymidine phosphorylase
Protein accession: YP_114156
Protein GI: 53803970
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02645] putative thymidine phosphorylase
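
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1833082
END_BP = 1834632


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1551, matching "Gene Length"
print(protein_length(length_bp))  # 516, matching "Protein Length"
```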


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.124603
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGACG AAGAGTCGAT CAAGACTCGC CTGAAGCTCC GGCCGGTGGC GATCGATACG 
TACCGGGAAA ACGTGGCGTA CCTGCACCGC GAATGTTCGG TGTATCGGGC GGAGGGCTTT
CAGGCGCTCG CCAAGATCCG GGTGGGCTGC AACGGCAAGC AGATCGAGGC GGTGCTGAAC
GTGGTGGACG ATGTCTGCAT CGTGGCCCCG GACGAGCTGG GCCTGTCGGA GCAGGCATTC
CAGCGTTTCG GCGAACCCGC GGGCCAGCTC GTGAACGTGG CGCAGGCCGA GCCTCCACTA
TCCATGGACG GCGTCCGGCG CAAGATCGGC GGGGAACGGC TGGATTACGG CGATTATCAG
GCCATTACCA GCGACATTGC CAAAGGGCGC TATTCCAAGA TGGAGATGGC CGCGTTCCTG
GTGGCGACCG GCCAGAACGG GTTGGATCGT GACGAAGTGC TGTCGCTGAC GCGTGCGATG
CTGGAAACCG GGGTGAGATT GAGCTGGAAT GAGCCGCTGG TGGCCGATAA ACACTGCATC
GGTGGCATTC CGGGCAACCG GACGTCGTTG TTGATCGTGC CCATCGTGGC GGCGCACGGG
ATGCTGATCC CCAAGACCTC CAGCCGCGCC ATCACTTCGC CTGCCGGGAC GGCGGACACC
ATGGAAGTGC TCGCGCGCAC CGATCTGGCG CCGGAATCAC TCGACCGGCT GGTGCGGATG
GAACGGGGTT GTCTGGCGTG GGGTGGCACC ACTCGCCTGG CTCCGGTCGA CGATATGCTC
ATCTCGGTCG AACGTCCCCT CGGCATCGAT TCTCAAGGCC AGATGGTCGC TTCTATTCTG
TCGAAGAAGT TGGCGGCCGG CGCCACCCAT CTGCTGTTGG ACATTCCGGT CGGCCCTACC
GCCAAGGTGC GGCAGATGCG CGATGCCATG AGTCTCAGGA AACTGTTCGA ATACGTCGGT
GACCGCGTCG GCCTGCATCT GGAAGCCGTG ATCACCGACG GTGCCCAGCC GGTCGGCCGC
GGCATCGGTC CGGTGCTGGA GGTGCGGGAC GTCATGCAGG TGCTGGAGAA CGATCCGGAA
GCGCCGGTCG ATCTGCGCGA AAAATCCTTG CGCCTGGCCG GTCGCATCCT CGAGTTCGAT
CCCGACGTCC GGGGCGGCTT CGGCTATTCG ATCGCTCGAG ACATCCTGGA ATCGGGCCGG
GCGCTCGCCA AGATGCACCG GATCATCGAT GCCCAGGGCC GGCAGGAGCG GCGCCTGGAG
CCCGGCAGGC TGGTCTTCGA GGTGCGGGCG GAGCGGGCCG GCGTGGTGGT CGGGATCGAC
AATTTTTTCT TGGCGCAGAC GGCCCGCCTC GCCGGCGCAC CGATGAGCCG AGGGGCAGGC
GTGGATCTGT TGAACAAGCT GGGCGATACG GTAGAGGAGG GGCAGCCGCT CTACCGGGTC
TACGCGGAAT TCCCTGCCAA TTTCGAATTC GCCCGGGAGT TCACGCGCAC GAGAAGCGGT
TACAACATCG GAGATGCCGC TTTCCTGACC AAGACTCACA TGGAGTTCTG A
 
Protein sequence
MSDEESIKTR LKLRPVAIDT YRENVAYLHR ECSVYRAEGF QALAKIRVGC NGKQIEAVLN 
VVDDVCIVAP DELGLSEQAF QRFGEPAGQL VNVAQAEPPL SMDGVRRKIG GERLDYGDYQ
AITSDIAKGR YSKMEMAAFL VATGQNGLDR DEVLSLTRAM LETGVRLSWN EPLVADKHCI
GGIPGNRTSL LIVPIVAAHG MLIPKTSSRA ITSPAGTADT MEVLARTDLA PESLDRLVRM
ERGCLAWGGT TRLAPVDDML ISVERPLGID SQGQMVASIL SKKLAAGATH LLLDIPVGPT
AKVRQMRDAM SLRKLFEYVG DRVGLHLEAV ITDGAQPVGR GIGPVLEVRD VMQVLENDPE
APVDLREKSL RLAGRILEFD PDVRGGFGYS IARDILESGR ALAKMHRIID AQGRQERRLE
PGRLVFEVRA ERAGVVVGID NFFLAQTARL AGAPMSRGAG VDLLNKLGDT VEEGQPLYRV
YAEFPANFEF AREFTRTRSG YNIGDAAFLT KTHMEF
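
The gene and protein sequences can be checked against each other by translating the DNA. The following is a minimal sketch, not the method used to produce this record. It assumes the spaces in the sequences are display formatting only, and it relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the two tables differ in which start codons they permit, which does not matter here because this gene begins with ATG). All function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 assigns the same
# amino acids to codons as the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of each sequence, copied from the record above.
first_gene_line = "ATGTCCGACG AAGAGTCGAT CAAGACTCGC CTGAAGCTCC GGCCGGTGGC GATCGATACG"
first_protein_line = "MSDEESIKTR LKLRPVAIDT"

print(translate(first_gene_line) == clean(first_protein_line))
```

Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the full protein sequence and the GC content listed under Gene Information.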