Gene Anae109_0662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_0662 
Symbol 
ID5375799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp744940 
End bp746244 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content75% 
IMG OID640842172 
Productpyrimidine-nucleoside phosphorylase 
Protein accessionYP_001377859 
Protein GI153003534 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.132104 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGCCT ACGAGATCAT CCACCGCAAG CGCGACGGAC GGGCCATCCC GCCCGCCGCG 
ATCGCCGCGC TCGTGGACGG CTTCACGACG GGCGAGATCC CCGACTACCA GATGGCCGCC
TTCTGCATGG CGGTGTTCTT CCGCGGCATG GACGAGGTCG AGGTGCGCGC CCTCACCGAG
GCGATGCTGC GCTCGGGCGA CGTGCTCGAT CTCTCGGACA TCCCCGGCGC GAAGGTCGAC
AAGCACTCCA CGGGAGGCGT CGGCGACAAG GTCTCGCTCG CGCTCGCGCC GCTCGCCGCC
GCCTGCGGCG TCAAGGTCCC CATGATCTCC GGGCGAGGCC TCGGCCACAC CGGCGGGACG
CTCGACAAGC TGGAGGCGAT CCCGGGGTTC CGCGTGGACC TGCCGGTGGG GAGGTTCCGC
GAGCTCGTCC GCGACGTGGG CGCCTGCCTC GTCGGCCAGA CCGAGCGGCT CGCCCCGGCG
GACCGGAAGC TCTACGCCCT GCGCGACGTC ACCGCGACCG TCGAGTCGAT CCCCCTCATC
GCCGCGTCGA TCATGTCGAA GAAGCTCGCG GAGGGGATCG ACGCGCTCGT CCTCGACGTG
AAGGTCGGCT CCGGCGCGTT CATGAAGCGG CTCGACGACG CGCGGACGCT GGCGGCGACG
CTCGCGGGGA TCGGGCGCCG GATGGGCAAG CGCGTCTCGG CGCTCCTCAC GCGCATGGAC
GAGCCGCTCG GCCGCGCCGT CGGCAACGCC CTCGAGGTGG CGGAGACGGT GGCGCTCCTC
TCCGGCGGCG GCCCCGAGGA CCTGCGCGAG GTGACCGTGG AGCTCACCGC CGAGATGCTC
GTGCTCGGTG GCGCCGCGGC GGACCTCGCC GCCGGCCGGG CGCGGGTCGC GGCGGCCATC
GCCGACGGGC GGGGGCTCGC GAAGCTGGAG GAGATCGTGC GCGCGCAGGG GGGTGACGCC
GCCGTGCTCC GCGATCCGGA GCGGCTGCCG CGCGCGCCCG TGCGGTACGA CGTCCCCTCG
CCCGCGGCGG GGTTCGTCGC CGAGCTCGAC GCCGAGGCCA TCGGGCTGGC CGCCGTCGCG
CTCGGGGCCG GCCGCGCGCG CGTCGAGGAC CGGATCGACC CGGCGGTGGG CGTCGTCGTG
GCGAAGAAGC TCGGCGATCG GGTCGAGCGG GGTGAGCCGC TCTGCACGGT GCACGCGGGA
GAGGGCAGCG AGTCCCGGGA GCGGGTCACC GCTCGGCTCG CCGGCGCGTA CCGGATCGGG
CCGGTCGCGC CGGCCCAGCG GCCGCTGGTG CTCGAGCGGC TCTGA
 
Protein sequence
MRAYEIIHRK RDGRAIPPAA IAALVDGFTT GEIPDYQMAA FCMAVFFRGM DEVEVRALTE 
AMLRSGDVLD LSDIPGAKVD KHSTGGVGDK VSLALAPLAA ACGVKVPMIS GRGLGHTGGT
LDKLEAIPGF RVDLPVGRFR ELVRDVGACL VGQTERLAPA DRKLYALRDV TATVESIPLI
AASIMSKKLA EGIDALVLDV KVGSGAFMKR LDDARTLAAT LAGIGRRMGK RVSALLTRMD
EPLGRAVGNA LEVAETVALL SGGGPEDLRE VTVELTAEML VLGGAAADLA AGRARVAAAI
ADGRGLAKLE EIVRAQGGDA AVLRDPERLP RAPVRYDVPS PAAGFVAELD AEAIGLAAVA
LGAGRARVED RIDPAVGVVV AKKLGDRVER GEPLCTVHAG EGSESRERVT ARLAGAYRIG
PVAPAQRPLV LERL