Gene Tmz1t_0088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0088 
Symbol 
ID7083471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp98951 
End bp100450 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content71% 
IMG OID643697135 
ProductAMP nucleosidase 
Protein accessionYP_002353784 
Protein GI217968550 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0775] Nucleoside phosphorylase 
TIGRFAM ID[TIGR01717] AMP nucleosidase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGGCC ACGCCCCCCA CTTCGACGTC GAGGACTTCG CCGATCCGGC CGCGGCGCTG 
GCCCGGGTGC ACGAGATCTA CGACCTCGCG GTCGACCACC TGCGCCGCGG CCTGCAGCAC
TACGTCGACG GCGCCGACAT CGGCCGCCAC GTGCGCGCCT GCTACCCGCT GCTGCGCGTG
CGCACCGACA CCGTGGCGCG CGCCGATTCG CGGCTGTCCT ACGGCTTCGT CGCCGGGCCG
GGCGTGTTCG AGACCACGCT GACCCGGCCC GACCTGTTCG CGGACTACTA CCTGGAGCAA
TTCCGCCTGC TGGTGCAGAA CCACGGCGTA GCCTTGCAGG TGGGCAGCAG CACGCAGCCC
ATCCCGGTGC ATTTCGCCCT GCCCGAGCAC GACTACCTGG AAGGCCACCT CGGCCCCGAG
CGCCGCCGCC TGCTGCGCGA TCACTTCGAC CTGCCCGACT TGGGCGCCAT GGACGACGGC
ATCGCCAACG GCACCTTCGA GCCCGGCCCC GGCGAACCGC ATCCGCTCGC GCTGTTCACC
GCGCCGCGGG TGGATTACTC GCTGCACCGG CTGCGCCACT ACACCGGCAC GCGGCCGGCC
TTCTTCCAGA ACTTCGTGCT GTTCACCAAC TACCAGTTCT ACATCGACGA GTTCATCCGC
CTCGGCCACG AGCTCATGGC CGACACCGCC TCCGGCCACG GCTACGAGGC CTTCGTCGAG
CCGGGCAACG TGCTCACCCG CCGCGCCGAC CTTCCGCCGC AGGCCGAGGA CGCCGATGGC
ACCCCGCCGC CGCGCCTGCC GCAGATGCCG GCCTACCACC TGGTGCGCGG CGACCACGCC
GGCATCACGA TGGTGAATAT CGGCGTCGGC CCGGCCAACG CCAAGACCAT CACCGACCAC
ATCGCGGTGC TGCGCCCGCA CGCCTGGATC ATGCTCGGCC ACTGCGCCGG GCTGCGCAAC
AGCCAGCATC TGGGCGACTA CGTGCTCGCC CACGGCTACG TGCGCGAGGA CCACGTGCTC
GACGAGGAGC TCCCGCCCTG GGTGCCGATC CCGCCGCTGG CCGAGGTGCA GGTCGCGCTC
GAGGCCGCCG TGGCCGAGGT CACGCAGCTG TCCGGCTACG AGCTCAAGCG CCTGATGCGC
ACCGGCACCG TCGCCAGCAC CGACAACCGC AACTGGGAGC TGCTGCCCTC GCACGGCATG
TCGAGCAGCC CGGAGCGCCG CTTCAGCCAG AGCCGCGCGG TGGCGCTCGA CATGGAATCC
GCCACCATCG CCGCCAACGG CTTCCGCTTC CGCGTGCCCT ACGGCACCCT GCTGTGCGTC
AGCGACAAGC CGCTGCACGG CGAGATCAAG CTGCCCGGCA TGGCCGACAA GTTCTACCGC
GAGCGGGTGG ACCAGCACCT GCGCATCGGC ATCCGCGCGC TCGAGCAGTT GCGCGAACAA
GGCGTCGACC GCCTGCACAG CCGCAAGCTG CGCAGCTTCG CCGAGGTGGC GTTCCAGTAG
 
Protein sequence
MNGHAPHFDV EDFADPAAAL ARVHEIYDLA VDHLRRGLQH YVDGADIGRH VRACYPLLRV 
RTDTVARADS RLSYGFVAGP GVFETTLTRP DLFADYYLEQ FRLLVQNHGV ALQVGSSTQP
IPVHFALPEH DYLEGHLGPE RRRLLRDHFD LPDLGAMDDG IANGTFEPGP GEPHPLALFT
APRVDYSLHR LRHYTGTRPA FFQNFVLFTN YQFYIDEFIR LGHELMADTA SGHGYEAFVE
PGNVLTRRAD LPPQAEDADG TPPPRLPQMP AYHLVRGDHA GITMVNIGVG PANAKTITDH
IAVLRPHAWI MLGHCAGLRN SQHLGDYVLA HGYVREDHVL DEELPPWVPI PPLAEVQVAL
EAAVAEVTQL SGYELKRLMR TGTVASTDNR NWELLPSHGM SSSPERRFSQ SRAVALDMES
ATIAANGFRF RVPYGTLLCV SDKPLHGEIK LPGMADKFYR ERVDQHLRIG IRALEQLREQ
GVDRLHSRKL RSFAEVAFQ