Gene Information |
Locus tag | Nham_1900 |
Symbol | |
ID | 4033038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | - |
Start bp | 2115175 |
End bp | 2116767 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637970366 |
Product | thymidine phosphorylase |
Protein accession | YP_577168 |
Protein GI | 92117439 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02645] putative thymidine phosphorylase |
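The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2115175, 2116767)
print(length)                     # 1593
print(protein_length_aa(length))  # 530
```

Because the gene is on the minus strand, the start and end positions delimit the region on the replicon; the coding sequence itself reads as the reverse complement of that region.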
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGTTC CGGGATGCAC GTTAAGGCTG ATCCATTATG TCAGTCCGCA GGTCCTAAAC ATGAATGGTA CCGATCTTCC TCGGCTTCAG CCGAAGATCC GTCGTGTCAA TCTCGATACC GGCCGCGAAA ATGTCGTCGT CATATCGCGG CACTCTGCAG CCCTACGACC GGAAATATTT CGGGGCTTCA GTCGCGTGGA GCTCCGTCGG AATGCCAAGA TCATGTTGGC CACGCTCATT ATTACCGACG ACGACTCGTT GGTGGGCCCT GACGACCTAG GACTTTCCGA GCCGGCATTC CGGCGCTTCG CAGAGCCCGT GGGCAGCGCG GTAACAATAG CGCCGGCAGC ATCTCCGGCC AGCCTCGATG CGGTGCGCGC CAAAATCATG GGGCAAACCT TCAGCGCAGT CGACATCAGC GCAATTATCG ACGATCTCAC CCATTATCGG TACTCCGACA TGGAGATAGC TGCATTCCTG ATCAGTTCTG CCAGCTTCAT GACAAATGGC GAACTGATTG CTTTGGTCGA TTCGATGGCC CGGGCCGGTA CGCAACTCAA ATGGAGAAAT CCGATTATTG TCGACAAACA TTGCATCGGT GGCATTCCCG GAAATCGCAC ATCCATGATT GTGGTGCCGA TCGTGGCAGC TCACGGCCTC ACGATTCCAA AGACGTCGTC CCGAGCTATT ACGTCCCCCG CCGGTACGGC AGATACGATG GAAATGCTGG CGCGTGTCGA TGTCGGCGTT GAAGAAATGA AAGACATCGT AGCTGCATGC CGCGGCTGTC TGGTGTGGGG CGGACATGTC AATCTGTCCC CGGCGGACGA CATCCTGATT TCTGTTGAGC GGCCACTTGG CCTCGATACT CGCGAGCAAA TGGTAGCTTC TATTCTGTCA AAAAAGCTCG CCGCCGGCTC AACCCATCTC CTGATTGACT TGCCTGTCGG TCCGACCGCC AAGCTCGTCA ACGAGATGGA AGCGATGAGG CTCCGCAAAC TGTTCGAATT CGTCGGCGAT CATTATGGAA TATCCGTCGA GGTTGTCGTT ACCGATGGTC GCCAGCCGAT CGGCAATGGC ATTGGTCCCG TTCTTGAGGC GCAGGATGTT ATGGCGGTTC TAGCCAATGA TCCGGAAGCA CCAGCAGACC TGCGCGAAAA ATCTTTGCGG CTTGCCGCAC ACTTACTCGA ATATGACCCC AAGCTGCGTG GCGGCAGCGG TTATGCACGC GCCCGCGAAC TGCTCGATAG TGGCGCCGCG CTCAAACAAA TGCAAAAGAT CATCGACGCG CAAGGGCCTC CGACCTGTTG CACAGATTTG GGGAATTTGA CGTTCGATGT CACAGCCTCA CGCGATGGCT TCGTTTCAGG CATCAACTGC CTGCAGTTGA ACCGGCTTGC ACGAATCGCG GGAGCGCCCA TCGATAAAGG CGCCGGCATC AGACTATTCA AGAAAATTGG CGACCGTGTC CAGCAAGGAG AGCCGCTTTA CCGCATCCAT GCGTTCGAGC GGTCGGGGCG CGATCTTGCC GCCGCCGGCA CAACGGCCTA CACGATCGAC AGCGAGGAAT CCAACCTGGA AGCGACGCCG TGA
|
Protein sequence | MRVPGCTLRL IHYVSPQVLN MNGTDLPRLQ PKIRRVNLDT GRENVVVISR HSAALRPEIF RGFSRVELRR NAKIMLATLI ITDDDSLVGP DDLGLSEPAF RRFAEPVGSA VTIAPAASPA SLDAVRAKIM GQTFSAVDIS AIIDDLTHYR YSDMEIAAFL ISSASFMTNG ELIALVDSMA RAGTQLKWRN PIIVDKHCIG GIPGNRTSMI VVPIVAAHGL TIPKTSSRAI TSPAGTADTM EMLARVDVGV EEMKDIVAAC RGCLVWGGHV NLSPADDILI SVERPLGLDT REQMVASILS KKLAAGSTHL LIDLPVGPTA KLVNEMEAMR LRKLFEFVGD HYGISVEVVV TDGRQPIGNG IGPVLEAQDV MAVLANDPEA PADLREKSLR LAAHLLEYDP KLRGGSGYAR ARELLDSGAA LKQMQKIIDA QGPPTCCTDL GNLTFDVTAS RDGFVSGINC LQLNRLARIA GAPIDKGAGI RLFKKIGDRV QQGEPLYRIH AFERSGRDLA AAGTTAYTID SEESNLEATP
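The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a minimal sketch, not the database's own pipeline. It assumes the space-separated 10-character blocks are display formatting only, and it hard-codes the amino-acid assignments of NCBI translation table 11, ignoring that table's alternative start codons. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest, order T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 up to the first stop codon.

    Raises KeyError on codons with characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGAGAGTTC CGGGATGCAC GTTAAGGCTG"
print(translate(first_blocks))  # MRVPGCTLRL
```

Passing the full gene sequence to `translate` and `gc_fraction` gives a way to compare against the protein sequence and GC content reported in this record.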