Gene Nham_1900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_1900 
Symbol 
ID4033038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp2115175 
End bp2116767 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content57% 
IMG OID637970366 
Productthymidine phosphorylase 
Protein accessionYP_577168 
Protein GI92117439 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02645] putative thymidine phosphorylase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGTTC CGGGATGCAC GTTAAGGCTG ATCCATTATG TCAGTCCGCA GGTCCTAAAC 
ATGAATGGTA CCGATCTTCC TCGGCTTCAG CCGAAGATCC GTCGTGTCAA TCTCGATACC
GGCCGCGAAA ATGTCGTCGT CATATCGCGG CACTCTGCAG CCCTACGACC GGAAATATTT
CGGGGCTTCA GTCGCGTGGA GCTCCGTCGG AATGCCAAGA TCATGTTGGC CACGCTCATT
ATTACCGACG ACGACTCGTT GGTGGGCCCT GACGACCTAG GACTTTCCGA GCCGGCATTC
CGGCGCTTCG CAGAGCCCGT GGGCAGCGCG GTAACAATAG CGCCGGCAGC ATCTCCGGCC
AGCCTCGATG CGGTGCGCGC CAAAATCATG GGGCAAACCT TCAGCGCAGT CGACATCAGC
GCAATTATCG ACGATCTCAC CCATTATCGG TACTCCGACA TGGAGATAGC TGCATTCCTG
ATCAGTTCTG CCAGCTTCAT GACAAATGGC GAACTGATTG CTTTGGTCGA TTCGATGGCC
CGGGCCGGTA CGCAACTCAA ATGGAGAAAT CCGATTATTG TCGACAAACA TTGCATCGGT
GGCATTCCCG GAAATCGCAC ATCCATGATT GTGGTGCCGA TCGTGGCAGC TCACGGCCTC
ACGATTCCAA AGACGTCGTC CCGAGCTATT ACGTCCCCCG CCGGTACGGC AGATACGATG
GAAATGCTGG CGCGTGTCGA TGTCGGCGTT GAAGAAATGA AAGACATCGT AGCTGCATGC
CGCGGCTGTC TGGTGTGGGG CGGACATGTC AATCTGTCCC CGGCGGACGA CATCCTGATT
TCTGTTGAGC GGCCACTTGG CCTCGATACT CGCGAGCAAA TGGTAGCTTC TATTCTGTCA
AAAAAGCTCG CCGCCGGCTC AACCCATCTC CTGATTGACT TGCCTGTCGG TCCGACCGCC
AAGCTCGTCA ACGAGATGGA AGCGATGAGG CTCCGCAAAC TGTTCGAATT CGTCGGCGAT
CATTATGGAA TATCCGTCGA GGTTGTCGTT ACCGATGGTC GCCAGCCGAT CGGCAATGGC
ATTGGTCCCG TTCTTGAGGC GCAGGATGTT ATGGCGGTTC TAGCCAATGA TCCGGAAGCA
CCAGCAGACC TGCGCGAAAA ATCTTTGCGG CTTGCCGCAC ACTTACTCGA ATATGACCCC
AAGCTGCGTG GCGGCAGCGG TTATGCACGC GCCCGCGAAC TGCTCGATAG TGGCGCCGCG
CTCAAACAAA TGCAAAAGAT CATCGACGCG CAAGGGCCTC CGACCTGTTG CACAGATTTG
GGGAATTTGA CGTTCGATGT CACAGCCTCA CGCGATGGCT TCGTTTCAGG CATCAACTGC
CTGCAGTTGA ACCGGCTTGC ACGAATCGCG GGAGCGCCCA TCGATAAAGG CGCCGGCATC
AGACTATTCA AGAAAATTGG CGACCGTGTC CAGCAAGGAG AGCCGCTTTA CCGCATCCAT
GCGTTCGAGC GGTCGGGGCG CGATCTTGCC GCCGCCGGCA CAACGGCCTA CACGATCGAC
AGCGAGGAAT CCAACCTGGA AGCGACGCCG TGA
 
Protein sequence
MRVPGCTLRL IHYVSPQVLN MNGTDLPRLQ PKIRRVNLDT GRENVVVISR HSAALRPEIF 
RGFSRVELRR NAKIMLATLI ITDDDSLVGP DDLGLSEPAF RRFAEPVGSA VTIAPAASPA
SLDAVRAKIM GQTFSAVDIS AIIDDLTHYR YSDMEIAAFL ISSASFMTNG ELIALVDSMA
RAGTQLKWRN PIIVDKHCIG GIPGNRTSMI VVPIVAAHGL TIPKTSSRAI TSPAGTADTM
EMLARVDVGV EEMKDIVAAC RGCLVWGGHV NLSPADDILI SVERPLGLDT REQMVASILS
KKLAAGSTHL LIDLPVGPTA KLVNEMEAMR LRKLFEFVGD HYGISVEVVV TDGRQPIGNG
IGPVLEAQDV MAVLANDPEA PADLREKSLR LAAHLLEYDP KLRGGSGYAR ARELLDSGAA
LKQMQKIIDA QGPPTCCTDL GNLTFDVTAS RDGFVSGINC LQLNRLARIA GAPIDKGAGI
RLFKKIGDRV QQGEPLYRIH AFERSGRDLA AAGTTAYTID SEESNLEATP