Gene Noc_0804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0804 
Symbol 
ID3707070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp876266 
End bp877225 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content55% 
IMG OID637737306 
Productthiamine-monophosphate kinase 
Protein accessionYP_342847 
Protein GI77164322 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.914964 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATGAGT TCTCCCTGAT TGAAAATTTT TTTGCGGATT GTACTCAGAA ACGGGAAGAC 
GTTGCGCTGG CGGTGGGTGA CGATTGCGCC TTGATGACTG TCCCTCCAGG TTGTGAATTG
GCGGTTTCTA TTGATACGTT AGTAGCCGGG GTGCACTTTA CTGCCGAGGT GGATTCCGCC
GCTTTGGGGC ACAAAGCGCT GACGGTAGGA TTGAGCGATC TTGCTGCTAT GGGGGCAGAA
CCGGCCTGGG CGACTTTGGC GTTGACTCTG CCAGAGCTCG ACAGAGCTTG GCTGGCTGGG
TTTACTCAAG GGTTAAGCAA GCTTGCCAGA AGCTACGGTG TGCAATTGGT AGGGGGAGAT
ACCACTCGGG GGCCGCTGGC GGTCACTATG CAGTTGCATG GTTTCGTGCC TCGGGGTAAA
GCCCTGAGGC GTGATGGAGC ACGTCCCGGT GATGGAATTT ACGTAACGGG AACTTTGGGT
GATTCTGGCC TTGCCCTTCA AGCGCGATTG GAAGGTCTCC AGTTATCCCA GGAGGCTTTA
TGCTATGTTG AGCATCGCCT GGATTGGCCA CAGCCTCGGG TACATGAAGC CTTGGCGCTT
CGTCCTCTCG CCCATGCTGC TATCGATATC TCAGATGGTC TCGCAGCAGA TTTGGGACAT
ATCCTGAAAG GCAGCGGTGT TGGTGCGGCG GTTGAAGTAG AGGCTTTGCC GCTTTCAGAT
TCCTTTCGTG CTTCTCTTGA GTTGGAGCAA GCCTGGGCCT TGGCGCTAAC CGCAGGCGAT
GACTACGAAT TGTGTGTGAC CGCGCCTGCC GAATACCATG ACCGGATACA GGCGGTGCTC
TCGGATCGGG GTTGTCCCTG CACCTTGATT GGAACGATTG AAGAGGAGCC AGGCTTCCGT
TGCCGCCGCC GGAATGGAGC TTCATTTATT CCCCAACAGC AGGGTTACCG TCATTTTTAG
 
Protein sequence
MNEFSLIENF FADCTQKRED VALAVGDDCA LMTVPPGCEL AVSIDTLVAG VHFTAEVDSA 
ALGHKALTVG LSDLAAMGAE PAWATLALTL PELDRAWLAG FTQGLSKLAR SYGVQLVGGD
TTRGPLAVTM QLHGFVPRGK ALRRDGARPG DGIYVTGTLG DSGLALQARL EGLQLSQEAL
CYVEHRLDWP QPRVHEALAL RPLAHAAIDI SDGLAADLGH ILKGSGVGAA VEVEALPLSD
SFRASLELEQ AWALALTAGD DYELCVTAPA EYHDRIQAVL SDRGCPCTLI GTIEEEPGFR
CRRRNGASFI PQQQGYRHF