Gene Noc_0339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0339 
Symbol 
ID3706510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp369158 
End bp370654 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content56% 
IMG OID637736851 
Producthypothetical protein 
Protein accessionYP_342395 
Protein GI77163870 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00480998 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTATTC TCCCCCTTAA TTTATACACG GCTGCCCAAG TACGTGAATT GGACAGGTGC 
ACCATTGAAG AGTTTGGAAT ATCTGGCGCC ATGTTAATGG AGCGAGCCGG TAAGGCTTCC
CTAGAGCAAC TACGCAAACA CTGGCCACAG GCACAACGCT TAGTGATTAT CTGTGGAGTC
GGCAATAATG GTGGAGATGG CTATGTCCTG GCCCGCCTAG CCCGTAAAGC CGAAATGTAT
GTATCCATCT ATCAACTAGG GGATAATAGC AAGCTTAGCA CCGATGCCCA AGCCGCGCGA
CAAATGCTAT TAGACAGCGG CATGGAAATC CTTCCCTTCC AACCCCAAGT GCTCCGCGCC
GCGGATGTGG TAATAGATGC TATTTTCGGC ACCGGACTCA GCCGGGGAGT CACTGGCCAG
TGGGCCGATG CTATCGAAGC CATCAATACC TGCGGACAGC CGGTGTTTGC TATGGATATC
CCCTCTGGGC TCCATGCCGA TACCGGGAAT ATCCTGGGTA TCGCCATCAA AGCCCAAGTG
ACTGCCACCT TTGGGGGCCT GAAACAAGGG ATGTTCACCC ATCTGGGTCC GGATTACTGT
GGCGAGATTG CTTTCGACTC CCTTAGTATT CCTCCCGAAG CCTATCACGG GGTGACGCCT
TCGGCCCGCC GCATTACCCT TGAAGATCAC ATCACCAAAC TCCCCTCCCG CGCTAAGGCT
GGCCATAAGG GCGATTACGG CCATGTTGTC ATCATTGGAG GCGAGAGAGG AATGCCCGGA
GCCGCACGCA TGGCGGGGGA AGCAGCTTAT CGGGTAGGAG CGGGGCTGGT CAGTATTGCC
ACCCGGGAGA AGCATGCTTC CCTGCTCAAC CTAGCCCGCC CAGAATTGAT GTGCTACGGG
GTAGAGAGCG CCGAAGAGTT AAAACCATTG CTGAACCGGG CTACTACCCT TGTCATTGGT
CCAGGCCTAG GCCAGGATCT TTGGGGCCAA ACAATGCTCG CTGAGGCACT CAACCATTCC
CACCCTCTGG TGGTGGATGC GGATGCCCTC AATCTGCTAG CCAGTCAACC CCGCCAGCAT
AACCGCTGGA TTATCACTCC CCACCCCGGT GAAGCATCGA GACTGCTGAA CATTAATATT
GAAGAAATAC AAGCAGACCG TTTTGCCGCA GTCCAGGCTC TTCAACAGCG CTACGGCGGG
GTTGCGGTGC TTAAAGGAAA TGGGTCTCTG GTATGCTCCA CGAATCACCC CCTAGGGCTA
TGCACGGCAG GCAATCCAGG AATGGCCAGT GGCGGTATGG GGGACGTCCT CTCCGGAACT
ATCGCCGGCC TGCTGGCTCA AGGACTTACC CTGAATAATG CGGCCCATTT GGGGGTAACA
ATTCATGCCA TGGCTGGCGA TCGCGCTGCC CGTGAAGGGG GTGAGCGGGG ACTGCTGGCG
AGCGATTTGA TGGAACATCT GCGACAACTG GCCAATCTTC AACAAATAGG CCCATGA
 
Protein sequence
MAILPLNLYT AAQVRELDRC TIEEFGISGA MLMERAGKAS LEQLRKHWPQ AQRLVIICGV 
GNNGGDGYVL ARLARKAEMY VSIYQLGDNS KLSTDAQAAR QMLLDSGMEI LPFQPQVLRA
ADVVIDAIFG TGLSRGVTGQ WADAIEAINT CGQPVFAMDI PSGLHADTGN ILGIAIKAQV
TATFGGLKQG MFTHLGPDYC GEIAFDSLSI PPEAYHGVTP SARRITLEDH ITKLPSRAKA
GHKGDYGHVV IIGGERGMPG AARMAGEAAY RVGAGLVSIA TREKHASLLN LARPELMCYG
VESAEELKPL LNRATTLVIG PGLGQDLWGQ TMLAEALNHS HPLVVDADAL NLLASQPRQH
NRWIITPHPG EASRLLNINI EEIQADRFAA VQALQQRYGG VAVLKGNGSL VCSTNHPLGL
CTAGNPGMAS GGMGDVLSGT IAGLLAQGLT LNNAAHLGVT IHAMAGDRAA REGGERGLLA
SDLMEHLRQL ANLQQIGP