Gene Noc_1858 details


Gene Information

Locus tag: Noc_1858
Symbol:
ID: 3705122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2110924
End bp: 2112633
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 56%
IMG OID: 637738337
Product: hypothetical protein
Protein accession: YP_343854
Protein GI: 77165329
COG category:
COG ID:
TIGRFAM ID:
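
The coordinates and lengths listed above agree with each other, and the check is simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive (the page does not state its coordinate convention) and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2110924
END_BP = 2112633


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1710
print(protein_length(length_bp))  # 569
```

Both results match the Gene Length (1710 bp) and Protein Length (569 aa) fields.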


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAGAAA CTGCGCGAGC TCGAAGTAAT AAAGGGAAAG AGCGGAATGC TGAAGGACGT 
GGGAATGGAA GCCCGCGGCC TCACGTGGAG GCGGGTCCTC GGCTAGATCC TTTAGCTGGC
CTGCTTAACT TACAGCGCAT GGCGGGCAAC CATGCGGTGA ATCAACTACT TGCTGGTGCT
GTTGGAGGGA ATGAATCACT ACAGGGAGTG GGTGTGGAGC GAAAACCCAA TTTCAACGGG
ATGTCGGCAT TTGCACAAGC ACCGGTAACA GTGCAGAAAA AATTAAAGGT GAGCAGCCCG
GGGGATGGCT ACGAGCAGGA GGCGGACCGG ATTGCCGATC GGGTGATGGC TATGCCGGCA
CGGCAGAGGG CCAGCAGCGT TACACCCCGT ATCCAGCGTT TATCTGGAGA GGCGAATGGC
CGGACCGATG TAGTGCCCGA TAGCGTAAAG CAAACGCTCG CCAGCCCTGG CAAACCGCTC
GATTCCGCCA CCCGCTCTCT CTTTGAACCG CGTTTTGGGC ATGATTTCAG CCAAGTGCGA
GTGCATACTG ATACACTGGC GGCAGAATCA GCAAAAGCGG TGAATGCGCA GGCTTATACT
GTCGGAGATC ACGTCGTTTT CGGACAACGA AAATATAAAC CGAGATCGCT TGGCGGCCTC
CGCCTGCTAG CCCATGAGCT GACGCATGTA GTCCAGCAAG GCCATAGCCG GTCGCTTGTC
CAACTTCAGC CTGCGCTAAA CGTCGCTCCC GCGAACGATC AGATACGCAG GGAATTCGTT
CGAGACACCA TAAAATTCCT TGAGGGAAGC GCCTCATATT ACCAGAGTCC CAATGTTCCA
ATCGATAGGG CTCTTTTCGA TCGGGTAATC GACGGCTGGT ACGCCGCGGT AGTGAACCAG
GAAAGGATAA TCGATAATGA TCTGGGCGGC GACCGTGCTT TGAAGGCTGA TCTGAGAGCC
GCCTACATAA GCGCCATTCG CGTGCTGATC AGCAGGGCGG CGGCTGCCCT GGGAAAGAGC
GAGGCCGAGT TATACTCGGA AAACAGCGGA CGCATTCCGC TATGGGCGTG GCAAACTCCC
CACCATATGG AGCAGTTCAT CTCAACGCCC ATTCCGGAGG GACGAGTGGC CGACAAACTA
ACGGGAGCGG TGAATTTTTC CACAAACGGC TTTGATGTGA GCATAATGCC GGATGCGCTA
GATCCGGCTC TGGGAAACCA TGCAGAGACG CGGCAAAACA TCCAGGGAGG TAGGATCAAT
TATCAGCGGC AGTCCAGGAG AGGCCGGAGA ATCGTCACAA ATTTCAACGG CCCCGGCACG
CCAGCCGTAA CTATCCAGAC GTTCTACGGG CGCGGCGTAA CAGCGGCTTC CCCATCAGAC
TACGGCAGAG GCACTACGCC GGAGGATATC GCCGGAGGCA GAGTCACGCC CAGAAGCACA
AGCCTCGGTC TTCATGAGGG CAGCCACGGC CTGGACTATA TAGATTTCCT GGAGACTAAC
CCGGCCCCTC AATTCACCGG CGCGGTAGGG ATGACGGAAG CCCAGTTCAG AGCGGCCATA
CGCCAGTATC AGCGAGACCT CAGGGACTAC GCTAATAGAA TGGAAGAGTT TTCTACAAGA
CGCACCGACT GCGTTGGGAC GACGATTGAT CAGTTCGACC AAGCCAACGG CGCGGCAGGA
GCACAGGTCA TTATACAATG CGGTCCGTAG
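
The gene sequence is printed in blocks of ten bases separated by spaces, so the whitespace has to be removed before measuring anything. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. `clean_sequence` and `gc_percent` are hypothetical helper names, and the example input is only the first line of the sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a sequence (whitespace is ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as sample input.
first_line = "ATGCCAGAAA CTGCGCGAGC TCGAAGTAAT AAAGGGAAAG AGCGGAATGC TGAAGGACGT"
print(len(clean_sequence(first_line)))  # 60
print(gc_percent(first_line))
```

Passing the full gene sequence instead of the first line would give a value to compare against the rounded figure reported on the page.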
 
Protein sequence
MPETARARSN KGKERNAEGR GNGSPRPHVE AGPRLDPLAG LLNLQRMAGN HAVNQLLAGA 
VGGNESLQGV GVERKPNFNG MSAFAQAPVT VQKKLKVSSP GDGYEQEADR IADRVMAMPA
RQRASSVTPR IQRLSGEANG RTDVVPDSVK QTLASPGKPL DSATRSLFEP RFGHDFSQVR
VHTDTLAAES AKAVNAQAYT VGDHVVFGQR KYKPRSLGGL RLLAHELTHV VQQGHSRSLV
QLQPALNVAP ANDQIRREFV RDTIKFLEGS ASYYQSPNVP IDRALFDRVI DGWYAAVVNQ
ERIIDNDLGG DRALKADLRA AYISAIRVLI SRAAAALGKS EAELYSENSG RIPLWAWQTP
HHMEQFISTP IPEGRVADKL TGAVNFSTNG FDVSIMPDAL DPALGNHAET RQNIQGGRIN
YQRQSRRGRR IVTNFNGPGT PAVTIQTFYG RGVTAASPSD YGRGTTPEDI AGGRVTPRST
SLGLHEGSHG LDYIDFLETN PAPQFTGAVG MTEAQFRAAI RQYQRDLRDY ANRMEEFSTR
RTDCVGTTID QFDQANGAAG AQVIIQCGP
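
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a minimal, dependency-free illustration. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may serve as start codons. Because this gene begins with ATG, no special handling of alternative start codons is included, which is a simplification. `translate` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is ignored; codons with unexpected characters become 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 30 bases of the gene sequence (both in frame).
print(translate("ATGCCAGAAA CTGCGCGAGC TCGAAGTAAT"))  # MPETARARSN
print(translate("GCACAGGTCA TTATACAATG CGGTCCGTAG"))  # AQVIIQCGP
```

The two outputs match the first ten and last nine residues of the protein sequence, and the final TAG codon is the stop codon that is excluded from the 569 aa protein length.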