Gene Noc_0057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0057 
Symbol 
ID3705933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp57899 
End bp59566 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content54% 
IMG OID637736582 
Producthypothetical protein 
Protein accessionYP_342129 
Protein GI77163604 
COG category[K] Transcription 
COG ID[COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACGA ATAAACACGA AATAGAGGCA TTGATCCAGC GAGGCGAGAG TGTTTCTCTA 
GAGTTCAAAA GTGACCTGAA GCGCCTGCCG GATCGGGAGC TTGTGGCGGC TGTGGTTTCT
TTAGCAAATA CGGACGGCGG CGATCTATTG CTCGGCGTGG AGGATGACGG CACGGTTACC
GGTTTGCATG CAAGTCACCT GAATGTTTTA GGTATCCCGC CTCTCATTGC AAATAAGACG
AATCCTGCTA TTTCGGTTCG CGTGGAAAAG TGCGAGTCAA CGGGAAAGTC CATTGCCCGG
ATCAGGGTGC CGAAGTCCCA GCAACTGGTA TCCACTTCCG ACGGCTTTTT GGTACGCCGA
CGCCTGAAGT TCGATGGAAC ACCGGAAGCA GTACCTTTCT ATCCCCACGA ATTTATTCAA
CGCCAGTCGT CCCTGGGCCT CGTCGATCCC TCAGCCATGG TACTTGAGGA AGTAGACTCG
GGTCAGCTCG ATCCGCTTCA GCGCCTGCGT ATCCGTAGTG CCATAAAAAA GTATGGTGGG
GAGCAATCCC TGCTTGCTCT AGCTGATGAC GAACTGGACG GCGCACTTGG GCTTTGTCGT
GAATCGAATG GCGCTAGACA CCCCACCATG ACCGGGTTAC TGGTATTGGG AACGGAAGAA
CTCCTGCGAG CCCATGTGCC CGCCTATGAG GTCGCCTTTC AGGTGCTTCA GAGAACCGAT
GTCAAGGTCA ACGAGTTCTT TCGGAAACCG CTTCTGGAAA CCTTTGAAGA AGTGGACCCC
CTCTTCAAGG TCCGGGTTGA GGAAGAAGAA ATTCAGGTTG GCCTTTTCCG GGTACCTGTA
CCCAATTATG ATCGCCGCGC CTTTCGTGAG GCTTTTGTGA ATGCGCTGGT TCACCGTGAT
TTCAGCAGGC TCGGTGCGGT GCATGTCAAA ATCACTGATG ATGGTCTCAC TATCAGCAAC
CCGGGTGGTT TCGTCGAGGG GGTGAGACTG GATAACCTGC TCGTAGTCGA TCCTCGTTCC
CGTAATCCCC TCCTCGCCGA TGTGATTAAG CGGATCGGCC TGGCCGAGCG CACCGGGCGT
GGCATCGACC GCATTTACGA GGGCATGCTC CGCTACGGAC GCCCCGCGCC TGATTACTCA
ATGTCCGATG AATTCACGGT TTCAGTACAG ATGGTGAATG CTGCTGCCGA TCTCGATTTC
CTCAAGATGG TGGTCGAGCA GGAGGATAAG CTTGGCAACA TGCCGATTGA CTCCCTGATC
ATACTCTCCC GCCTTCGGGA AGAGCGACGA CTAACCACTG CGGACCTGGC CCCATCCGTC
CAGAAATCAG AAACCAACGT GAGGATAACC CTGGAGAAGC TGGTTGAAAC CGGATTTCTG
GAGCCGCATG GCACCGGCAG GGGGCGGACC TATACCCTCA GCGCTGCCCT GTACCGTAAG
GCAGGAAAGA AATCGGAATA CATCCGCCAG GCCGGTTTTG CGCCCATCCA GCAAGAGCAA
ATGGTGCTGA AATACATTGA TGCGCACGGT TCCATCAAAC GAGCAGATGC CGCTGATTTA
TGCCGGATCA GCCCATTTCA AGCCACGCGT TTGCTTAAAC GGATGGAAAA GAATGACTTA
GTAAAGCCTG TTGGGCAGGG AAGAGGGACG CGTTATGAGC GAAAATAG
 
Protein sequence
MNTNKHEIEA LIQRGESVSL EFKSDLKRLP DRELVAAVVS LANTDGGDLL LGVEDDGTVT 
GLHASHLNVL GIPPLIANKT NPAISVRVEK CESTGKSIAR IRVPKSQQLV STSDGFLVRR
RLKFDGTPEA VPFYPHEFIQ RQSSLGLVDP SAMVLEEVDS GQLDPLQRLR IRSAIKKYGG
EQSLLALADD ELDGALGLCR ESNGARHPTM TGLLVLGTEE LLRAHVPAYE VAFQVLQRTD
VKVNEFFRKP LLETFEEVDP LFKVRVEEEE IQVGLFRVPV PNYDRRAFRE AFVNALVHRD
FSRLGAVHVK ITDDGLTISN PGGFVEGVRL DNLLVVDPRS RNPLLADVIK RIGLAERTGR
GIDRIYEGML RYGRPAPDYS MSDEFTVSVQ MVNAAADLDF LKMVVEQEDK LGNMPIDSLI
ILSRLREERR LTTADLAPSV QKSETNVRIT LEKLVETGFL EPHGTGRGRT YTLSAALYRK
AGKKSEYIRQ AGFAPIQQEQ MVLKYIDAHG SIKRADAADL CRISPFQATR LLKRMEKNDL
VKPVGQGRGT RYERK