Gene Noc_2066 details


Gene Information

Locus tag: Noc_2066
Symbol:
ID: 3705237
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2376951
End bp: 2378690
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 52%
IMG OID: 637738541
Product: hypothetical protein
Protein accession: YP_344056
Protein GI: 77165531
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
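
The coordinates and lengths in this record agree with each other if Start bp and End bp are read as 1-based inclusive positions and the CDS ends in a single stop codon. Both readings are assumptions, not something the record states. A minimal Python sketch of that check, with illustrative function names:

```python
# Values copied from the record above.
START_BP = 2376951
END_BP = 2378690
REPORTED_GENE_LENGTH = 1740      # bp
REPORTED_PROTEIN_LENGTH = 579    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon is not translated


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1740 579
```

Under these assumptions the computed values match the reported Gene Length and Protein Length.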


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTTTG AAATGAGACA AGAGGAGCTG AAAAGGCTTA TTCTCAAAGG CAAGGCGCAA 
GGTTTGCTGA CTGAGGAAGA GTTACAAGAC TACATTATAC AGGAAGTGGA GGACAGTGAT
GAGGCTGAAA CCATGGCCCA TCTCTTTCAT CATCACCAGG GTCTCGATGA ATTTAACGAA
GCCCTAGATT CCACACTATT GCTTAAAAAT AGCTCGGAGG AGGAAATAGC TGAAACACTT
CCGACTTTAG AAGGAGAAAG CACGGGAGCA GTTAGTGATA TGGTCTATGC CTACATGCGG
GAGATGGGCG CCCATGCACT CCTTACCCGC GAGGAAGAAA TCACCTTAGC CAAGGCGATT
GAGACAGGAT TAAGCCAAAG CACCGAAGCC TTAGGTAACT GTCCCGCAGC CGTGGCTGAG
CTGGTACGTT GGGCCGAAGA AGTGGCCACA GGCAAGGGCC GTTTGAAAGA CTGGCTAACT
GGCTTTACTG AACCAGAAAT AGAAGAAACT GAGATAAAGG ACGGGAAAGA ATCAAGTTTG
GAAGCAACGA GCAAGCATCT ATCCCGAGTC CGCACTCTTT ATGGCAACCT AGAGGCAACT
TTAGCCCAAG AAGGGGTAGC AAGCCCCCGA GCACGAGAGC TGCGAAGGAA ACTTGCTCGG
GAATTTCAGT TCCTGCGCCT AATTCCGTCC AGAATTAATC AGCTGACTCA ATTGGTGCAA
GGCTGGATAA GCAAAGCCCA AACCCAGGAA CAAATCATAA GGTCCTGCTG TATCCATCAA
GGTGAGATGT CCCAAGAGGA CTTTTGGCAC AATTTTTCCA GCGGTACCAC TCATCCCCAG
TGGTTGGATG ACCTTCTAGC CCACAGTCGG ATAGACCGGC AAGGGCTGCA GACCCAAGCC
AAAACGCTTC GAGCAGCCCA AGCAGTACTG CTTCAGGTGG AAGTCGAGGC GGGACTCCCT
CTGGACGAGC TCAAGGCAAT TCACCAACGC CTGGTCCGAA GCCAATTTCA AGCGCAGCGG
GCTAAAGCAC AAATGGTGGA GGCCAATCTG CGCCTGGTGG TCTCGGTAGC GAAGAAGTAT
CGCAATCGGG GATTGACCTT TCTGGATTTG ATCCAAGAGG GGAATATCGG CTTGATGAAA
GCGGTGGACA AATTCGATTA CCACCGGGGC TATAAATTCT CTACCTATGC CCATTGGTGG
ATCCGGCAGG CAATTACGCG AGCTATTGAC GATCAGTCCC GAACTATCCG TATTCCGGTC
CATGTGATGG AGAAACTCAG CAAACTCAAC CGGGCCTCCT ACCAGCTCCG ACAGGAAAAA
GGCCGCGAAG GCCGTCCTGA GGAATTGGCC GAACGTCTCG CCCTGTCCGA GCAACAGATT
CATCGTATGC ACGAGATCGC TAAACAACCC ATCTCCTTGG AAACCCCCCT AGGTAAAGAT
GAGGACTCGC AATTAGGTGA ACTCATGGAA GACGAGCAAG TCCCAAATCC CATGGAGGTT
GCTATCACAG CCGGACTGCA GACCGGGGCC CAGCAGTTGC TCGCAGCACT TTCTCCCCGA
GAGGCCCAGG TGGTTGCCAT GCGCTTTGGG ATCGGTATGG ACACTGACCA TACCTTGGGA
GAAGTGGCCC AGCAATTTGA TTTGAGCCGG GAACGAATCC GGCAAATCGA GGCTCAAGCC
TTGGGTAAAC TGCGTCGCTT GGGCCACTCC AAGGCCCTGC GCAACTTTCT CGAAGACTAG
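
The gene sequence is laid out in space-separated blocks of 10 bases, so the whitespace has to be stripped before any per-base statistic such as GC content is computed. The sketch below shows one way to do that in Python. The helper names are illustrative, and how the record rounds its reported 52% is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block gaps, newlines) and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of bases in the sequence that are G or C."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First 10-base block of the gene sequence above: three G, no C.
first_block = clean_sequence("ATGGATTTTG ")
print(gc_percent(first_block))  # 30.0
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` gives the figure to compare against the reported GC content.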
 
Protein sequence
MDFEMRQEEL KRLILKGKAQ GLLTEEELQD YIIQEVEDSD EAETMAHLFH HHQGLDEFNE 
ALDSTLLLKN SSEEEIAETL PTLEGESTGA VSDMVYAYMR EMGAHALLTR EEEITLAKAI
ETGLSQSTEA LGNCPAAVAE LVRWAEEVAT GKGRLKDWLT GFTEPEIEET EIKDGKESSL
EATSKHLSRV RTLYGNLEAT LAQEGVASPR ARELRRKLAR EFQFLRLIPS RINQLTQLVQ
GWISKAQTQE QIIRSCCIHQ GEMSQEDFWH NFSSGTTHPQ WLDDLLAHSR IDRQGLQTQA
KTLRAAQAVL LQVEVEAGLP LDELKAIHQR LVRSQFQAQR AKAQMVEANL RLVVSVAKKY
RNRGLTFLDL IQEGNIGLMK AVDKFDYHRG YKFSTYAHWW IRQAITRAID DQSRTIRIPV
HVMEKLSKLN RASYQLRQEK GREGRPEELA ERLALSEQQI HRMHEIAKQP ISLETPLGKD
EDSQLGELME DEQVPNPMEV AITAGLQTGA QQLLAALSPR EAQVVAMRFG IGMDTDHTLG
EVAQQFDLSR ERIRQIEAQA LGKLRRLGHS KALRNFLED
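
The protein sequence can be reproduced from the gene sequence by codon-wise translation. Translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code and differs in the set of permitted start codons. Because this gene starts with ATG, a plain codon lookup is enough here. The sketch below is a minimal Python illustration with hypothetical names, not the method used to build this record.

```python
# Codon assignments in the usual compact form, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGATTTTG AAATGAGACA AGAGGAGCTG"))  # MDFEMRQEEL
```

The output matches the first 10 residues of the protein sequence. Alternative start codons such as GTG or TTG, which table 11 reads as methionine at initiation, would need special handling that this sketch omits.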