Gene Noc_0045 details


Gene Information

Locus tag: Noc_0045
Symbol:
ID: 3705920
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 43212
End bp: 45020
Gene Length: 1809 bp
Protein Length: 602 aa
Translation table: 11
GC content: 48%
IMG OID: 637736569
Product: hypothetical protein
Protein accession: YP_342117
Protein GI: 77163592
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
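
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive start/end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(43212, 45020)
    print(length)                     # 1809
    print(protein_length_aa(length))  # 602
```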


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.250484
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGAAAA TTATGGAACA AGATCGTCAG TCACAATTGA AACTTTTGAT CGCAAGGGGT 
AAGGATCGGG GTTATCTATC CTATAGCGAG GTAAACGATC ACCTGCCCGA CGACGTGATC
GATCCGGAGC AAATTGAAGA CATTATTACA ATGTTCAACG ATATGGGTAT CTCCGTGCAT
GAGGATATGG CGGAAGCCGA TGCCCTAGCA ATTATGGATA CCGCCGTTGC CGACGATGAG
GTGGCTGAGG AGGCTGCTGC TGCTCTGGTC TCGGTGGATG GAGAGTTTGC CCGAACTTCG
GATCCGGTCC GTATGTATAT GCGGGAAATG GGCAGCGTTG AGCTTTTGAC TCGGGAAGGG
GAGATCAGCA TCGCTAAGCG AATCGAAGAG GGAATGCGGC AAGTTTATAC CGCTCTTTCT
CGTTTTCCAG GCAGCGTGGC TGAATTATTG TGTCAATACG AAAAAGTGGA GGCGGATGAA
ATCCGCTTGG CTGATGTGAT TTCCGGGTTT CAGGAATCAG AGGAAGCGGA TTTAAAGGAA
TCTGCTGCAG AAGAAGCTTC CGTCTCTTCC GAAGGTAGCA CGGAGGAGGA AGAAAGTGAT
AATGGCCTTG ATCTAGATAG AGTCCGGGAA CAGTTTACTC AACTACGGGA GTTGTATGAG
CGGTATCAGG CGGCTGATCT CAAGGATAAT GATAGAGCGC TAGCTGAACT GAGCGAGTGC
TTTCTTCAGC TTAAGCTTGT GCCTCGTTTA TCTCAACGGT TGATAGAAAA TCTTCGCGGA
GTAGTGGCAG AAATTCGTGT GCTTGAACGG AGCATCATGA AAATTGCGGT GAATTCGGCT
CAGATGCCCC GCAAAGATTT TCTGTCTTCG TTCCAGAAAC AAGAAACCAA CCTAAACTGG
GTAGAAAAGC ATATCCGGGC AAAAAAGCAT TATTCTTCCG TACTCAGAAA ACAAGGGGAC
GCAATTCAGA AAGCCCAAGA AAAGCTCATA GTCTTAGAGC AAAGGGCCGG TATGAGTATT
ACCCAGATTA AGGAGATTAA CCGTTCCCTA TCTATTGGAG AAGCACGGGC CCGGCGGGCT
AAAAAAGAGA TGGTCGAGGC TAATTTGCGC TTGGTCATTT CGATTGCCAA AAAATATACC
AACCGTGGTC TCCAGTTTCT GGATCTGATC CAGGAAGGCA ACATTGGTCT CATGAAGGCC
GTTGATAAAT TTGAATACCG CCGTGGTTAC AAGTTTTCTA CCTATGCGAC TTGGTGGATA
CGGCAGGCAA TTACCCGTTC TATCGCTGAT CAGGCTCGGA CCATCCGTAT TCCGGTACAT
ATGATTGAAA CCATCAATAA GCTCAACCGC GTTTCTCGTC AGATGCTACA AGAGATGGGT
CGAGAGCCAA CCCCCGATGA GTTGGCAGAA CGGGTGGAGA TGCCAAAAGA TAAAGTGCGG
AAGGTCCTTA AGATTGCCAG GGAACCCATC TCCATGGAAA CGCCTATTGG TGACGATGAG
GATTCCCATT TAGGGGATTT TGTCGAGGAC GCTGCTGTCA TATCTCCGCC TGATTCGGCG
ATCTTCTCCG GGCTGAGGGA AACTACACAG TTAATATTAG CCGGTTTAAC TCCCCGCGAG
GCTAAAGTGC TGCGGATGCG TTTTGGGATT GATATGAATA CGGATCATAC CTTGGAAGAA
GTAGGTAAGC AGTTTGATGT AACCCGGGAG AGAATTCGGC AGATTGAGGC TAAGGCTCTA
CGTAAATTGC GCCATCCTTC CCGTTCGGAG CAGCTTCGTA GTTTTTTGGA TATAGAAAAT
AACAACTAA
 
Protein sequence
MLKIMEQDRQ SQLKLLIARG KDRGYLSYSE VNDHLPDDVI DPEQIEDIIT MFNDMGISVH 
EDMAEADALA IMDTAVADDE VAEEAAAALV SVDGEFARTS DPVRMYMREM GSVELLTREG
EISIAKRIEE GMRQVYTALS RFPGSVAELL CQYEKVEADE IRLADVISGF QESEEADLKE
SAAEEASVSS EGSTEEEESD NGLDLDRVRE QFTQLRELYE RYQAADLKDN DRALAELSEC
FLQLKLVPRL SQRLIENLRG VVAEIRVLER SIMKIAVNSA QMPRKDFLSS FQKQETNLNW
VEKHIRAKKH YSSVLRKQGD AIQKAQEKLI VLEQRAGMSI TQIKEINRSL SIGEARARRA
KKEMVEANLR LVISIAKKYT NRGLQFLDLI QEGNIGLMKA VDKFEYRRGY KFSTYATWWI
RQAITRSIAD QARTIRIPVH MIETINKLNR VSRQMLQEMG REPTPDELAE RVEMPKDKVR
KVLKIAREPI SMETPIGDDE DSHLGDFVED AAVISPPDSA IFSGLRETTQ LILAGLTPRE
AKVLRMRFGI DMNTDHTLEE VGKQFDVTRE RIRQIEAKAL RKLRHPSRSE QLRSFLDIEN
NN
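
The protein sequence can be derived from the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own pipeline. It assumes the amino acid assignments of translation table 11, which match the standard code, and it does not handle the alternative start codons that table 11 allows. The helper names `clean` and `translate` are hypothetical.

```python
# Codon table built in the conventional T, C, A, G base order.
# The amino acid assignments are those of the standard code, which
# translation table 11 shares (table 11 differs only in start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str, trim_stop: bool = True) -> str:
    """Translate a coding sequence; a trailing stop codon is dropped by default."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    if trim_stop and protein.endswith("*"):
        protein = protein[:-1]
    return protein


if __name__ == "__main__":
    # First 30 and last 9 bases of the gene sequence listed above.
    print(translate("ATGTTGAAAA TTATGGAACA AGATCGTCAG"))  # MLKIMEQDRQ
    print(translate("AACAACTAA"))                          # NN
```

Passing the full gene sequence to `translate` should give a string that can be compared with the listed protein sequence once its spaces are removed with `clean`.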