Gene Information |
Locus tag | Noc_0045 |
Symbol | |
ID | 3705920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 43212 |
End bp | 45020 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637736569 |
Product | hypothetical protein |
Protein accession | YP_342117 |
Protein GI | 77163592 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain; [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
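The coordinate and length fields above agree with one another: with inclusive coordinates the gene length is End bp minus Start bp plus one, and for an unspliced CDS the protein length is the number of codons minus the stop codon. A minimal Python sketch of that check follows. It assumes inclusive, 1-based coordinates (the record does not state its convention), and the function names are hypothetical, chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(43212, 45020)
print(length, protein_length_aa(length))  # prints: 1809 602
```

Both results match the Gene Length (1809 bp) and Protein Length (602 aa) fields.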
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.250484 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGAAAA TTATGGAACA AGATCGTCAG TCACAATTGA AACTTTTGAT CGCAAGGGGT AAGGATCGGG GTTATCTATC CTATAGCGAG GTAAACGATC ACCTGCCCGA CGACGTGATC GATCCGGAGC AAATTGAAGA CATTATTACA ATGTTCAACG ATATGGGTAT CTCCGTGCAT GAGGATATGG CGGAAGCCGA TGCCCTAGCA ATTATGGATA CCGCCGTTGC CGACGATGAG GTGGCTGAGG AGGCTGCTGC TGCTCTGGTC TCGGTGGATG GAGAGTTTGC CCGAACTTCG GATCCGGTCC GTATGTATAT GCGGGAAATG GGCAGCGTTG AGCTTTTGAC TCGGGAAGGG GAGATCAGCA TCGCTAAGCG AATCGAAGAG GGAATGCGGC AAGTTTATAC CGCTCTTTCT CGTTTTCCAG GCAGCGTGGC TGAATTATTG TGTCAATACG AAAAAGTGGA GGCGGATGAA ATCCGCTTGG CTGATGTGAT TTCCGGGTTT CAGGAATCAG AGGAAGCGGA TTTAAAGGAA TCTGCTGCAG AAGAAGCTTC CGTCTCTTCC GAAGGTAGCA CGGAGGAGGA AGAAAGTGAT AATGGCCTTG ATCTAGATAG AGTCCGGGAA CAGTTTACTC AACTACGGGA GTTGTATGAG CGGTATCAGG CGGCTGATCT CAAGGATAAT GATAGAGCGC TAGCTGAACT GAGCGAGTGC TTTCTTCAGC TTAAGCTTGT GCCTCGTTTA TCTCAACGGT TGATAGAAAA TCTTCGCGGA GTAGTGGCAG AAATTCGTGT GCTTGAACGG AGCATCATGA AAATTGCGGT GAATTCGGCT CAGATGCCCC GCAAAGATTT TCTGTCTTCG TTCCAGAAAC AAGAAACCAA CCTAAACTGG GTAGAAAAGC ATATCCGGGC AAAAAAGCAT TATTCTTCCG TACTCAGAAA ACAAGGGGAC GCAATTCAGA AAGCCCAAGA AAAGCTCATA GTCTTAGAGC AAAGGGCCGG TATGAGTATT ACCCAGATTA AGGAGATTAA CCGTTCCCTA TCTATTGGAG AAGCACGGGC CCGGCGGGCT AAAAAAGAGA TGGTCGAGGC TAATTTGCGC TTGGTCATTT CGATTGCCAA AAAATATACC AACCGTGGTC TCCAGTTTCT GGATCTGATC CAGGAAGGCA ACATTGGTCT CATGAAGGCC GTTGATAAAT TTGAATACCG CCGTGGTTAC AAGTTTTCTA CCTATGCGAC TTGGTGGATA CGGCAGGCAA TTACCCGTTC TATCGCTGAT CAGGCTCGGA CCATCCGTAT TCCGGTACAT ATGATTGAAA CCATCAATAA GCTCAACCGC GTTTCTCGTC AGATGCTACA AGAGATGGGT CGAGAGCCAA CCCCCGATGA GTTGGCAGAA CGGGTGGAGA TGCCAAAAGA TAAAGTGCGG AAGGTCCTTA AGATTGCCAG GGAACCCATC TCCATGGAAA CGCCTATTGG TGACGATGAG GATTCCCATT TAGGGGATTT TGTCGAGGAC GCTGCTGTCA TATCTCCGCC TGATTCGGCG ATCTTCTCCG GGCTGAGGGA AACTACACAG TTAATATTAG CCGGTTTAAC TCCCCGCGAG GCTAAAGTGC TGCGGATGCG TTTTGGGATT GATATGAATA CGGATCATAC CTTGGAAGAA GTAGGTAAGC AGTTTGATGT AACCCGGGAG AGAATTCGGC AGATTGAGGC TAAGGCTCTA CGTAAATTGC GCCATCCTTC CCGTTCGGAG CAGCTTCGTA GTTTTTTGGA TATAGAAAAT 
AACAACTAA
|
Protein sequence | MLKIMEQDRQ SQLKLLIARG KDRGYLSYSE VNDHLPDDVI DPEQIEDIIT MFNDMGISVH EDMAEADALA IMDTAVADDE VAEEAAAALV SVDGEFARTS DPVRMYMREM GSVELLTREG EISIAKRIEE GMRQVYTALS RFPGSVAELL CQYEKVEADE IRLADVISGF QESEEADLKE SAAEEASVSS EGSTEEEESD NGLDLDRVRE QFTQLRELYE RYQAADLKDN DRALAELSEC FLQLKLVPRL SQRLIENLRG VVAEIRVLER SIMKIAVNSA QMPRKDFLSS FQKQETNLNW VEKHIRAKKH YSSVLRKQGD AIQKAQEKLI VLEQRAGMSI TQIKEINRSL SIGEARARRA KKEMVEANLR LVISIAKKYT NRGLQFLDLI QEGNIGLMKA VDKFEYRRGY KFSTYATWWI RQAITRSIAD QARTIRIPVH MIETINKLNR VSRQMLQEMG REPTPDELAE RVEMPKDKVR KVLKIAREPI SMETPIGDDE DSHLGDFVED AAVISPPDSA IFSGLRETTQ LILAGLTPRE AKVLRMRFGI DMNTDHTLEE VGKQFDVTRE RIRQIEAKAL RKLRHPSRSE QLRSFLDIEN NN
|
| |
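The gene sequence above is displayed in space-separated groups of ten bases and wrapped across lines, so it has to be normalized before any computation. The Python sketch below shows one way to do that, and how a GC fraction (reported as 48% in this record) and the presence of a terminal stop codon could be rechecked. The function names are hypothetical, and the demonstration uses a short toy string rather than the record's full sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def normalize(grouped: str) -> str:
    """Strip spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input for illustration only (not the sequence of Noc_0045).
toy = normalize("ATGGCC\nTAA")
print(toy, ends_with_stop(toy))  # prints: ATGGCCTAA True
```

To apply this to the record, paste the full Gene sequence field (including the wrapped final group) into `normalize` and pass the result to `gc_fraction` and `ends_with_stop`.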