Gene Information |
Locus tag | Noc_2121 |
Symbol | nusA |
ID | 3704924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 2443399 |
End bp | 2444919 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637738596 |
Product | transcription elongation factor NusA |
Protein accession | YP_344111 |
Protein GI | 77165586 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA; [TIGR01954] transcription termination factor NusA, C-terminal duplication |
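The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length counts the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2443399
end_bp = 2444919
gene_length_bp = 1521
protein_length_aa = 506

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which adds no amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1521 bp is 507 codons, and 507 codons minus one stop codon gives the listed 506 aa.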
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.274581 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAG AAATTCTGAT GGTAGTAGAC GCTGTTTCCA ATGAAAAAGG CGTAAACAAG GAAGTCATTT TCCAGGCGAT TGAGGCGGCT CTGGCGATGG CGACTCGCAA GCGTTATCAA GAGGATATTG CAGTGCAGGT CGTCATTGAT CGAACAACGG GCGATTACCA ATCCTTTCGG TCTTGGGAAG TTATCGAGGA TGAGGCAGAA TTGGATGCGC CAGAGCGCCA GATGCATTTG AGCGAAGCGT GCAAACGGGA TCCTAACGCC GAAGTGAGTG GCTTTGTGAA AGAACCCATG GAGTCGATAG CATTTGGGCG TATTGCTGCT CAAACCGCTA AACAAGTGAT TGTTCAAAAG GTGCGGGAGG CGGAACGGGC TAAGGTTGTT GCAGCCTACC GGGGCCGGAT TAAAGAGATG GTCATGGGGG TGGTTAAACG CATGGATCGT GGCAACATTA TTCTGGATCT TGGAGACAAT GCCGAGGGTA TTGTTCCTCA AGAAGAGATG ATACCTCGTG AGGCAGTCCG GCCAGGAGAT CGATTGCGAG GCTATCTTAA GGAAGTACGA GCTGAAGGGC GAGGGCCTCA ACTAGTTATT AGCAGGACTG CCCCGGAGCT TCTCATTGAG TTGTTCAAGT TAGAGGTTCC TGAAATCAAC GAGAATCGTA TTGAAGTTAA GGGTGCTGCT CGGGATCCAG GCCTTCGGGC TAAAATTGCC GTAAAAACCA ATGAAACTCG TATTGATCCG GTAGGAGCCT GCGTGGGCAT GCGGGGTTCT AGAGTTCAAG CTGTGTCGAA TGAGCTTGCT GGTGAGCGAG TGGACATCGT GCTTTGGGAC GAAGATCCTG CGCGCTTTGT AATTAATGCT ATGGCGCCAG CAGAGGTCGC CTCTATTGTG GTTGATGAGG AGCGCCATAG CATGGACGTT GCTGTGGCCG AGGGGAATCT CTCTCAAGCT ATTGGCCGTG GAGGGCAGAA CATACGTTTG GCTAGCCAGC TGACCGGCTG GGAGTTGAAT GTAATGACGG AACAAGAAGC AGAAGAAAAA GGTGAAAGCG AAGCAAAGTC GCTCCAGCAG ATGTTTATGG AACAGTTAGA TGTGGATGAG GAAGTGGCGG CTATCCTAGT ACATGAAGGT TTTTCTAGTA TCGAAGAAAT GGCCTATGTG CCGGAACAAG AGATTCTTGC TATAGAGGAG TTTGACTCCC AAATCGTTCA GGAACTACGT AATCGAGCCC GTGACGTTCT TCTTACTCGA GCAATTGCTA ATGAAGAGAC TATTGAGACC GTACAGCTAG ACCAGGAATT ATTGGACATG GAAGAGATTG ACCATGGATT AGCCTCTGCA CTGGCCAGCA AAGGTATTGT AACTAAAGAA AATTTGGCGG AGCAGGCGGT GGATGAACTC ATGGAGATCG AGGGGATGAA CAGAGAGAAA GCTGCTCGTC TCATTATGGC AGCACGAGAG TCCTGGTTTG TTAGCGAGAA ACAGGATGAA GAAACGGAGG AGCGGCAATG A
|
Protein sequence | MNKEILMVVD AVSNEKGVNK EVIFQAIEAA LAMATRKRYQ EDIAVQVVID RTTGDYQSFR SWEVIEDEAE LDAPERQMHL SEACKRDPNA EVSGFVKEPM ESIAFGRIAA QTAKQVIVQK VREAERAKVV AAYRGRIKEM VMGVVKRMDR GNIILDLGDN AEGIVPQEEM IPREAVRPGD RLRGYLKEVR AEGRGPQLVI SRTAPELLIE LFKLEVPEIN ENRIEVKGAA RDPGLRAKIA VKTNETRIDP VGACVGMRGS RVQAVSNELA GERVDIVLWD EDPARFVINA MAPAEVASIV VDEERHSMDV AVAEGNLSQA IGRGGQNIRL ASQLTGWELN VMTEQEAEEK GESEAKSLQQ MFMEQLDVDE EVAAILVHEG FSSIEEMAYV PEQEILAIEE FDSQIVQELR NRARDVLLTR AIANEETIET VQLDQELLDM EEIDHGLASA LASKGIVTKE NLAEQAVDEL MEIEGMNREK AARLIMAARE SWFVSEKQDE ETEERQ
|
| |
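The sequences above are shown in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate codons; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It assumes the gene sequence is displayed 5' to 3' on the coding strand (it begins with ATG even though the record lists Strand as "-"), and it uses the standard amino acid assignments, which translation table 11 shares with table 1. Only the first three and last three blocks of the gene sequence are used here, not the full record.

```python
# Codon table in NCBI order (bases T, C, A, G).
# Assumption: amino acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the ten-character block spacing and normalise case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Excerpts copied from the Gene sequence row (start and end of the CDS).
head = clean("ATGAACAAAG AAATTCTGAT GGTAGTAGAC")
tail = clean("GAAACGGAGG AGCGGCAATG A")

print(translate(head))  # MNKEILMVVD
print(translate(tail))  # ETEERQ*
print(gc_fraction(head))
```

The two translated excerpts match the first block and the final residues of the protein sequence row, with the terminal TGA read as the stop codon. Running `gc_fraction` on the full cleaned gene sequence would be the way to compare against the listed GC content.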