Gene Noc_2121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2121 
SymbolnusA 
ID3704924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2443399 
End bp2444919 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content50% 
IMG OID637738596 
Producttranscription elongation factor NusA 
Protein accessionYP_344111 
Protein GI77165586 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.274581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAG AAATTCTGAT GGTAGTAGAC GCTGTTTCCA ATGAAAAAGG CGTAAACAAG 
GAAGTCATTT TCCAGGCGAT TGAGGCGGCT CTGGCGATGG CGACTCGCAA GCGTTATCAA
GAGGATATTG CAGTGCAGGT CGTCATTGAT CGAACAACGG GCGATTACCA ATCCTTTCGG
TCTTGGGAAG TTATCGAGGA TGAGGCAGAA TTGGATGCGC CAGAGCGCCA GATGCATTTG
AGCGAAGCGT GCAAACGGGA TCCTAACGCC GAAGTGAGTG GCTTTGTGAA AGAACCCATG
GAGTCGATAG CATTTGGGCG TATTGCTGCT CAAACCGCTA AACAAGTGAT TGTTCAAAAG
GTGCGGGAGG CGGAACGGGC TAAGGTTGTT GCAGCCTACC GGGGCCGGAT TAAAGAGATG
GTCATGGGGG TGGTTAAACG CATGGATCGT GGCAACATTA TTCTGGATCT TGGAGACAAT
GCCGAGGGTA TTGTTCCTCA AGAAGAGATG ATACCTCGTG AGGCAGTCCG GCCAGGAGAT
CGATTGCGAG GCTATCTTAA GGAAGTACGA GCTGAAGGGC GAGGGCCTCA ACTAGTTATT
AGCAGGACTG CCCCGGAGCT TCTCATTGAG TTGTTCAAGT TAGAGGTTCC TGAAATCAAC
GAGAATCGTA TTGAAGTTAA GGGTGCTGCT CGGGATCCAG GCCTTCGGGC TAAAATTGCC
GTAAAAACCA ATGAAACTCG TATTGATCCG GTAGGAGCCT GCGTGGGCAT GCGGGGTTCT
AGAGTTCAAG CTGTGTCGAA TGAGCTTGCT GGTGAGCGAG TGGACATCGT GCTTTGGGAC
GAAGATCCTG CGCGCTTTGT AATTAATGCT ATGGCGCCAG CAGAGGTCGC CTCTATTGTG
GTTGATGAGG AGCGCCATAG CATGGACGTT GCTGTGGCCG AGGGGAATCT CTCTCAAGCT
ATTGGCCGTG GAGGGCAGAA CATACGTTTG GCTAGCCAGC TGACCGGCTG GGAGTTGAAT
GTAATGACGG AACAAGAAGC AGAAGAAAAA GGTGAAAGCG AAGCAAAGTC GCTCCAGCAG
ATGTTTATGG AACAGTTAGA TGTGGATGAG GAAGTGGCGG CTATCCTAGT ACATGAAGGT
TTTTCTAGTA TCGAAGAAAT GGCCTATGTG CCGGAACAAG AGATTCTTGC TATAGAGGAG
TTTGACTCCC AAATCGTTCA GGAACTACGT AATCGAGCCC GTGACGTTCT TCTTACTCGA
GCAATTGCTA ATGAAGAGAC TATTGAGACC GTACAGCTAG ACCAGGAATT ATTGGACATG
GAAGAGATTG ACCATGGATT AGCCTCTGCA CTGGCCAGCA AAGGTATTGT AACTAAAGAA
AATTTGGCGG AGCAGGCGGT GGATGAACTC ATGGAGATCG AGGGGATGAA CAGAGAGAAA
GCTGCTCGTC TCATTATGGC AGCACGAGAG TCCTGGTTTG TTAGCGAGAA ACAGGATGAA
GAAACGGAGG AGCGGCAATG A
 
Protein sequence
MNKEILMVVD AVSNEKGVNK EVIFQAIEAA LAMATRKRYQ EDIAVQVVID RTTGDYQSFR 
SWEVIEDEAE LDAPERQMHL SEACKRDPNA EVSGFVKEPM ESIAFGRIAA QTAKQVIVQK
VREAERAKVV AAYRGRIKEM VMGVVKRMDR GNIILDLGDN AEGIVPQEEM IPREAVRPGD
RLRGYLKEVR AEGRGPQLVI SRTAPELLIE LFKLEVPEIN ENRIEVKGAA RDPGLRAKIA
VKTNETRIDP VGACVGMRGS RVQAVSNELA GERVDIVLWD EDPARFVINA MAPAEVASIV
VDEERHSMDV AVAEGNLSQA IGRGGQNIRL ASQLTGWELN VMTEQEAEEK GESEAKSLQQ
MFMEQLDVDE EVAAILVHEG FSSIEEMAYV PEQEILAIEE FDSQIVQELR NRARDVLLTR
AIANEETIET VQLDQELLDM EEIDHGLASA LASKGIVTKE NLAEQAVDEL MEIEGMNREK
AARLIMAARE SWFVSEKQDE ETEERQ