Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_16141 |
Symbol | nusA |
ID | 5730773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1445233 |
End bp | 1446648 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641285992 |
Product | transcription elongation factor NusA |
Protein accession | YP_001551499 |
Protein GI | 159904155 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACTAG TTCTTCTCCC TGGACTCAAT AACCTGATCG ACGACATAAG TGAGGAGAAA AAGCTCCCTG CTCAGGTCGT TGAGACTGCT CTTAGGGAAG CACTCCTAAA AGGTTATGAA AGATATAGAC GAACCCTCTA TCTAGGGATT AATGAAAACC CCTTCGAAGA GGAATACTTT AGTAATTTTG ATGTTGGACT GGATCTTGAT GAAGAAGGTT ATCGAGTATT AGCAAGCAAA ATCATTGTTG ACGAAGTTGA GAGTGAAGAT CATCAGATCG CTTTATCTGA AGTCATGCAA GTTGCTGAAG ATGCTCAAAT AGGAGACACA GTAGTTCTAG ATGTAACTCC TGAGAAAGAA GAGTTTGGAA GAATGGCAGC TGCAACAACT AAGCAAGTCC TTGCTCAAAA GTTGCGAGAT CAACAGCGAA GAATGATTCA AGAAGAATTT GCAGATTTAG AAGATCCTGT CCTAACTGCT CGAGTAATTA GATTCGAACG TCAATCAGTA ATCATGGCAG TCAGTTCAGG GCTAGGCAGA CCAGAAGTTG AAGCGGAGCT CCCTCGCAGA GATCAACTGC CAAACGATAA TTATCGCGCA AATGCAACTT TCAAAGTATT TCTAAAAGAA GTAAGTGAAA CACCCAGGCG AGGTCCTCAA TTATTTGTTA GTAGATCTAA CGCTGGTTTA GTAGTTTATT TATTTGAAAA CGAGGTACCA GAAATCCAAG AAGGGTCCGT CAGGATAGTA GCTGTGGCTA GAGAAGCTAA TCCTCCAACA CGAGCTGTCG GACCCAGAAC AAAAGTCGCT GTTGATAGCA TTGAAAGAGA AGTGGACCCA GTAGGCGCAT GCATAGGTGC AAGAGGATCC AGAATTCAGC AAGTAGTTAA TGAACTGAGA GGAGAAAAAA TAGATGTGAT TCGCTGGTCT GCAGACCCAG TCCAATACAT TTCTAATTCG CTCAGTCCTG CAAGAGTTGA AGTTGTAAGG CTCGTTGATC CAGAAGGGCA GCATGCGCAT GTCTTAGTGC CCCCTGATCA ACTTAGTCTT GCAATTGGAA GAGAAGGTCA AAATGTTCGA TTAGCAGCAA GACTTACTGG CTGGAAAATC GATATTAAGA ATTCACAAGA ATATGACCAA GAGTCTGAAG ATTCAGCAGT TGCTGAACTG ATCTCTCAAA GAGAAGAAGA AGAAAGTCTG CAAAGAGAAG CTGAAGAAAG ATTGGCTGCA GAACAGGCTG CTAGGGCAGA AGAAGATGCC CGACTAAGAG AGCTTTATCC TCTTCCAGAA GATGATGAAG AGAATATAGA AGAAAGTACA ACTGAATTAG AAGAGCTTCC AATAAGTGAA AATGAAGAAG CCAAACAAAA TGAAGGATTG AGTAATGAAC AAAGCCCCGA GGATGGACCC CGGTGA
|
Protein sequence | MALVLLPGLN NLIDDISEEK KLPAQVVETA LREALLKGYE RYRRTLYLGI NENPFEEEYF SNFDVGLDLD EEGYRVLASK IIVDEVESED HQIALSEVMQ VAEDAQIGDT VVLDVTPEKE EFGRMAAATT KQVLAQKLRD QQRRMIQEEF ADLEDPVLTA RVIRFERQSV IMAVSSGLGR PEVEAELPRR DQLPNDNYRA NATFKVFLKE VSETPRRGPQ LFVSRSNAGL VVYLFENEVP EIQEGSVRIV AVAREANPPT RAVGPRTKVA VDSIEREVDP VGACIGARGS RIQQVVNELR GEKIDVIRWS ADPVQYISNS LSPARVEVVR LVDPEGQHAH VLVPPDQLSL AIGREGQNVR LAARLTGWKI DIKNSQEYDQ ESEDSAVAEL ISQREEEESL QREAEERLAA EQAARAEEDA RLRELYPLPE DDEENIEEST TELEELPISE NEEAKQNEGL SNEQSPEDGP R
|
| |