Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_04151 |
Symbol | nusA |
ID | 4776556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 415001 |
End bp | 416455 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640085919 |
Product | transcription elongation factor NusA |
Protein accession | YP_001016432 |
Protein GI | 124022125 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.659645 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCTCG TTCTACTCCC CGGCCTCAAC AACCTGATCG AAGACATCAG CGAGGAGAAG AAACTCCCTA CCCAAGTGGT GGAAGCAGCC CTGCGGGAGG CCCTCCTCAA AGGATATGAA CGCTACCGAC GCACTCTTTA TCTAGGTATC AGTGAGGACC CTTTCGAAGA AGAGTATTTC AGCAACTTCG ATGTTGGACT AGAGCTGGAC GATGAAGGTT ATCGGGTCCT GGCCAGCAAA ATCATCGTTG AAGAGGTGGA GAGCGAAGAC CACCAAATTG CTCTTCAAGA AGTGATGCAA GTCGCTGAAG ACGCCCAGAT CGGTGACACC GTGGTGCTCG ACGTTACCCC CGAGAAGGAA GACTTCGGCC GGATGGCTGC CGCCACAACC AAGCAGGTAC TGGCGCAAAA GCTACGCGAC CAGCAGCGCC GCATGATTCA AGAGGAATTT GCCGATCTAG AAGATCCCGT GCTCACAGCT CGTGTGATCC GTTTTGAACG TCATTCGGTG ATCATGGCCG TGAGTTCTGG TCTTGGGCGC CCTGAAGTGG AGGCCGAGCT CCCACGCCGC GACCAGCTCC CCAACGACAA TTATCGCGCC AACGCCACCT TCAAAGTCTT TCTGAAGGAA GTCAGCGAAG TGCCCCGACG AGGGCCTCAG TTGTTCGTTA GCCGCTCCAA CGCCGGACTA GTGGTTTACC TGTTTGAGAA CGAGGTGCCC GAAATCCAAG AAGGCTCAGT GCGCATTGTG GCCGTAGCCC GTGAAGCAAA TCCTCCGTCT CGTTCCGTGG GCCCACGCAC CAAGGTGGCA GTTGACAGTA TTGAACGTGA AGTGGACCCT GTCGGCGCCT GCATCGGTGC CCGCGGCTCA CGCATTCAGC AAGTGGTTAA TGAACTACGC GGCGAAAAAA TCGATGTGAT CCGCTGGTCA CCTGACCCGG GTCAATACAT TGCCAATTCC CTTAGCCCTG CTCGCGTTGA GATGGTGCGA CTGGTGGATC CAGAAGGGCA GCATGCCCAC GTCCTAGTCC CCCCAGATCA ACTAAGCCTG GCCATCGGAC GAGAAGGACA AAATGTACGC CTAGCAGCTC GTCTAACCGG ATGGAAAATC GACATCAAAA ACTCCCAGGA ATATGACCAG GCCAGTGAGG ACACCACCGT CGCCGAGCTG ATTTCTCAGA GAGAGGAAGA AGAGGCTCTC CAACGCGATG CCGAATCCCG TTTGGCTGCT GAACAAGCCA CCCGAGCAGA AGAGGATGCA CGCCTAAGAG AGCTTTACCC CCTACCGGAA GATGAAGAAG AGTACGACCA AGAAGAACCT GCTAAGACGA TGGCTGAAGA CGAAAATGCA TCCGACGCCG ACGGCCAACC TGACGACTTA AGCAGCCAAC CTGACACCTC AAGCGAACAA CTCTCAAATG AAGAATCAGT AGAGGAAGAG GACAGAGCCC GGTGA
|
Protein sequence | MALVLLPGLN NLIEDISEEK KLPTQVVEAA LREALLKGYE RYRRTLYLGI SEDPFEEEYF SNFDVGLELD DEGYRVLASK IIVEEVESED HQIALQEVMQ VAEDAQIGDT VVLDVTPEKE DFGRMAAATT KQVLAQKLRD QQRRMIQEEF ADLEDPVLTA RVIRFERHSV IMAVSSGLGR PEVEAELPRR DQLPNDNYRA NATFKVFLKE VSEVPRRGPQ LFVSRSNAGL VVYLFENEVP EIQEGSVRIV AVAREANPPS RSVGPRTKVA VDSIEREVDP VGACIGARGS RIQQVVNELR GEKIDVIRWS PDPGQYIANS LSPARVEMVR LVDPEGQHAH VLVPPDQLSL AIGREGQNVR LAARLTGWKI DIKNSQEYDQ ASEDTTVAEL ISQREEEEAL QRDAESRLAA EQATRAEEDA RLRELYPLPE DEEEYDQEEP AKTMAEDENA SDADGQPDDL SSQPDTSSEQ LSNEESVEEE DRAR
|
| |