Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_16961 |
Symbol | nusA |
ID | 4718426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1438788 |
End bp | 1440191 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640079422 |
Product | transcription elongation factor NusA |
Protein accession | YP_001010086 |
Protein GI | 123969228 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0339666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATTAG TTATTCTCCC AGGTTTAAAC AATCTCATTG AAGATATTAG TGAGGAAAAA AAGTTACCTC CTAATATCGT GGAATTAGCC TTACGCGAAG CTTTATTAAA AGGATATGAA AAATATAGAA AAACTTTTTA CATTGGAGTT AACCAAGATC CATTTGATGA AGAATATTTT AGTAATTTTG ATGTTGGACT AGATCTAGAT GAAGAAGGTT ACAGGATATT ATCAAGTAAA ATTATTGTAG AAGAAGTTGA GAGCGAAGAT CATCAAATAT CTCTAGTAGA AGTTAAGCAA GTCGCTGATG ATGCTCAAAT AGGTGACACA GTTGTTTTAG ACGTTACTCC AGAAAAAGAG GATTTTGGGC GAATGGCTGC TTCAACAACA AAGCAAGTTT TAGCCCAAAA GTTAAGAGAT CAACAACGAA AAATGATCCA GGAAGAATTT GCGGATTTGG AAGATCCTGT TTTAACGGCA AGAGTTATAA GATTTGAAAG ACAATCAGTC ATTATGGGAG TTAGTTCGGG TATTGGTAGA CCTGAAGTTG AGGCCGAACT TCCCAAGAGA GATCAATTAC CAAATGATAA TTATAGAGCA AATGCAACTT TTAAAGTATT TTTGAAAGAA GTTAGCGAAA TTGCCAGAAA AGGGCCGCAA CTTTTTGTAA GTAGAGCAAA TGCTGGTTTA GTGGTTTATT TATTTGAAAA TGAAGTACCG GAAATTCAAG AAGGTACAGT GAAAATTGTT GCTGTTTCAA GAGAAGCCAA CCCTCCTTCA AGAGCTGTTG GGCCAAGAAC AAAAGTAGCT GTTGATAGTG TCGAAGAAGA AGTGGACCCT GTAGGTGCAT GTATTGGAGC TAGAGGAGCA AGAATTCAAC AAGTAGTAAA TGAATTAAGG GGTGAAAAAA TTGATGTTAT TAAATGGTCA TCTAACCCAA TACAGTATAT TTTAAACTCT TTAAGTCCTG CGAAAGTAGA TCAAGTAAGA CTTGTAGACC CAGCAGGGCA ACATGCGCAC GTACTAGTTC CTCCTGATCA ATTAAGTCTC GCAATTGGTA GAGAAGGTCA AAATGTAAGA CTTGCCGCAA GATTAACTGG TTGGAAGATT GACGTTAAAA ACTCACATGA ATACGATCAG GAAGCAGAAG ATGCTGCGGT CTCTGAATTA ATTATTCAAA GGGAAGATGA AGAGAATCTC CAGAGAGAAG CTGAATTAAG ATTAGAAGCA GAACAAGCTG AGCGTGCTGC AGAAGATGCG AGATTAAGAG AGCTTTATCC TCTTCCCGAA GATGAAGAAG AATATGGAGA GGAACAATAC GAAGGAGTAG AATTCACAGA TAATGATCCA TTAGAGACTG TTCAAGATAC TGAGACATCT GCCAAAGAGG AGAAAAAACG GTGA
|
Protein sequence | MALVILPGLN NLIEDISEEK KLPPNIVELA LREALLKGYE KYRKTFYIGV NQDPFDEEYF SNFDVGLDLD EEGYRILSSK IIVEEVESED HQISLVEVKQ VADDAQIGDT VVLDVTPEKE DFGRMAASTT KQVLAQKLRD QQRKMIQEEF ADLEDPVLTA RVIRFERQSV IMGVSSGIGR PEVEAELPKR DQLPNDNYRA NATFKVFLKE VSEIARKGPQ LFVSRANAGL VVYLFENEVP EIQEGTVKIV AVSREANPPS RAVGPRTKVA VDSVEEEVDP VGACIGARGA RIQQVVNELR GEKIDVIKWS SNPIQYILNS LSPAKVDQVR LVDPAGQHAH VLVPPDQLSL AIGREGQNVR LAARLTGWKI DVKNSHEYDQ EAEDAAVSEL IIQREDEENL QREAELRLEA EQAERAAEDA RLRELYPLPE DEEEYGEEQY EGVEFTDNDP LETVQDTETS AKEEKKR
|
| |