Gene P9211_16141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_16141 
SymbolnusA 
ID5730773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1445233 
End bp1446648 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content43% 
IMG OID641285992 
Producttranscription elongation factor NusA 
Protein accessionYP_001551499 
Protein GI159904155 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACTAG TTCTTCTCCC TGGACTCAAT AACCTGATCG ACGACATAAG TGAGGAGAAA 
AAGCTCCCTG CTCAGGTCGT TGAGACTGCT CTTAGGGAAG CACTCCTAAA AGGTTATGAA
AGATATAGAC GAACCCTCTA TCTAGGGATT AATGAAAACC CCTTCGAAGA GGAATACTTT
AGTAATTTTG ATGTTGGACT GGATCTTGAT GAAGAAGGTT ATCGAGTATT AGCAAGCAAA
ATCATTGTTG ACGAAGTTGA GAGTGAAGAT CATCAGATCG CTTTATCTGA AGTCATGCAA
GTTGCTGAAG ATGCTCAAAT AGGAGACACA GTAGTTCTAG ATGTAACTCC TGAGAAAGAA
GAGTTTGGAA GAATGGCAGC TGCAACAACT AAGCAAGTCC TTGCTCAAAA GTTGCGAGAT
CAACAGCGAA GAATGATTCA AGAAGAATTT GCAGATTTAG AAGATCCTGT CCTAACTGCT
CGAGTAATTA GATTCGAACG TCAATCAGTA ATCATGGCAG TCAGTTCAGG GCTAGGCAGA
CCAGAAGTTG AAGCGGAGCT CCCTCGCAGA GATCAACTGC CAAACGATAA TTATCGCGCA
AATGCAACTT TCAAAGTATT TCTAAAAGAA GTAAGTGAAA CACCCAGGCG AGGTCCTCAA
TTATTTGTTA GTAGATCTAA CGCTGGTTTA GTAGTTTATT TATTTGAAAA CGAGGTACCA
GAAATCCAAG AAGGGTCCGT CAGGATAGTA GCTGTGGCTA GAGAAGCTAA TCCTCCAACA
CGAGCTGTCG GACCCAGAAC AAAAGTCGCT GTTGATAGCA TTGAAAGAGA AGTGGACCCA
GTAGGCGCAT GCATAGGTGC AAGAGGATCC AGAATTCAGC AAGTAGTTAA TGAACTGAGA
GGAGAAAAAA TAGATGTGAT TCGCTGGTCT GCAGACCCAG TCCAATACAT TTCTAATTCG
CTCAGTCCTG CAAGAGTTGA AGTTGTAAGG CTCGTTGATC CAGAAGGGCA GCATGCGCAT
GTCTTAGTGC CCCCTGATCA ACTTAGTCTT GCAATTGGAA GAGAAGGTCA AAATGTTCGA
TTAGCAGCAA GACTTACTGG CTGGAAAATC GATATTAAGA ATTCACAAGA ATATGACCAA
GAGTCTGAAG ATTCAGCAGT TGCTGAACTG ATCTCTCAAA GAGAAGAAGA AGAAAGTCTG
CAAAGAGAAG CTGAAGAAAG ATTGGCTGCA GAACAGGCTG CTAGGGCAGA AGAAGATGCC
CGACTAAGAG AGCTTTATCC TCTTCCAGAA GATGATGAAG AGAATATAGA AGAAAGTACA
ACTGAATTAG AAGAGCTTCC AATAAGTGAA AATGAAGAAG CCAAACAAAA TGAAGGATTG
AGTAATGAAC AAAGCCCCGA GGATGGACCC CGGTGA
 
Protein sequence
MALVLLPGLN NLIDDISEEK KLPAQVVETA LREALLKGYE RYRRTLYLGI NENPFEEEYF 
SNFDVGLDLD EEGYRVLASK IIVDEVESED HQIALSEVMQ VAEDAQIGDT VVLDVTPEKE
EFGRMAAATT KQVLAQKLRD QQRRMIQEEF ADLEDPVLTA RVIRFERQSV IMAVSSGLGR
PEVEAELPRR DQLPNDNYRA NATFKVFLKE VSETPRRGPQ LFVSRSNAGL VVYLFENEVP
EIQEGSVRIV AVAREANPPT RAVGPRTKVA VDSIEREVDP VGACIGARGS RIQQVVNELR
GEKIDVIRWS ADPVQYISNS LSPARVEVVR LVDPEGQHAH VLVPPDQLSL AIGREGQNVR
LAARLTGWKI DIKNSQEYDQ ESEDSAVAEL ISQREEEESL QREAEERLAA EQAARAEEDA
RLRELYPLPE DDEENIEEST TELEELPISE NEEAKQNEGL SNEQSPEDGP R