Gene P9303_04151 details

Gene Information

Locus tag: P9303_04151
Symbol: nusA
ID: 4776556
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 415001
End bp: 416455
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 55%
IMG OID: 640085919
Product: transcription elongation factor NusA
Protein accession: YP_001016432
Protein GI: 124022125
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
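
The start, end, gene length and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The record does not state either assumption, but both are consistent with the numbers it gives.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(415001, 416455)
print(length_bp)                  # 1455
print(protein_length(length_bp))  # 484
```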


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.659645
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCTCG TTCTACTCCC CGGCCTCAAC AACCTGATCG AAGACATCAG CGAGGAGAAG 
AAACTCCCTA CCCAAGTGGT GGAAGCAGCC CTGCGGGAGG CCCTCCTCAA AGGATATGAA
CGCTACCGAC GCACTCTTTA TCTAGGTATC AGTGAGGACC CTTTCGAAGA AGAGTATTTC
AGCAACTTCG ATGTTGGACT AGAGCTGGAC GATGAAGGTT ATCGGGTCCT GGCCAGCAAA
ATCATCGTTG AAGAGGTGGA GAGCGAAGAC CACCAAATTG CTCTTCAAGA AGTGATGCAA
GTCGCTGAAG ACGCCCAGAT CGGTGACACC GTGGTGCTCG ACGTTACCCC CGAGAAGGAA
GACTTCGGCC GGATGGCTGC CGCCACAACC AAGCAGGTAC TGGCGCAAAA GCTACGCGAC
CAGCAGCGCC GCATGATTCA AGAGGAATTT GCCGATCTAG AAGATCCCGT GCTCACAGCT
CGTGTGATCC GTTTTGAACG TCATTCGGTG ATCATGGCCG TGAGTTCTGG TCTTGGGCGC
CCTGAAGTGG AGGCCGAGCT CCCACGCCGC GACCAGCTCC CCAACGACAA TTATCGCGCC
AACGCCACCT TCAAAGTCTT TCTGAAGGAA GTCAGCGAAG TGCCCCGACG AGGGCCTCAG
TTGTTCGTTA GCCGCTCCAA CGCCGGACTA GTGGTTTACC TGTTTGAGAA CGAGGTGCCC
GAAATCCAAG AAGGCTCAGT GCGCATTGTG GCCGTAGCCC GTGAAGCAAA TCCTCCGTCT
CGTTCCGTGG GCCCACGCAC CAAGGTGGCA GTTGACAGTA TTGAACGTGA AGTGGACCCT
GTCGGCGCCT GCATCGGTGC CCGCGGCTCA CGCATTCAGC AAGTGGTTAA TGAACTACGC
GGCGAAAAAA TCGATGTGAT CCGCTGGTCA CCTGACCCGG GTCAATACAT TGCCAATTCC
CTTAGCCCTG CTCGCGTTGA GATGGTGCGA CTGGTGGATC CAGAAGGGCA GCATGCCCAC
GTCCTAGTCC CCCCAGATCA ACTAAGCCTG GCCATCGGAC GAGAAGGACA AAATGTACGC
CTAGCAGCTC GTCTAACCGG ATGGAAAATC GACATCAAAA ACTCCCAGGA ATATGACCAG
GCCAGTGAGG ACACCACCGT CGCCGAGCTG ATTTCTCAGA GAGAGGAAGA AGAGGCTCTC
CAACGCGATG CCGAATCCCG TTTGGCTGCT GAACAAGCCA CCCGAGCAGA AGAGGATGCA
CGCCTAAGAG AGCTTTACCC CCTACCGGAA GATGAAGAAG AGTACGACCA AGAAGAACCT
GCTAAGACGA TGGCTGAAGA CGAAAATGCA TCCGACGCCG ACGGCCAACC TGACGACTTA
AGCAGCCAAC CTGACACCTC AAGCGAACAA CTCTCAAATG AAGAATCAGT AGAGGAAGAG
GACAGAGCCC GGTGA
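
A GC percentage like the one in the gene information can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper, and it assumes the sequence is pasted as plain text in blocks of 10 bases, as shown above, so whitespace is stripped before counting.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First block of the gene sequence, used only as a small worked example.
print(gc_percent("ATGGCTCTCG"))  # 60.0
```

Applying the same function to the full 1455 bp sequence gives a value that can be compared with the listed GC content.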
 
Protein sequence
MALVLLPGLN NLIEDISEEK KLPTQVVEAA LREALLKGYE RYRRTLYLGI SEDPFEEEYF 
SNFDVGLELD DEGYRVLASK IIVEEVESED HQIALQEVMQ VAEDAQIGDT VVLDVTPEKE
DFGRMAAATT KQVLAQKLRD QQRRMIQEEF ADLEDPVLTA RVIRFERHSV IMAVSSGLGR
PEVEAELPRR DQLPNDNYRA NATFKVFLKE VSEVPRRGPQ LFVSRSNAGL VVYLFENEVP
EIQEGSVRIV AVAREANPPS RSVGPRTKVA VDSIEREVDP VGACIGARGS RIQQVVNELR
GEKIDVIRWS PDPGQYIANS LSPARVEMVR LVDPEGQHAH VLVPPDQLSL AIGREGQNVR
LAARLTGWKI DIKNSQEYDQ ASEDTTVAEL ISQREEEEAL QRDAESRLAA EQATRAEEDA
RLRELYPLPE DEEEYDQEEP AKTMAEDENA SDADGQPDDL SSQPDTSSEQ LSNEESVEEE
DRAR
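
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration with a hypothetical `translate` helper. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two tables differ only in which codons may act as start codons), and it assumes the input is a complete in-frame CDS.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each run T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGGCTCTCG TTCTACTCCC CGGCCTCAAC"))  # MALVLLPGLN
print(translate("GACAGAGCCC GGTGA"))                  # DRAR
```

The two printed fragments match the first block and the last block of the protein sequence listed above.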