Gene NATL1_18931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_18931 
SymbolnusA 
ID4779831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1555126 
End bp1556619 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content40% 
IMG OID640085182 
Producttranscription elongation factor NusA 
Protein accessionYP_001015713 
Protein GI124026598 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCTTG TTCTACTCCC AGGCTTAAAT AACTTAATTG AAGACATTAG TGAGGAAAAA 
AAGCTTCCAT CACAAGTCGT CGAAGCTGCT TTGAGAGAAG CACTTCTAAA AGGTTATGAA
CGATATCGAA GAACTCTTTA TTTAGGTATC AGCGAAGATC CTTTTGAAGA AGATTATTTC
AGCAACTTTG ATGTTGGTTT AGATCTAGAA GAGGAGGGGT ATAGAGTTTT AGCCAGTAAA
ATAATAGTTG AGGAGGTAGA GAGTGATGAT CATCAAATAG CAATAGCAGA GGTTATGCAA
GTTGCTGATG ATGCTCAAGT TGGCGATACC GTTGTACTAG ATGTGACTCC AGAGAAAGAA
GATTTTGGAA GAATGGCTGC GGCTACAACC AAGCAGGTTT TAGCCCAAAA ATTGAGAGAT
CAGCAAAGAA GAATGATTCA AGAGGAATTT GCTGACCTAG AAGATCCAGT GCTAACTGCC
AGGGTCATTA GATTTGAAAG GCAGTCAGTA ATTATGGCAG TAAGCTCTGG TTTAGGTAGA
CCAGAAGTAG AAGCAGAGCT TCCAAGACGA GATCAATTGC CAAATGATAA TTACCGAGCA
AACGCGACAT TTAAAGTATT TCTTAAAGAA GTTAGTGAAA CGCCTAGGAG AGGTCCACAA
CTTTTTGTTA GCAGGTCAAA TGCTGGATTG GTTGTATATC TGTTTGAGAA TGAAGTTCCT
GAAATTCAAG AAGGTTCAGT ACGTATAGTT GCTGTCGCTC GTGAAGCAAA TCCACCTTCA
AGAGCAGTAG GTCCCAGAAC AAAAGTAGCT GTTGACAGTA TTGAAAGAGA GGTTGACCCC
GTTGGAGCAT GTATTGGTGC TAGAGGTGCT CGAATTCAAC AAGTTGTAAA TGAACTTCGT
GGAGAAAAAA TTGATGTAAT TCGTTGGTCT TCAGACCCTG TTCAATACAT CTGTAATTCA
TTAAGTCCTG CAAGAGTTGA AGTAGTTAGA CTTGTGGATC CAGAAGGTCA GCATGCTCAT
GTCTTAGTTC CTCCAGATCA ACTTAGCCTC GCAATTGGAA GAGAAGGGCA AAATGTAAGA
CTTGCTGCAA GGCTTACAGG ATGGAAAATA GATATCAAAA ATTCACAAGA ATATGATCAA
GCAACTGAAG ACTCGGAGGT TGCGGAATTA ATTTCTCAAA GAGAAGAAGA AGAATCTTTA
CAGAGAGAAG CTGAGAAGAG ATTAGAGGCA GAACAGACAG CTAGAGCTGA GGAAGATGCA
CGCCTAAGAG AGCTTTACCC CCTCCCAGAG GATGAAGAAG ACTTTCAAAA TGAGGAATTG
TCAAACATCG ATAGCAATGA AACAGAATCT GAATCTGATT TTGAATCTTC TGAAATTGAT
TCAACTATCG ATACTGATGA CACTGAAACT GAAGAAAGTG ATATTGATGA AACAATCGAA
GACACAGTTA CTGCTAAAGA TAAAATAATT ACAGAAGAGG AAGGAACCCG GTGA
 
Protein sequence
MALVLLPGLN NLIEDISEEK KLPSQVVEAA LREALLKGYE RYRRTLYLGI SEDPFEEDYF 
SNFDVGLDLE EEGYRVLASK IIVEEVESDD HQIAIAEVMQ VADDAQVGDT VVLDVTPEKE
DFGRMAAATT KQVLAQKLRD QQRRMIQEEF ADLEDPVLTA RVIRFERQSV IMAVSSGLGR
PEVEAELPRR DQLPNDNYRA NATFKVFLKE VSETPRRGPQ LFVSRSNAGL VVYLFENEVP
EIQEGSVRIV AVAREANPPS RAVGPRTKVA VDSIEREVDP VGACIGARGA RIQQVVNELR
GEKIDVIRWS SDPVQYICNS LSPARVEVVR LVDPEGQHAH VLVPPDQLSL AIGREGQNVR
LAARLTGWKI DIKNSQEYDQ ATEDSEVAEL ISQREEEESL QREAEKRLEA EQTARAEEDA
RLRELYPLPE DEEDFQNEEL SNIDSNETES ESDFESSEID STIDTDDTET EESDIDETIE
DTVTAKDKII TEEEGTR