Gene Cyan8802_3391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_3391 
SymbolnusA 
ID8392727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp3457613 
End bp3458851 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content47% 
IMG OID644981328 
Producttranscription elongation factor NusA 
Protein accessionYP_003139054 
Protein GI257061166 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0485206 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.931935 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTTG TGAGTTTGCC AGGGTTGGCA AATATGATTG AAGAAATTAG CCAACTACAC 
AACCTACCCA AATCCGCCGT ACAAGAAGCT TTACGAGAAG CCTTGCTTAA AGGCTATGAA
CGCTACCGTC GTTCCCAAAA CTTAGAACGT CAGGCCTTTC ACGAAGATTA CTTCGACAAT
TTTGAAGTGG AACTGGACAC CGAAGAAGAA GGGTTTCGTA TTCTATCGAC GAAAAAAATT
GTCGAAGCCG TCGAGAATAC TGATCATTTT ATTTCCCTCG AAGAAGTCCA AGAAGTCGCT
TCCGAAGCGC AACTGGGGGA TGAAGTCGTC TTAGATGTTA CCCCTAACCA AAAAGATTTT
GGGCGCATGG CTGCCATTCA AACGAAACAG GTTCTGCTGC AAAAACTGCG AGATCAGCAG
AGAAAACTGA TTCAAGAAGA ATTTAACGAA ATAGAAGGAA CGGTGCTCAA CGCCAGAGTT
TTGCGGTTTG AGCGACAAGA CGCGATCGTT GCCGTGCAAA GTACCTTTGG ACAACCCGAA
GTCGAAGCCG TTTTACCCAA ACGCGAACAA CTCCCCAACG ATAATTACCG AGCTAATGCC
ACCTTTAAAG TCTTACTCAA AAAAGTCCGT GAAGGCTCCC ACCGAGGACC TCAATTGATT
GTCTCCCGTT CGGCAGCCGG GTTAGTGGTA GATTTGTTTA CCGTTGAAGT CCCTGAAATT
GAAGAAGAAA TCGTGCGGAT CGTGGCGGTT TCCCGCGAAG CGAACCCCCC TTCGCGCCAT
GTAGGGCCGC GGACGAAAAT AGCCGTCGAT ACCCTCGAAA GGGATGTTGA TCCCGTGGGA
GCGTGTATTG GAGCTAGGGG ATCGCGTATT CAAGCTGTCG TTAATGAACT GCGAGGAGAA
AAAATCGATG TGATTCGGTG GTCTCCTGAT CCAGCTACTT ATATTGCCAA TGCGTTAAGT
CCAGCTAGGG TGGATAATGT TATCTTAATT AATCCTGATG AACGTCATGC CCTTGTCTTA
GTAGCCGAAG ATCAACTCAG TTTAGCCATT GGGAAAGAAG GGCAAAACGT GCGCCTAGCA
GCTCGTTTGA CGGGATGGAA AATCGATATT AAGGATACAG CGACTTATCA AGCAGAAGTG
GAACAAAAAA AACACGCACA GAATCAGACA GCCATTGATC CCCTAGAAGA AGACCCCAAT
CTTCCCTTAA CCGAGATGTC AGAAACACAG ATAGGTTAG
 
Protein sequence
MSLVSLPGLA NMIEEISQLH NLPKSAVQEA LREALLKGYE RYRRSQNLER QAFHEDYFDN 
FEVELDTEEE GFRILSTKKI VEAVENTDHF ISLEEVQEVA SEAQLGDEVV LDVTPNQKDF
GRMAAIQTKQ VLLQKLRDQQ RKLIQEEFNE IEGTVLNARV LRFERQDAIV AVQSTFGQPE
VEAVLPKREQ LPNDNYRANA TFKVLLKKVR EGSHRGPQLI VSRSAAGLVV DLFTVEVPEI
EEEIVRIVAV SREANPPSRH VGPRTKIAVD TLERDVDPVG ACIGARGSRI QAVVNELRGE
KIDVIRWSPD PATYIANALS PARVDNVILI NPDERHALVL VAEDQLSLAI GKEGQNVRLA
ARLTGWKIDI KDTATYQAEV EQKKHAQNQT AIDPLEEDPN LPLTEMSETQ IG