Gene Synpcc7942_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_2022 
SymbolnusA 
ID3774209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp2090378 
End bp2091703 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content57% 
IMG OID637800467 
Producttranscription elongation factor NusA 
Protein accessionYP_401039 
Protein GI81300831 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.142374 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAATGG TCACCCTGCC CGGCCTAGAG CAGCTGATCT ACGCCATTAG CGAGCAAAAA 
AAACTGCCCG CCAATGTCAT TGAAGAAGCC CTCAAAGAAG CCTTGCTCAA GGGCTACGAG
CGCTATCGCC GTACCCAGCA GATGGGTGAG CAGTTTGAAG AAGACTACTT CGACAACATT
GACGTTGAAC TCGATGTCGA ACAGGAAGGC TTTCGGGTAC TGGCAACCAA AACCATCGTC
AATCAGGTCG AAAATCCTGA CCATCAGATT GCCCTCGCCG ATGTTCAGGA AGTGGCTCCA
GATGCCCAAG CAGGCGAAAT CGTCGTTCTA GATGTCACAC CCGATAAAGA CGACTTTGGG
CGAATGGCGG CTATTCAGAC TAAGCAAGTC CTGTCGCAAA AACTGCGCGA TCACCAGCGC
AAACTGATCC AAGAAGAGTT CCAAGATCTA GAAGATCCGG TCTTGATGGC CAAGGTGCTG
CGCTTCGAGC GCCAGTCTGT GATCTTGGGG GTCAGCAGTG GTTTAGGACG TCCTGAAGTC
GAGGCAGAAC TGCCCCGTCG CGAACAACTG CCCAACGATA ACTACCGGGC CAACGCCACC
TTCCGCGTCT TCCTCAAGGA AGTCAGTGAA GTACCCCGTC GTGGACCGCA GTTGATTGTC
TCTCGGGCTA ACGCCGGTCT GGTTGTCTAC TTGTTCGAAA ACGAAGTTCC TGAAATCCAA
GATGGTGTCG TCCGCATTGT GGCGGTAGCG CGGGAAGCGA ATCCGCCGAC TCGGCATGTT
GGGCCGCGCA CCAAAATCGC TGTCGATACC TTGGAACGCG AAGTCGATCC GGTTGGGGCT
TGCATCGGGG CGCGGGGATC GCGGATTCAG GTGGTCGTCA ACGAATTGCG GGGCGAAAAA
ATTGATGTGA TCCGCTGGTC GCCGGATCCG GCCACCTATA TTGCCAATGC CCTCAGCCCT
GCCCGGGTTC AGGAAGTGCG CTTGGTCGAT CCCGAAGGTC GGATCGCCCA CGTTTTGGTC
AACGACGACC AACTCAGTCT CGCGATCGGC AAAGAGGGTC AGAACGTGCG CCTCGCGGCT
CGACTGACCG GCTGGAAAAT CGACATCAAG GACGTGGCGC TCTACGACGC AGTCACGGAA
GGTCAACGGA TTTCTGAACT GATTCAAGAA CGCCAAGAGC GGGCGGCGAT TGCTGCCGAA
GAAGAAGCCC GTGCTGCCGC CGAAGCTGCT GAACTGGCGG AATGGGAGGC GGAAGAGGCT
GCTCTCGCAG CCGCTGAAGC TGCGGCAGAA CTCGCAGCTG CTGAAGCTGA GGAAGAGACT
GTTTGA
 
Protein sequence
MSMVTLPGLE QLIYAISEQK KLPANVIEEA LKEALLKGYE RYRRTQQMGE QFEEDYFDNI 
DVELDVEQEG FRVLATKTIV NQVENPDHQI ALADVQEVAP DAQAGEIVVL DVTPDKDDFG
RMAAIQTKQV LSQKLRDHQR KLIQEEFQDL EDPVLMAKVL RFERQSVILG VSSGLGRPEV
EAELPRREQL PNDNYRANAT FRVFLKEVSE VPRRGPQLIV SRANAGLVVY LFENEVPEIQ
DGVVRIVAVA REANPPTRHV GPRTKIAVDT LEREVDPVGA CIGARGSRIQ VVVNELRGEK
IDVIRWSPDP ATYIANALSP ARVQEVRLVD PEGRIAHVLV NDDQLSLAIG KEGQNVRLAA
RLTGWKIDIK DVALYDAVTE GQRISELIQE RQERAAIAAE EEARAAAEAA ELAEWEAEEA
ALAAAEAAAE LAAAEAEEET V