Gene GSU1586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1586 
SymbolnusA 
ID2687296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1738694 
End bp1739851 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content59% 
IMG OID637126266 
Producttranscription elongation factor NusA 
Protein accessionNP_952637 
Protein GI39996686 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00876176 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAAACGA CCTTCAACCT CAAGCACATT ATTGACCAGA TCGTCAAGGA GAAGGGGATT 
GACCGGCACA TCGTCGTGGA AGCCCTGGAG CAGGCGGTAC TCACCGCTGC GAACAAGAAG
TTCCGCAATA CTCGTGATCT TGAGGCCCAC TATAACCCGG AAGTTGGCGA AGTTGAGCTC
TTTGAGTTCG TTACCGTGGT CGACGAGGTT CAGGATTCCT ACAAGGAAAT CGACATGGAA
GAGGCCCGGG AGATCGACCC TGACGTGGAA GTGGGCGATT CTCTCGGTAT GAAGCTGGAT
GCAAGCGGTT TTACCCGTAT CGCCGCCCAG ACCGCCAAGC AGGTCATCAT CCAGAAGGTG
CGCGAGGCGG AGCGGGAAAC TATTTTCAAC GAGTTCAAGG ACCGGATCGG CGAACTGGTG
ACCGGCGTTG TGCGCCGCTT TGAAAAAGGT GATCTGGTAA TCGATCTCGG GCGCGCCGAA
GCGGTGCTTT CCCATAAGGA GCAGGCGCCG CGCGAGGTGT ATCGCCAGGG TGACCGCGTT
AAGACTCTGA TCACCGACAT CCGGATGACC CCAAAGGGGC CCCAGATCGT TCTGTCGCGT
ACCCATCCCG GCGTCCTTGC CAAGCTTTTC GAGGCGGAGG TTCCGGAGAT CGCCGAAGGG
ATCGTGGAGA TCAAGGCCGT TGTACGTGAG CCGGGCAGCC GGGCCAAGAT CGCCGTCTAC
TCCCATGATT CCGATGTGGA TCCCGTTGGG GCCTGCGTGG GTATGCGGGG TAGCCGCGTG
CAGAATGTGG TGTCCGAGCT GAGGGGTGAA AAGATCGATA TCATCCCCTG GTCCGATGAC
GCGGCACGCT TTGCGTGCAA TGCGCTGCAA CCGGCCGTGG TGTCGAAGGT GTACATTGAC
GACGAGAACC GCTCCATGGA GATAATCGTC GCCGACGACC AACTGTCGCT GGCTATCGGT
AAAAAAGGGC AGAACGTGCG GCTTGCCGCA AAGCTTACCG GCTGGCGCAT CGACATCAAG
AGCGAAACCA CTGCTGCCGA GGCGGAACTG CTCCAGTATT CCTCCTATGA TGGGGCCACC
GAAGAGGTTG CTGAAGAGGC CGCCCAAGCC GTTGAGACCG AAGGCGAAGC GGTTGCAGAG
GAGCAGGTGG AAGCATAG
 
Protein sequence
METTFNLKHI IDQIVKEKGI DRHIVVEALE QAVLTAANKK FRNTRDLEAH YNPEVGEVEL 
FEFVTVVDEV QDSYKEIDME EAREIDPDVE VGDSLGMKLD ASGFTRIAAQ TAKQVIIQKV
REAERETIFN EFKDRIGELV TGVVRRFEKG DLVIDLGRAE AVLSHKEQAP REVYRQGDRV
KTLITDIRMT PKGPQIVLSR THPGVLAKLF EAEVPEIAEG IVEIKAVVRE PGSRAKIAVY
SHDSDVDPVG ACVGMRGSRV QNVVSELRGE KIDIIPWSDD AARFACNALQ PAVVSKVYID
DENRSMEIIV ADDQLSLAIG KKGQNVRLAA KLTGWRIDIK SETTAAEAEL LQYSSYDGAT
EEVAEEAAQA VETEGEAVAE EQVEA