Gene TM1040_2910 details


Gene Information

Locus tag: TM1040_2910
Symbol: nusA
ID: 4078588
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 3077203
End bp: 3078819
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 59%
IMG OID: 638008239
Product: transcription elongation factor NusA
Protein accession: YP_614904
Protein GI: 99082750
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
            [TIGR01954] transcription termination factor NusA, C-terminal duplication
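
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon; neither convention is stated in the record itself. The variable names are chosen for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 3077203
end_bp = 3078819
gene_length_bp = 1617
protein_length_aa = 538

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon that is not translated.
codons, remainder = divmod(gene_length_bp, 3)
residues = codons - 1

print(span == gene_length_bp)         # coordinate span matches the gene length
print(remainder == 0)                 # length is a whole number of codons
print(residues == protein_length_aa)  # codons minus the stop match the protein length
```

Under these assumptions all three checks print True: 3078819 - 3077203 + 1 = 1617, and 1617 / 3 - 1 = 538.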


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTATCA CCTCTGCAAA CCAGCTGGAG CTGTTGCAAA CCGCCGAGGC CGTGGCGCGT 
GAGAAAATGA TCGATCCCGG TCTGGTGGTC GAAGCGATGG AAGAATCCCT CGCCCGCGCC
GCCAAGAGCC GCTACGGCAG CGAGATGGAC ATTCGTGTCT CCATTGACCG CAAGACCGGT
AAGGCGACTT TCACTCGTGT GCGCACCGTG GTCGAAGACG AAGAGCTCGA AAACTACCAG
TCCGAGCTGA CCGTCGCGCA GGCCAAGCAG TATATGGAAG ACCCCAAGGT CGGCGACACC
ATCGTTGATG AGGTTCCCCC GGTCGAGATG GGCCGGATCG CGGCACAATC CGCCAAGCAG
GTGATCCTGC AGAAAGTGCG CGAAGCAGAG CGGGATCGTC AGTACGAAGA GTTCAAGGAT
CGCAACGGCA CCATCATCAA TGGCGTCGTC AAGCGAGAGG AATACGGCAA CGTCATCGTC
GATATCGGAT CTGGCGAAGG CATTCTGCGT CGCAACGAGA AAATCGGCCG TGAGAGCTAT
CGCCCGAACG ACCGTATTCG CTGCTTCATC AAGGACGTAC GCCGCGAACC CCGTGGCCCG
CAGATCTTCC TCAGCCGCAC CGCGCCGGAG TTCATGGCCG AGCTCTTCAA GATGGAAGTG
CCTGAAATCT ATGACGGCAT CATCGAGATC AAGGCTGTGG CCCGTGACCC CGGTTCGCGT
GCAAAGATCG CTGTTGTGTC CTATGACGGG TCGATCGATC CGGTTGGCGC CTGTGTCGGT
ATGCGTGGCT CCCGCGTGCA GGCGGTCGTG AACGAACTGC AGGGCGAAAA GATCGACATC
ATCCCTTGGA ACGAAGATCA GCCGACCTTC CTTGTGAACG CGCTGCAGCC CGCAGAGGTC
TCCAAGGTTG TCTTGGACGA AGAAGCCGGC AAGATCGAAG TGGTTGTGCC CGACGAGCAG
CTTTCTCTGG CGATTGGCCG TCGCGGTCAG AACGTACGTC TGGCGTCTCA GCTGACCAAC
CTCGACATCG ACATCATGAC GGAAGAGGAA GAATCCGCAC GCCGTCAGAA GGAATTCGAG
GCGCGCACCG CACTGTTCAT GGAAACGCTC GATCTCGACG AGTTCTTTGC ACAGCTTCTG
GTTTCTGAAG GCTTCACCAA CCTCGAAGAG GTCGCCTATG TCGAACTCGA CGAACTCTTG
GTGATCGATG GCGTCGACGA AGGCACCGCC GAAGAACTGC AGGCCCGCGC GCGCGATTAT
CTCGAAGCCA AGGCCAAGGC CGCGCTCGAC AACGCCCGCA GCATGGGCGT CGAGGACAGC
CTTATTGACT TTGACGGCCT GACACCCCAG ATGGTTGAGG CACTGGCGAA GGATGATGTG
AAATCGCTTG AAGACTTCGC AACCTGTGCG GACTGGGAGC TTGCGGGTGG CTGGACCACC
GTCAACGGCG AGCGTGTCAA GGATGAAGGG ATTCTCGAGC CCTTCGATGT GAGCCTCGAA
GAGGCGCAAA ATCTGGTGAT GACGGCGCGG ATTATGCTCG GCTGGGTCGA CCCGGCAGAA
CTTGAATCCG ATGCTGATGA TCTCGAGGAA GAAGCCGAAG GGGAAGCGGA AGCCTGA
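
The gene sequence is printed in space-separated blocks of 10 bases, 60 bases per line. A minimal sketch of how such text could be joined into one string and its GC fraction computed is shown here. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first two lines of the sequence are embedded, so the printed percentage describes that excerpt and not the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two lines of the gene sequence listed above.
first_lines = """
ATGGCTATCA CCTCTGCAAA CCAGCTGGAG CTGTTGCAAA CCGCCGAGGC CGTGGCGCGT
GAGAAAATGA TCGATCCCGG TCTGGTGGTC GAAGCGATGG AAGAATCCCT CGCCCGCGCC
"""

seq = join_blocks(first_lines)
print(len(seq))                    # number of bases in the excerpt
print(f"{gc_fraction(seq):.0%}")   # GC content of the excerpt only
```

Pasting the full 27-line sequence into `first_lines` would give a figure that can be compared with the GC content field in the record.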
 
Protein sequence
MAITSANQLE LLQTAEAVAR EKMIDPGLVV EAMEESLARA AKSRYGSEMD IRVSIDRKTG 
KATFTRVRTV VEDEELENYQ SELTVAQAKQ YMEDPKVGDT IVDEVPPVEM GRIAAQSAKQ
VILQKVREAE RDRQYEEFKD RNGTIINGVV KREEYGNVIV DIGSGEGILR RNEKIGRESY
RPNDRIRCFI KDVRREPRGP QIFLSRTAPE FMAELFKMEV PEIYDGIIEI KAVARDPGSR
AKIAVVSYDG SIDPVGACVG MRGSRVQAVV NELQGEKIDI IPWNEDQPTF LVNALQPAEV
SKVVLDEEAG KIEVVVPDEQ LSLAIGRRGQ NVRLASQLTN LDIDIMTEEE ESARRQKEFE
ARTALFMETL DLDEFFAQLL VSEGFTNLEE VAYVELDELL VIDGVDEGTA EELQARARDY
LEAKAKAALD NARSMGVEDS LIDFDGLTPQ MVEALAKDDV KSLEDFATCA DWELAGGWTT
VNGERVKDEG ILEPFDVSLE EAQNLVMTAR IMLGWVDPAE LESDADDLEE EAEGEAEA
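
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a sketch, not the database's own method. It builds the codon table for translation table 11 from the usual 64-letter amino acid string in TCAG order, stops at the first stop codon, and does not handle alternative start codons (the gene here begins with ATG, so that case does not arise). The function name `translate` is chosen for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (translation table 11);
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate block-formatted DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence from this record.
first_line = "ATGGCTATCA CCTCTGCAAA CCAGCTGGAG CTGTTGCAAA CCGCCGAGGC CGTGGCGCGT"
last_line = "CTTGAATCCG ATGCTGATGA TCTCGAGGAA GAAGCCGAAG GGGAAGCGGA AGCCTGA"

print(translate(first_line))  # MAITSANQLELLQTAEAVAR
print(translate(last_line))   # LESDADDLEEEAEGEAEA
```

The two outputs match the first 20 and the last 18 residues of the protein sequence above. The last line can be translated on its own because it starts at base 1561, which falls on a codon boundary.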