Gene Smed_3439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3439 
SymbolnusA 
ID5324325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3644337 
End bp3645995 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content61% 
IMG OID640792389 
Producttranscription elongation factor NusA 
Protein accessionYP_001329092 
Protein GI150398625 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.787039 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTGC AACAGACGGA GACAAGAGAC ATGGCAGTCA GTGCTAACCG GCTCGAACTT 
CTGCAGATCG CAGATGCTGT GGCACGCGAA AAGGTGATCG ACCGCGAGAT CGTTCTGGCC
GCGATGGCGG ATGCGATCCA GAAGGCGGCT CGTTCGCGCT ACGGTTCGGA ATCGAACATC
CGCGCCGACA TCAATCCGAA GACCGGGGAA ATCCGTCTCC AGCGCCTCCT GGAAGTGGTC
GAAAAGGCGG AAGACTATTC GACTCAGATT CCGATCGAAC TCGCCCGCGA TCGCAATCCC
GATGCAAAGC TCGGCGATTT CATCGCCGAT CCGCTCCCGC CCATGGATTT TGGCCGCATC
GCCGCTCAGT CGGCCAAGCA GGTTATCGTG CAGAAGGTGC GCGAAGCCGA GCGCGACCGC
CAGTACGACG AGTTCAAGGA CCGTGTCGGC GAGATCGTCA ACGGCACCGT CAAGCGCGTC
GAATATGGCA ATGTCATCGT CGATCTCGGG CGCGGTGAAG GCATCATCCG GCGCGACGAG
ATGATCCCGC GCGAAAACAT GCGTTACGGC GACCGCGTCC GTGCCTTTGT CTACGACGTG
CGCCGCGAGC AACGCGGACC GCAGATATTC CTGTCGCGCA CCCATCCGCA GTTCATGGTG
AAGCTCTTCA CCATGGAGGT ACCGGAAATC TACGACGGCG TCATCCAGAT CAAGTCGGTT
GCCCGCGATC CGGGCTCGCG CGCCAAGATC GCCGTCGTCT CGAACGATTC GTCGATCGAT
CCGGTCGGCG CCTGCGTCGG CATGCGCGGC TCGCGCGTGC AGGCTGTCGT CGGTGAACTC
CAGGGCGAAA AGATCGATAT CATCCCGTGG TCGCCGGATC CGGCTTCCTT CATCGTCAAT
GCGCTGCAGC CGGCGGAAGT GGCGAAGGTC GTTCTAGACG AGGATGCGGA GCGTATCGAA
GTCGTGGTTC CGGACGAGCA GCTTTCGCTC GCCATCGGCC GCCGCGGCCA GAACGTCCGT
CTCGCCTCGC AGCTGACCGG ATGGGATATC GATATCCTCA CCGAACAGGA GGAGAGCGAG
CGCCGTCAGA AGGAATTCAA CGAGCGCACA CAGCTCTTCA TGGAAGCCCT GGACGTCGAC
GAGATGGTAG GCCAGGTGCT CGCCTCCGAA GGCTTTGCCC AGGTGGAAGA GCTCGCTTAT
GTCGATCTCG ACGAAATTGC CTCCATCGAG GGCTTCGACG AGGAAACGTC GAACGAGATC
CAGACCCGCG CCCGCGAATA TCTCGAAAAG ATCGAGGCGG AAATGGACGC CAAGCGCAAG
GAACTCGGTG TTGCCGACGA GCTGCGCACG ATCAATGGGC TCAACAGCCA GATGCTGGTC
GCTCTCGGCG AGGAAGGCAT CAAGACGATA GAGGACTTTG CCGGCTGCGC CGCCGACGAC
CTCGTAGGCT GGGTCGAACG CAAGGATGGT GAGACCAAGC GCTTCGAGGG AACGTTCTCG
AAGCTCGAGG TTACCCGGGA AGAGGCCGAA GCGATGATCG TGCAGGCTCG TCTCGCTGCC
GGCTGGATCA CCGAAGAGGA TCTGGCCAAA CAACAGGAGG AAGAGCCGGA ACAGGATGAG
ACGATCGAAG TCGCCGAAGG CGCGGATCAG GACGCCTGA
 
Protein sequence
MRLQQTETRD MAVSANRLEL LQIADAVARE KVIDREIVLA AMADAIQKAA RSRYGSESNI 
RADINPKTGE IRLQRLLEVV EKAEDYSTQI PIELARDRNP DAKLGDFIAD PLPPMDFGRI
AAQSAKQVIV QKVREAERDR QYDEFKDRVG EIVNGTVKRV EYGNVIVDLG RGEGIIRRDE
MIPRENMRYG DRVRAFVYDV RREQRGPQIF LSRTHPQFMV KLFTMEVPEI YDGVIQIKSV
ARDPGSRAKI AVVSNDSSID PVGACVGMRG SRVQAVVGEL QGEKIDIIPW SPDPASFIVN
ALQPAEVAKV VLDEDAERIE VVVPDEQLSL AIGRRGQNVR LASQLTGWDI DILTEQEESE
RRQKEFNERT QLFMEALDVD EMVGQVLASE GFAQVEELAY VDLDEIASIE GFDEETSNEI
QTRAREYLEK IEAEMDAKRK ELGVADELRT INGLNSQMLV ALGEEGIKTI EDFAGCAADD
LVGWVERKDG ETKRFEGTFS KLEVTREEAE AMIVQARLAA GWITEEDLAK QQEEEPEQDE
TIEVAEGADQ DA