Gene Smed_3206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3206 
Symbolrho 
ID5324085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3381576 
End bp3382841 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content58% 
IMG OID640792154 
Producttranscription termination factor Rho 
Protein accessionYP_001328865 
Protein GI150398398 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAAA TGAAGCTTCA AGAACTTAAG AACAAGACTC CGACCGACCT TCTGGCCCTT 
GCCGAAGAGC TCGAGGTGGA GAATGCCAGC ACGATGCGCA AGCAGGAGCT GATGTTCGCG
ATCCTCAAAA TGCTGGCTCA GCAGGAGATC GAGATCATCG GCGAGGGTGT CGTCGAGGTC
CTGCAGGACG GATTTGGCTT CCTGCGCTCG GCGAATGCGA ATTATCTTCC AGGCCCGGAC
GATATCTACA TTTCGCCTTC CCAGATCCGC CGCTTCTCGC TGAAGACCGG CGACACGGTC
GAGGGTCCGA TCCGCGGACC CAAGGAAGGC GAGCGCTATT TCGCACTGCT GAAGGTCAAC
ACGATCAATT TCGACGATCC GGAGAAGATC CGCCACAAGG TGCACTTCGA CAACCTGACG
CCGCTCTATC CGAACGAGCG CTTCAAGATG GAACTCGAGG TCCCGACCTC CAAGGACCTG
TCGGCACGTG TCATAGATCT GGTCGCGCCG CTCGGCAAAG GCCAGCGTGG CTTGATCGTT
GCGCCGCCTC GTACCGGTAA AACGGTTCTT CTGCAGAATA TCGCGCATTC GATCACGGCG
AATCACCCGG AGTGCTATCT GATCGTCCTC CTGATCGACG AGCGGCCGGA GGAAGTCACG
GACATGCAGC GTTCGGTCAA GGGCGAGGTG GTATCCTCGA CCTTCGACGA GCCGGCAACA
CGCCACGTAC AGGTCGCGGA AATGGTGATC GAGAAGGCAA AGCGCCTCGT GGAACATGGG
CGCGATGTGG TGATCCTGCT CGATTCGATC ACTCGTCTCG GCCGCGCATA CAACACAGTC
GTTCCCTCAT CGGGCAAGGT CCTGACCGGT GGTGTCGACG CCAATGCGCT GCAGCGCCCC
AAGCGCTTCT TCGGCGCGGC TCGTAACATC GAGGAAGGCG GGTCGCTGAC GATCATTGCG
ACGGCGCTGA TCGACACCGG CAGCCGAATG GATGAAGTGA TCTTCGAAGA GTTCAAGGGC
ACCGGCAACT CGGAAATCGT TCTGGACCGC AAGGTGGCAG ACAAGCGCAT CTTCCCGTCA
ATGGATATTC TCAAATCCGG CACGCGTAAG GAAGATCTCC TGGTCCCGCG TCAGGACCTG
CAGAAGATCT TCGTTCTTCG CCGCATTCTC GCGCCGATGG GGACGACCGA TGCGATCGAA
TTCCTCATCG ACAAGCTCAA GCAGACGAAG ACGAACGGCG ACTTCTTCGA CTCGATGAAT
ACATAG
 
Protein sequence
MAEMKLQELK NKTPTDLLAL AEELEVENAS TMRKQELMFA ILKMLAQQEI EIIGEGVVEV 
LQDGFGFLRS ANANYLPGPD DIYISPSQIR RFSLKTGDTV EGPIRGPKEG ERYFALLKVN
TINFDDPEKI RHKVHFDNLT PLYPNERFKM ELEVPTSKDL SARVIDLVAP LGKGQRGLIV
APPRTGKTVL LQNIAHSITA NHPECYLIVL LIDERPEEVT DMQRSVKGEV VSSTFDEPAT
RHVQVAEMVI EKAKRLVEHG RDVVILLDSI TRLGRAYNTV VPSSGKVLTG GVDANALQRP
KRFFGAARNI EEGGSLTIIA TALIDTGSRM DEVIFEEFKG TGNSEIVLDR KVADKRIFPS
MDILKSGTRK EDLLVPRQDL QKIFVLRRIL APMGTTDAIE FLIDKLKQTK TNGDFFDSMN
T