Gene Smed_5128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5128 
Symbol 
ID5319430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp81810 
End bp83120 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content61% 
IMG OID640776906 
Producthypothetical protein 
Protein accessionYP_001313838 
Protein GI150377243 
COG category[S] Function unknown 
COG ID[COG4655] Predicted membrane protein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.193766 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGA GAACTGACGC TATCGATGAA GCGAGCCGTT CGTCTCGGTG GCAGAAGCCT 
CTGCGCCGGT TTCTGAAAGC CGAAGGCGGC GCCGTCGCCG TCATCGCCGC GGTTGCATTT
CCCGTACTCG TCGGTGCAAT GGGTTTGGGC GCCGAGACGG GGTACTGGTA TCTGGAAAAG
CGCAAGCTGC AGCATGCTGC CGACGTTTCG GCCTATGCCG CAGCGGTCCG CCATCGCGCG
GGTGATCAGC AGTCCGCGCT TGAAGCCGCC GCACGCAGGG TCGCAGGCGG TTCGGGCTTT
TCGCCGGGGG ACCTCACCTT GTCCACGGCT TCGGCCGCTG CCGGCGGCTC GAACAATGTG
ACGGTCGAGC TGACCGAGAC CCATCCGCGC CTGTTTTCTT CCGTCTTCGG CACCGGAACC
ATCACGATAA AGGCACGCGC AGTCGCCCAG GTCACAGGTG GTTCGAAGGC TTGCGTCCTC
GCCCTGTCGA ACTCCGCGTC GGGCGCCGTG ACCGTCACCG GCTCGACGGA AGTCCAATTG
TCCGGCTGCA GCGTGGTTTC CAATTCCAGC GCCTCCGACG CCTTTCTGAT GAGGAACGGC
AGCGCGCTCA TGTCGACCGA TTGCGTCTAT ACCGTCGGCG AAGCCGTGAC GACGACCGGT
CTGACGCTCA CCGGTTGCAG CAAGCCCGTC CAGCAGGTCC CGCCGACGCC GGATCCGTTT
GCCTCCGTCC TCGAACCGGA TCCGCTGCAG ATTCAGCAAC TTCCTTGCCG TTCACTGAGT
TATGTGTCGA ATCTGCTCTA TGTTTTCGAC AGGCTTGCAA GCCCGCTGTA TCCAGGCGGC
GTCGAGGCGA TCAGGTTCTG CGGCGGACTC GACATCAAGG GAACGGTCAA ACTGAAGCCC
GGCCTTTATA TCATAGACGG CGGCGAATTG ACGGTGACGG CCGGGGCAAA ACTCTCCGGC
GATGGCGTTA CCTTTCTTTT CACCAATTCT GCAGCCGCCA ACCTGCTGGG CAACGGCGAC
ATCGATCTGT CCGCTCCGAC GAGCGGTCCC TTTGCGGGCC TGTTATTCTT TTCCAGTCGG
CTCAATACCG GGGTCGTCCA TCAGATAACG GGCAATTCCG AATCTACCCT GGTCGGAAAC
CTCTACGCTC CGACGGGCCG CATCGACTTC ACCGGAAACT CGACCGTTTC GGGCGGATGT
ACGCAGATCG TAGCCGATCA GGTCACTTTT ACAGGCAATT CCAGAATGGA GACCTGCGCT
TCGCCGACAG AGGAGATCCT GGTCGGTCTG TCGGTATCGC TCATAGAGTG A
 
Protein sequence
MSKRTDAIDE ASRSSRWQKP LRRFLKAEGG AVAVIAAVAF PVLVGAMGLG AETGYWYLEK 
RKLQHAADVS AYAAAVRHRA GDQQSALEAA ARRVAGGSGF SPGDLTLSTA SAAAGGSNNV
TVELTETHPR LFSSVFGTGT ITIKARAVAQ VTGGSKACVL ALSNSASGAV TVTGSTEVQL
SGCSVVSNSS ASDAFLMRNG SALMSTDCVY TVGEAVTTTG LTLTGCSKPV QQVPPTPDPF
ASVLEPDPLQ IQQLPCRSLS YVSNLLYVFD RLASPLYPGG VEAIRFCGGL DIKGTVKLKP
GLYIIDGGEL TVTAGAKLSG DGVTFLFTNS AAANLLGNGD IDLSAPTSGP FAGLLFFSSR
LNTGVVHQIT GNSESTLVGN LYAPTGRIDF TGNSTVSGGC TQIVADQVTF TGNSRMETCA
SPTEEILVGL SVSLIE