Gene Smed_4301 details


Gene Information

Locus tag: Smed_4301
Symbol:
ID: 5319305
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 794438
End bp: 795523
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 65%
IMG OID: 640776106
Product: TPR repeat-containing protein
Protein accession: YP_001313039
Protein GI: 150376443
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
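
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the record itself: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the last codon of the CDS is assumed to be a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(794438, 795523)
print(length)                           # 1086
print(expected_protein_length(length))  # 361
```

Both values agree with the Gene Length (1086 bp) and Protein Length (361 aa) fields.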


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.406426
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.198903
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGTTG CCGGCAGGAC ATTCGGTATC ATAGGCGCGC TCGCCGCTTT TCCGCGGCGG 
CTCGCGGCGC GTGAGGTCGA GCGCCAGGGC GGGCACCTCA GGCGCGGCGT CGCGCGCCAG
ACGAACTACG TGGTCTTCGG GCGGGGGCTG CTCTCCAGAG CAACAGAGGC GGAGATCGAA
AAGCGCTTCG ACGGCGAGAG CGGAGCGCAT GGGCGCGTTC TCAGCGAAAA CGGCTTCCTG
CGCCTGCTGG GGCTGGCGCG CGCGGCGGAA ACATCGGCGC TGGCACGACA GTCGCTCATC
GACCAGTCCG GAATTTCCCC TCGCCACCTC GATCTTCTAT CGCTGTTCGA CGCCTTCGAG
CATGATGGCG AACCATATTC TTTCCGTGAC CTGATCCTCG CCCGGAAATA TGCGGGGCTG
ACGGCCAGCG GCGCCGGATG GAGCGCGATC GCGCGATCGG TTCACCGCTC CGGAAATGTC
GCATCCCTCA CCGCACTCTC CCTGCAACAT GAAGGAAACG ATACGATCTA TGCGCGACGC
GCCGAGGGCT TGAGCGAGCT CGACGGCCAG ATGCTGCTCG ATGTCGGCTC TCCGGACGAG
GAGGCGCTCG AAGACCTTTT CGCGCTGGCC GAAGCGGCCG AGGAAGCGGG AGACTACGAT
GAGGCGGCCG CATTCTACCA GCGCTACCTC GCCATCGACC GCACCGACTC CGTCGCTTCC
TTCAATCGTG CCAACTGCCT CAGGGCCGCC GGACAGGAGG CGGAAGCGGC GCACGACTAT
GCCCGCGCCA TCAAGCTCGA TCCTTCCTTC GTCGAGGCAT GGTTCAACCT TGCGGGGCTG
ATGGAGGAAC GCGGCCGCAG GGACACGGCC AGACGGCATC TCACGAAAGC GATCGAGCTC
GACGGCGGTT ATGCCGATGC GGTCTTCAAC CTGGCGAAGC TGGAGTTCGA TGCGGGCAAT
CTCACCGAGG CGCGCCGCTG GTGGATGCGC TACCTCGAAC TCGATCAGGA TTCCGAATGG
GCGCGCAAGG CCGAACGCGG CGTGCAATTC GTGAACCTTC AACTCCTCTC CAGAACGGCA
GGGTAA
 
Protein sequence
MAVAGRTFGI IGALAAFPRR LAAREVERQG GHLRRGVARQ TNYVVFGRGL LSRATEAEIE 
KRFDGESGAH GRVLSENGFL RLLGLARAAE TSALARQSLI DQSGISPRHL DLLSLFDAFE
HDGEPYSFRD LILARKYAGL TASGAGWSAI ARSVHRSGNV ASLTALSLQH EGNDTIYARR
AEGLSELDGQ MLLDVGSPDE EALEDLFALA EAAEEAGDYD EAAAFYQRYL AIDRTDSVAS
FNRANCLRAA GQEAEAAHDY ARAIKLDPSF VEAWFNLAGL MEERGRRDTA RRHLTKAIEL
DGGYADAVFN LAKLEFDAGN LTEARRWWMR YLELDQDSEW ARKAERGVQF VNLQLLSRTA
G
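
Both sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before any analysis. The sketch below shows one way to do that in Python, compute a GC fraction, and translate a coding sequence. The function names are hypothetical. It also assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), so a standard codon table is used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; stop codons are written as '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGCGGTTG CCGGCAGGAC ATTCGGTATC"
print(translate(first_blocks))  # MAVAGRTFGI
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` allows the result to be compared against the listed GC content and protein sequence.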