Gene Smed_5122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5122 
Symbol 
ID5319424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp76062 
End bp77495 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content60% 
IMG OID640776900 
Producttype II secretion system protein E 
Protein accessionYP_001313832 
Protein GI150377237 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0451051 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTTCG GCAAGAAATC CATTTCGGAA AAACAGATAT CCGAGGCAGC GGTCCAGTCC 
GTGATCGAAA ATGAACCGCA GGCCGCGCCT GCCCTTGTCC AGGTTCTAAA GCCAACCCAG
GTTGCCGCAA CGGCGAGATT TGACAAGGCC GAACAGTATT ACAGCCTGAA GAAGGAAATC
TTCAGCGCCC TCATCGCTAC GATCGATGTC GCTGCACTCT CCAATATGGA CGTGGAGCAG
GCGCGCGGGG AGATCGGCGC AATTATCAAC GACATCGTTG CCGCCAAGAA GGCCGGCATT
TCGATGGCGG AGCAGAACAA CCTTCTCAGC GACATCTGCA ACGACATTCT GGGCTATGGG
CCGCTCGAGC CGCTGCTTGC TCGCGACGAC ATAGCCGACA TCATGGTCAA TGGCGCAAAC
CAGGTCTTCA TCGAGGTCAA TGGCCGAGTG CAGGAGACGG GGGTTCGTTT TCGCGACAAC
GAACAGCTCC TCAACATCTG CCAGCGCATC GTTAGCCAGG TTGGCCGCCG CGTCGATGAA
TCAAGCCCGA TCTGCGACGC GCGTCTTGCC GATGGCTCCC GTGTGAACGT GATCGCGCCT
CCCCTGGCAA TCGATGGCCC CACCCTGACG ATCCGCAAGT TCAAGAAGGA AAAGCTGACA
CTCGACCAGC TGGTTCGATT CGGCTCGATT TCGCAGGAAG GCGCTGAGGT TCTCAAGATC
ATTGGTCGCG TGCGCTGCAA TGTCCTTATT TCCGGCGGTA CGGGCTCCGG CAAGACCACG
CTTCTGAACT GCCTGACCGG CTATATCGAT CACGGAGAGC GCGTCATCAC CTGCGAGGAC
GCGGCGGAGC TTCAATTGCA GCAGCCGCAT GTGGTTCGTC TCGAAACTCG GCCGCCGAAC
ATCGAGGGGC AGGGCGAGAT CACCATGCGC AGCCTGGTGA AGAACTGCCT GCGTATGCGG
CCCGAGCGGA TCATCGTCGG CGAGGTGCGC GGACCCGAGG CCTTCGATCT ATTGCAGGCG
ATGAACACCG GCCACGACGG CTCGATGGGA ACGCTGCACG CAAACTCCCC GCGCGAAGCA
ATGGCTCGCG TCGAGGCGAT GATCACCATG GGCGGCAGTT CGCTGCCGGC CAGGACGATC
CGGGAGATGC TCGTCTCATC GGTCGACGTC ATCGTGCAGG CGGCCCGTCT TCGCGACGGT
TCGCGCCGGA TCACCCATAT CACCGAAGTG CTCGGCATGG AGGGGGACGT GATCACGACA
CAGGACCTCT TCATCTACGA CATCCTTGGC GAAGACGAGA AGGGGAACAT CATTGGAAGA
CATCGTTCGA CCGGCATTGG TCGCCCGGCA TTCTGGGACC GCGCCCGTTA TTACGGCGAA
GAGGGCCGCC TTGCCGCGGC CCTCGATGCG GCCGAGATGA AGGCGGCCGC TTGA
 
Protein sequence
MMFGKKSISE KQISEAAVQS VIENEPQAAP ALVQVLKPTQ VAATARFDKA EQYYSLKKEI 
FSALIATIDV AALSNMDVEQ ARGEIGAIIN DIVAAKKAGI SMAEQNNLLS DICNDILGYG
PLEPLLARDD IADIMVNGAN QVFIEVNGRV QETGVRFRDN EQLLNICQRI VSQVGRRVDE
SSPICDARLA DGSRVNVIAP PLAIDGPTLT IRKFKKEKLT LDQLVRFGSI SQEGAEVLKI
IGRVRCNVLI SGGTGSGKTT LLNCLTGYID HGERVITCED AAELQLQQPH VVRLETRPPN
IEGQGEITMR SLVKNCLRMR PERIIVGEVR GPEAFDLLQA MNTGHDGSMG TLHANSPREA
MARVEAMITM GGSSLPARTI REMLVSSVDV IVQAARLRDG SRRITHITEV LGMEGDVITT
QDLFIYDILG EDEKGNIIGR HRSTGIGRPA FWDRARYYGE EGRLAAALDA AEMKAAA