Gene Smed_3355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3355 
Symbol 
ID5324239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3554032 
End bp3555498 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content63% 
IMG OID640792306 
Producttype II secretion system protein E 
Protein accessionYP_001329011 
Protein GI150398544 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGGCA AACGCGGAAA TGAAGGTCCT GGCAAGGGTG GTGCACGCGG TTTCACACCT 
GCGCCTTCAA TACCCGCGGT CGAGCTGGCG GTTGTCGAGC GTCCCTCGGC CCCTGATTAT
GGAGAGCCGG CCGCTCCGCC ATCCCGGCCG CAGGCGGCAG CGCCCCCTCA GCGCCGGCGC
CCGGCGCGGG CAGAGGATTA CTACGACACG AAATCGCAGG TCTTCTCCGC GCTGATCGAC
ACGATCGACC TGTCGCAGCT CTCCAAGCTC GATATCGAGA GCGCGCGCGA GGAAATTCGC
GACATCGTCA ACGACATCAT CACCATCAAG AATTTCGCGA TGTCGATCTC GGAGCAGGAG
GAACTGCTCG ACGACATCTG CAACGACGTG CTCGGCTACG GACCGCTGGA GCCGCTGCTC
GCGCGCGACG ATATCGCCGA CATCATGGTC AACGGCGCCG GGCAAACCTT CATCGAAGTG
GGCGGGAAGG TCGAAGAATC GGAGATACGG TTCCGCGACA ACGGGCAACT CCTGTCGATC
TGCCAGCGCA TCGTCAGTCA GGTGGGCCGC CGCGTAGACG AGTCCAGCCC GATCTGCGAC
GCGCGTCTGC CGGATGGCTC GCGTGTCAAC GTCATCGCCC CGCCGCTCGC GATCGACGGC
ACGGCACTCA CGATCCGCAA GTTCAAAAAG GACAAGCTGA CGCTCGAGCA GCTGGTACGT
TTCGGCTCGG TCACACCGGA GGCCGCGGTG CTGCTGCAGA TCATCGGCCG CGTCCGCTGC
AACATCGTCA TCTCCGGCGG TACCGGCTCC GGCAAGACGA CGCTGCTCAA CTGCCTGACG
CGCTATATAG ACAGCAACGA GCGCATCATC ACCTGCGAAG ACTCCGCCGA ACTGCAATTG
CAGCAGCCGC ATGTGGTCCG TCTCGAGACT CGCCCGCCGA ACATCGAGGG CGAGGGCGAG
ATCACCATGC GCGACCTGGT GAAGAACTGC CTGCGCATGC GCCCGGAGCG CATCATCGTC
GGCGAGGTGC GCGGCCCGGA AGTCTTCGAT CTGCTGCAGG CGATGAACAC CGGCCATGAC
GGATCGATGG GAACCATCCA CGCGAACACG CCGCGCGAAT GCCTGAGCCG AATGGAATCG
ATGATCGCCA TGGGCGGTTA CACCCTGCCT GCCAGGACCG TGCGCGAAAT CATCTCCGGC
TCGGTGGACG TCATCATCCA GGCATCGCGC CTGCGCGACG GTTCGCGCCG GATCACCCAC
ATCACCGAGG TCGTCGGCAT GGAAGGCGAC GTAATCATCA CACAGGATCT GATGCGCTAC
GAGATCGACG GCGAGGATGC CAATGGCCGG ATTGTCGGCC GCCACGTTTC GACCGGGATA
GGCCGGCCGC ATTTCTGGGA CCGGGCCCGC TACTTCAACG AGGACAAGCG GCTCGCCGCG
ACACTCGATG CGATGGAAAA GCAATAG
 
Protein sequence
MFGKRGNEGP GKGGARGFTP APSIPAVELA VVERPSAPDY GEPAAPPSRP QAAAPPQRRR 
PARAEDYYDT KSQVFSALID TIDLSQLSKL DIESAREEIR DIVNDIITIK NFAMSISEQE
ELLDDICNDV LGYGPLEPLL ARDDIADIMV NGAGQTFIEV GGKVEESEIR FRDNGQLLSI
CQRIVSQVGR RVDESSPICD ARLPDGSRVN VIAPPLAIDG TALTIRKFKK DKLTLEQLVR
FGSVTPEAAV LLQIIGRVRC NIVISGGTGS GKTTLLNCLT RYIDSNERII TCEDSAELQL
QQPHVVRLET RPPNIEGEGE ITMRDLVKNC LRMRPERIIV GEVRGPEVFD LLQAMNTGHD
GSMGTIHANT PRECLSRMES MIAMGGYTLP ARTVREIISG SVDVIIQASR LRDGSRRITH
ITEVVGMEGD VIITQDLMRY EIDGEDANGR IVGRHVSTGI GRPHFWDRAR YFNEDKRLAA
TLDAMEKQ