Gene Smed_3786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3786 
Symbol 
ID5317954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp233292 
End bp235013 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content62% 
IMG OID640775599 
Productouter membrane autotransporter 
Protein accessionYP_001312532 
Protein GI150375936 
COG category 
COG ID 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.968521 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGTCG CCAAGGTTCG GAAGTGCTTC AGCACGGTGC TGAGCGGGGC ACTGCTTGCG 
GGGCTCTTTT GCATTGTGGG GAGCGGGGAG GCATCTGCCG CGAACTGCAG TGCAGGCGGC
GGTTTGGCCT ATCTCGGGAA CACCCGAGGG GCAGCCCGCT CGGTCTGGCT TTCCGGGACC
AGCGCCGGAT CGTTCGGACT TGAGGCTGCC CAGGGCAATC AGATGGTCGA TACGTGGAGC
AAAAGCAAAC GTGGCCTCCA TCTGTCGCTC TCACCGGTCG TTTCGAACGG GAACGGCGGT
ACGCACAAGC TTCTTGACCT ATCGGGAGGG AAGCCCTGCC TGCTCGACAC GCAGAACCAG
GTGCGCGATT TCATTCCGCC GGACTTCACC TGGCCCCAGG GGCGGCCCGA CATCGGTCCG
ATTCGGTTCT TCGATTTCGA CTTGCCCGAG GGGCGCCCTG ACACCGGCCC GATCCGTGTC
TTCCCCGATT TCGATCGGCC GGCCGGACGG CCCGACACCG GCCCCATTCG CGTTTTCCCG
GGCTTCGATC GGCCTGACGG GCGGCCCGAC ACCGAACCGG TTCGGGTCTT CCCCGGTTTC
GATCGGCCGG ACGAACGACC CGACAACGAT CTTCCGCGCA TTCCGCCGGA CCGTCCCGAT
GGGCGGCCCG ACAATGACCT GCCGCGTGTG TTTCCCGGGT TCGACCCACC TTCACCCTCA
CCGAATCCCA CGGCGGTTCG GGGCGGGCGT GTGGTCGGGG CTCGGACAGC GCAGAACTGC
ATCGACCCCC GTTCCGTGGG CTCCGATCAA AGGCGCGACG TGCCGATCTG CCCTGAGTTC
GTTGCGGAAG AGGCAGCGAC GCAAGGCGCC GGCCAGTCCC GCGGTGTGCC GCTGACGCCC
GGCCGCGATC TGCCCGCGCC CTCGCTCTGG AATTTCTGGT CCGACATTCG GTTCACGGAT
ATATCAGACG AACGTTACGA CCGAGACAGC GACACCTTCG CCCGCACACT TGACATGGGC
CTCGATCGCC GGATCACCGA CGACCTTGTG GTCGGCATGT CATTTTCTCT GCAGGACAGT
TCGACGCATG CGTTCCACGA CTCGCTCGAC ATCGATTCGG ACGGGTTCAG CTTCGGTCCT
TACGCGGCCT ATCGCCTGTC GAAGCACTGG GCCATCGACG CTTCCCTGAC CTATGGACGC
TACAACAACG ATGTCGAACT GAGCGTGCTC AGCGGCAAGT ATGATTCCGA GCGTTTTGCC
GGTGAGATCT CGCTCAACGG CCAATACAAA TTCGACGCCT ACTACGTCCG TCCGAAAGCA
TCCGTAACCC TGTCCCATGT CAAAAGCGAT GCATACGGCC TCTCCGGAAG TCTTTTCAAT
TTCCCGGTGA GTGTTTCGCT TCCGGGCGAC AGCTACAATT ATGGCGTGCT CGACATGTCG
ACCGAAGTCA GCAGGTTCTT CCGCCTGCCG GACGGGCAGC CGTTCCTGGT CTTCGCGGAG
CTTGGCGCGC AATATGAATT CGAGCGGCCG AACGACGGGA AGATTCTGAC GGGCGACCTG
TCGGAAGTCT CTCCCTCGCC CTGGGCGTTT TCACTGCGCT CGGGATTGAG AATGCTGCTC
AACGAAAACC TCCAGATCGA AGCGACGGGA GGTTATCTAA GCCTCGGTCA GGATGATCTC
GACGTATGGG AGGGCAAACT TCACGTGTCC TGGAGCTTCT AA
 
Protein sequence
MVVAKVRKCF STVLSGALLA GLFCIVGSGE ASAANCSAGG GLAYLGNTRG AARSVWLSGT 
SAGSFGLEAA QGNQMVDTWS KSKRGLHLSL SPVVSNGNGG THKLLDLSGG KPCLLDTQNQ
VRDFIPPDFT WPQGRPDIGP IRFFDFDLPE GRPDTGPIRV FPDFDRPAGR PDTGPIRVFP
GFDRPDGRPD TEPVRVFPGF DRPDERPDND LPRIPPDRPD GRPDNDLPRV FPGFDPPSPS
PNPTAVRGGR VVGARTAQNC IDPRSVGSDQ RRDVPICPEF VAEEAATQGA GQSRGVPLTP
GRDLPAPSLW NFWSDIRFTD ISDERYDRDS DTFARTLDMG LDRRITDDLV VGMSFSLQDS
STHAFHDSLD IDSDGFSFGP YAAYRLSKHW AIDASLTYGR YNNDVELSVL SGKYDSERFA
GEISLNGQYK FDAYYVRPKA SVTLSHVKSD AYGLSGSLFN FPVSVSLPGD SYNYGVLDMS
TEVSRFFRLP DGQPFLVFAE LGAQYEFERP NDGKILTGDL SEVSPSPWAF SLRSGLRMLL
NENLQIEATG GYLSLGQDDL DVWEGKLHVS WSF