Gene Smed_3799 details


Gene Information

Locus tag: Smed_3799
Symbol:
ID: 5318097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 249410
End bp: 250405
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 64%
IMG OID: 640775612
Product: hypothetical protein
Protein accession: YP_001312545
Protein GI: 150375949
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
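
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 249410
end_bp = 250405
gene_length_bp = 996
protein_length_aa = 331

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # coordinates agree with Gene Length
print(span_bp % 3 == 0)                    # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop agree with Protein Length
```

Under these assumptions all three checks print True, so the listed coordinates, gene length, and protein length are mutually consistent.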


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.314785
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATTT CTCTGCCCGC ATTCGGAAGA ACGAAGGAAA TCCTACGTCG GGCCGCCGGC 
GCTTCAGACG GTGCCTACGT TTCCGTCGAC AATCTCGTGG CGCTCGAATC GATGGCCGCC
GACCTGACAT TCCTGCGCAA GGCCCCCGTC CGCCGCTTCC TTGCCGGGCG CCATGAATCG
CGCATGCGCG GCCGTGGGCT GAGCTTCGAG GAGCTTCGGA CCTACATGCC GGGCGACGAT
ATCCGCACGA TCGACTGGCG CGTCACCGCC CGAACGGGTC AACCTTTCGT GCGCGTCTAT
AACGAAGAGA AGGACCGACC CGCCCTCGTC ATCGTCGACC AGCGCACCAA TATGTTCTTC
GGCAGCCGCC GGTCGATGAA ATCGGTGGCC GCCGCTGAAG CGGCGGCGCT CTGCGCCTGG
CGCGTTATGG CGCTTGGCGA TCGCGTCGGC GGCGTGGTCT TCAACGACCT GAAGCAGGAG
TCCATCCGGC CGCACCGCAG CCGCGGATCC GTCATCCGCT TCGCCGAAAC GATTTCGCTC
CAGAACAAGG CACTTAGCGC GGGCTCGGAT ATCGAGAGGG CGCCCGGTCA ACTCAATGCC
GTCCTGGGCA ACGTCGCAGC CGTCGCGCAG CACGACCATC TGATTATCGT CATCAGCGAT
TTCGATGGGC ACGGGCCGGA GACACGCGAT CTTCTCTTGC GGATGTCCGT CTCAAACGAC
GTCATTGCCA TCCTCATCTA CGATCCATTC CTCCTGGACC TGCCGCGCCA GGGCGACATG
GTGGTGAGCG GCGGCGCTCT GCAGGCCGAA CTGCAGTTCG GCCGTAGCAA TGTTCGCGAT
GCGGTCGACA GCTTTGCGCG CAACAGAGGC CGAGAGCTGC TTTCTTGGCA GGAGGAGATG
GGCCTGCCCA TGCTGCCCGT TTCCGCTGCC GAGGAGGTCG CCCCGCAATT GCGCACGCTT
CTCGCTCAAC TCGCCTGGCG GCAAAGGAGG CGATAG
 
Protein sequence
MAISLPAFGR TKEILRRAAG ASDGAYVSVD NLVALESMAA DLTFLRKAPV RRFLAGRHES 
RMRGRGLSFE ELRTYMPGDD IRTIDWRVTA RTGQPFVRVY NEEKDRPALV IVDQRTNMFF
GSRRSMKSVA AAEAAALCAW RVMALGDRVG GVVFNDLKQE SIRPHRSRGS VIRFAETISL
QNKALSAGSD IERAPGQLNA VLGNVAAVAQ HDHLIIVISD FDGHGPETRD LLLRMSVSND
VIAILIYDPF LLDLPRQGDM VVSGGALQAE LQFGRSNVRD AVDSFARNRG RELLSWQEEM
GLPMLPVSAA EEVAPQLRTL LAQLAWRQRR R
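
The gene and protein sequences above are related by translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code. The following is a minimal sketch of that relationship, not the tool used to produce this record. The helper names `clean` and `translate` are hypothetical, and the example inputs are the first and last lines of the gene sequence copied from above.

```python
from itertools import product

# Standard codon assignments (shared by translation tables 1 and 11),
# listed in the conventional T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCAATTT CTCTGCCCGC ATTCGGAAGA ACGAAGGAAA TCCTACGTCG GGCCGCCGGC"
last_line = "CTCGCTCAAC TCGCCTGGCG GCAAAGGAGG CGATAG"

print(translate(first_line))  # MAISLPAFGRTKEILRRAAG
print(translate(last_line))   # LAQLAWRQRRR
```

Both outputs match the corresponding ends of the protein sequence listed above. The last line can be translated on its own because each full line holds 60 bases, a multiple of three, so the reading frame is preserved across line breaks.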