Gene Smed_4097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4097 
Symbol 
ID5318912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp560251 
End bp561489 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content63% 
IMG OID640775904 
Producthypothetical protein 
Protein accessionYP_001312837 
Protein GI150376241 
COG category 
COG ID 
TIGRFAM ID[TIGR02276] 40-residue YVTN family beta-propeller repeat 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACTGT CACTTACCCG GGCCCACCTG ATGGCCGCGG TAGCGGCAAC CCTTGTGTTC 
ACCGCCGTCG GCGCGCGCGC AGACGACCAT GAGACGGAGG AGGCCTGGCG CCTGTTCGTC
GCCGATCACA AGCAGCCGGT CGTGCGGGCG ATCGATTTCG CCGACGGCAA GGAACTGGGC
CGCTATGACG TGAAGGGATA TGCCGCCCTG ACGGTCAGCG CCTCCGGGCA GACGGTTTTC
GCCGCGCAGT CGGACCAGGA CATCGTCCAT CTCATCAAGA CCGGAATCGA CTTCTCCGAG
CACGGGGAGC ACCGTGATCT GAAGGTTTCC GACGCTGCGC TTCTTCCCGT GGCACTCGAA
GGCAAGCGAC CCGTCCATGT CGTACCGCAT GACGATCACG CGATCATTTT CTACGACCGC
GGCGGCGTGG CGGAAATCGT CGACGAAGCA GCGCTTCTCG AAGGCAAGAA GGCGCAAGTC
AGGACCGTCG ATGCGACGAA GCCGCATCAC GGTGTTGCCA TCACCATGGG GCAGCACGTT
CTCGTCTCGG TGCCGAACAC GGAAATCGAG CCGGAGCCGG ACAAGCTGCC TCCGCGTATC
GGCCTGCGTG TCGTCGACGA GAAAGGCAAC CAGGTCGGCG AGATCAGCGA ATGCACGGAT
CTTCATGGTG AGGCGATGTC GGCCCGCCTC GTCGCCTTCG GCTGCAAGGA AGGGGTACTT
GTCGCCCGGC CCGGCGGCAT CGACGGACCC AAGCTTGAAT TGCTCCCGTA CCCGTCCGAT
TTCCCGAAAG GCCATACCGG CACGCTGCTC GGGGGAAAAG CGATGCAGTT CTTCCTGGGC
AATTACGGTG ACGACAAGGT CGTCCTCATC GATCCGGACA GCAGCGAACC CTATCGCCTG
ATCACCCTGC CGACGCGGCG TGTCGATTTC CTGCTGGATC CTGCCATCCC GGCCAATGCC
TATATTCTGA CGGAGGACGG CGATCTTCAT GTGCTGGACG TCGTCAAGGG CGAGATCGTT
CGTAAGGGCA GGGTGACGGA GCCGTACAGC AAGGACGGCC ATTGGCGCGA TCCGCGCCCG
CGTCTTGCGG CAGCGGACGG TCAGATCGCG ATCACCGACC CGCGGCACTC GCTCGTGCGC
GTTGTCGATG CGGAGACGCT GAAGGAAATC CGTACCATTT CCGTCGAGGG TCAGCCCTTT
GCGATCGTCG CCGTCGGCGG TTCGGGCGCG TCGCATTGA
 
Protein sequence
MTLSLTRAHL MAAVAATLVF TAVGARADDH ETEEAWRLFV ADHKQPVVRA IDFADGKELG 
RYDVKGYAAL TVSASGQTVF AAQSDQDIVH LIKTGIDFSE HGEHRDLKVS DAALLPVALE
GKRPVHVVPH DDHAIIFYDR GGVAEIVDEA ALLEGKKAQV RTVDATKPHH GVAITMGQHV
LVSVPNTEIE PEPDKLPPRI GLRVVDEKGN QVGEISECTD LHGEAMSARL VAFGCKEGVL
VARPGGIDGP KLELLPYPSD FPKGHTGTLL GGKAMQFFLG NYGDDKVVLI DPDSSEPYRL
ITLPTRRVDF LLDPAIPANA YILTEDGDLH VLDVVKGEIV RKGRVTEPYS KDGHWRDPRP
RLAAADGQIA ITDPRHSLVR VVDAETLKEI RTISVEGQPF AIVAVGGSGA SH