Gene Smed_5201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5201 
Symbol 
ID5319503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp160124 
End bp161521 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content61% 
IMG OID640776979 
Producthypothetical protein 
Protein accessionYP_001313911 
Protein GI150377316 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAGAG GGACGACCCT TGCACTTGCG GGCGCAATAA CGCTCATGCT GTCGATGCAA 
GTCAACGGCT CGCTTGCACA AGAACCCATG AAAGCTGAAA CCGGTACCGC GGCAACAACC
CCGGAGCCGT TGACTGACGA CGAACTGGAG GTGCTGGTGG CGCGGATTGC GCTCTACCCG
GACGAACTGG TCGCGCTGAT TTCAGCCGTA TCGCTATACC CCTTACAGAT CGTCGAGGCG
GACCGGTTTC TTGAGAACCA CAAAAAGAAG CCCGACCTGA AGCCGAAGGA AGGCTGGGAT
GGCAGCGTAA TTTCGCTGCT GAACTATCCA GACGTCGTCA AGATGATGAG TGACGATCTT
GAATGGACGC AGGCCCTCGG TCAGGCGGTC GCCTATCAGC AGAAGGACGT GCTGATCGCG
ATCCAGCAGT TGCGCGACGA GGCGGTAGCC AAGGACATTA TCAAATCTGA CGACAAGATG
ACGGTGGTCC AGGAGGGGGA CAACATCATC ATCCAGTCGG CAAATCCGGA GACTATCTAC
GTGCCGCAAT ATCCTCCGGA AATGCTGTAC GAGCCGGATT ACGCGCCGGT GCCCATCGAC
TACTATGACA CGCCCTATCC CTCCTATTAT TACCCGGGCG CAGCCTTTTT TGCCGGAGCA
GTCACCGGTG CGGTCTTTGG CGCTATCGTC GACTGGGACG ACTGGGGGGT CTGGGGCGGA
GACTGGGGCG GCGACATTGA TGTGGACTGC GACAATTGCT TCAACAACGT CGATATAGAC
GGCAAGGTCA AGTGGAACGA CATCGATTGG AAAAACGTCG ACCGCAGCAA GCTGAAGTTC
GATCGCGACC AGCTCCAGAA GTTGGACAGG ACGAACTTGA AAAACAATAT CAAGGCGAAC
GGCGACAACA ACATTCGTAA CCGCGCCACC GAGATCAATC GCGATCGGTT GAAGTCGGGA
CCGGGCGGTG GCGCTAGCCA ACTAAAGGAC GTTCGCAAGA GCACGCTCGA GGGGCTGAAG
GCGCAGCCGA GACGGGACGC CTCCGCGCGT CCAACCGCAA AGCCCGGCGG CGGACAGGCG
GTGGCAAAGG CGAAGTCCAA GGGCGCAAAA GCCGGTGTCA ACCGCCCCAA GGGTAAGAAG
TCCTCGGCCA ATCGGCCGGC AGGCAAGAAG AAGATGGCTT CGAAGGCGCA GAACAGGTCG
AAAAAACCGT CTGGGCTAGG CAACGTTAAC TCCGGCCGGC GGGAGGTGTC TGCCTCACGG
CGCGGCGGCC ACAGCATGGG CGGCGGGCAA CGAGGCGGCG GGCGGCCGCA GATGAGTCGC
GGCGGCAGCC GGCCGCCGAT GGGGGGACGT GGCGGTGGCG GACGCGGGGG TGGCGGACGC
GGGGGTGGCA GGCGCTGA
 
Protein sequence
MIRGTTLALA GAITLMLSMQ VNGSLAQEPM KAETGTAATT PEPLTDDELE VLVARIALYP 
DELVALISAV SLYPLQIVEA DRFLENHKKK PDLKPKEGWD GSVISLLNYP DVVKMMSDDL
EWTQALGQAV AYQQKDVLIA IQQLRDEAVA KDIIKSDDKM TVVQEGDNII IQSANPETIY
VPQYPPEMLY EPDYAPVPID YYDTPYPSYY YPGAAFFAGA VTGAVFGAIV DWDDWGVWGG
DWGGDIDVDC DNCFNNVDID GKVKWNDIDW KNVDRSKLKF DRDQLQKLDR TNLKNNIKAN
GDNNIRNRAT EINRDRLKSG PGGGASQLKD VRKSTLEGLK AQPRRDASAR PTAKPGGGQA
VAKAKSKGAK AGVNRPKGKK SSANRPAGKK KMASKAQNRS KKPSGLGNVN SGRREVSASR
RGGHSMGGGQ RGGGRPQMSR GGSRPPMGGR GGGGRGGGGR GGGRR