Gene Smed_1984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1984 
Symbol 
ID5322843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2035869 
End bp2036975 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content63% 
IMG OID640790922 
Producthypothetical protein 
Protein accessionYP_001327653 
Protein GI150397186 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.590455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCTG CCGAAAGGCC ATGCGATGAG ACGATCCGGC GAGCGCTCGC CCGGGTTCTC 
GACAGCAAGG GCTTTCAGCG CTCCGAACGT CTGCGCACGT TTCTCTCCTA TGTGGTGGAA
AAGGAAATCA TCGGTGAAGG CGCTCAGCTG AAGGGCTACT CGATCGCCAT TGATGTTTTC
GGACGCGGCC AGACCTTCAA TGCCGACAGC GATCCTCTTG TGCGCGTCCA TGCCGGCAAG
CTGCGCAAGC TGCTTAAGGC ATTCTACGAA ACGGATGGCG CCGGCGAAGA ATGGCAGATC
GCCATTCCCA AGGGGACCTA TGTGCCGGAG TACCGCCGGT GTTCAAACGG CGTCGAGGCA
TTGCCGGGCC CGGATGCCGC GGGCGCCCGC CGTCGCCAGC CCGGACGCGG GCCACCCTGG
CAGCCCGCCC CCTTTTCGTC CCCCTGGGCC GTGCTTACCG TGCTGCCGCT TCTCCTTTTC
GCCCCTCTGC CGGCCTCGGA AATGTCGCTC GACATCGATG CCGAGGCTAA ACTCGTCAAT
GGTCCGATGG CCGCCGTCAG GGGGCTTCCC TCCGTCAGCA TCAGCGTGAC AGGATCGCAG
CACAGGAGCA CCCGGCGTTT CTCCTCGCAA CTGCGTGATG CGGCGCTTCG GCATGGCACA
CTTGCCCAGG CGCACGTGTC TGATGGGAAC CGCACACCGG CATCCGGCAA TCAGGCGCTT
GCGTTTTCCA TCGCGCTCGC CTGGCACGAT GCGCCTGCGG CGGGCATCCG GGTCACCCTG
TCCCACGATG GGGAAGGCAT CCCTTTGCGC CAGGACTTCA TCTCCGCCGA CCGCCTCGAT
AGCGAGGCCG ATGTTCTCTA CGAAAGCACT TCGCTTGCAG CGAAACTCTT CTCGCTGGAC
GGCGAGATCT ATGCGCATGC CGCACTGGAA GGCTTGCAAA GTACCATGAT GCAATGCATG
TCGGCGACTG CCAAGTACCG GAAGCTGCTG ACGCGCGACA GTTTCCAGCA GGCCTGGAAC
TGCCAGCAGA AGCTCAAGCC CCTCAAAGGC GACGAGCCCT TCTTCATCCT TTCCGTAAGC
AGTCCGCACA AGATCAATGG CCATTGA
 
Protein sequence
MTPAERPCDE TIRRALARVL DSKGFQRSER LRTFLSYVVE KEIIGEGAQL KGYSIAIDVF 
GRGQTFNADS DPLVRVHAGK LRKLLKAFYE TDGAGEEWQI AIPKGTYVPE YRRCSNGVEA
LPGPDAAGAR RRQPGRGPPW QPAPFSSPWA VLTVLPLLLF APLPASEMSL DIDAEAKLVN
GPMAAVRGLP SVSISVTGSQ HRSTRRFSSQ LRDAALRHGT LAQAHVSDGN RTPASGNQAL
AFSIALAWHD APAAGIRVTL SHDGEGIPLR QDFISADRLD SEADVLYEST SLAAKLFSLD
GEIYAHAALE GLQSTMMQCM SATAKYRKLL TRDSFQQAWN CQQKLKPLKG DEPFFILSVS
SPHKINGH