Gene Smed_1219 details

Gene Information

Locus tag: Smed_1219
Symbol:
ID: 5322066
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1301983
End bp: 1303857
Gene Length: 1875 bp
Protein Length: 624 aa
Translation table: 11
GC content: 56%
IMG OID: 640790160
Product: hypothetical protein
Protein accession: YP_001326904
Protein GI: 150396437
COG category:
COG ID:
TIGRFAM ID:
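
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 1301983
end_bp = 1303857


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1875 624
```

Both results agree with the Gene Length and Protein Length fields listed in the record.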


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.704503
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.108179
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGAGC TTATCTCGCG GTCTCAATCA GCAAAGGCAA GGTGGGCTGT TCGTAGCGCT 
CACGGCAAGT CGACGATCAT GAACTTCCTC TTCTTCGGAC TGGGAGGGGA TCTCGACCGT
TCGGCTTGGA GCGAGCACGC TCTGAAATGC GATCACGTGT GGCTTGAGGT TGAATTCAAC
GGGAACCCCG CGGTGCTGCG CCGGGAGATT GATGTCTCCT CGCAATCGGC GATGGAGATT
TTCGGCGGAC GCTATGAACA AGCAGTCGCT GCCCCAATCG AAGCGTGGCA TCGATATCCC
TATGCTCGTT CAAAAAACCA AGAAAGCTTC TCACAGGCGA TCTTCCGGCT CCTGGAGATG
CCGGATGTTG CAGTCGAGGG CACGTCGAGC ATAACCATCC ACCAAATCCT GCGCCTTCTC
TACGCAGATC AGCTCTCGCC GACAGAAAGC CTCTTCCGGT TTGAAGGATT TGATTCTGAG
AACCTGCGCG AGGCCGTGGG CAATCTCCTT TGCGGGGCGT TTGACACCGA GATCTATGAG
CTTCAGCAAC TGAAGCGCAC GAAGGAAAGG GAGTTCACCG AGGCCAACGC CGAGTTGAGG
AGCATATTCA AGGTCATCGG CGGTGGTGAC GAGAGCATGA CCCTTGACTG GATCGAGCAG
CGCCGCAAAA GCCTTGTCCA GGAGCGCGAC ACGACCTCGG CTCAGCTTCT GGAGGCGGAG
GAAGCGTTCT ATTCCAGCCA ATCCTCAGAG AAGATAACTC TGCAGGTTCA GAAAGACCTC
TACAAGCAGG TTCAAAAACT GCAGGGAGAG ATCCGTCAAA AGCAGGCCGA GGTGGATTCG
GTTGAATTCG AAACCTCGGA CTCGGCAATC TTCATCCGAA CCTTGTCCTC GCGGCTGGAA
GCGTTGCGAG ATGCGGATCT CACCGCAAGC GTTGTCGGCA GCGTCCGGTT CGGAACCTGC
CCGGCTTGCT ACGCAGAGGT CATTGAGCAC GACCACGATG TAGAAGCCTG TCATCTATGC
AAGACTCCGC TCGACACCGA GCGTGCCCGT GAGCGAGTGG TCGGTATTAT CAACGAGCTG
TCGATACAGG TAAAGCAGTC CGAGCGTCTG CAAACCATCA GGGCCGAGCG CCTGGAGAAA
CACCAGACCG CCTTGAATGC GCTGGTCGCG AACTGGCGGC TCAAGGCAAA GGAGCTTGCT
GAACTTAGTT CAGTGCCAAC ATCCAAGGCC CGCGAGCAGA TCCGCGAGCT GCAGCGGAAG
CTCGGGTACA TCGACAAGTC CATTGAAGAC GTCGAAGCGC AGGTCAAGCT CGCCAACAAG
ATCGGCCATC TCTCCGCCCG CAAGGCGCAG CTCGATCAGG ACATCCAGTC TTTAGGCTCA
CGCATCACCA CACTACAATT GAGCCAGACC GATCGGTTGC GGACTGCGAA AAACAGCATC
GAGAAGAATA TCATCGCGAT CCTCCAGGGC GATCTCAAGC GTCAGGACAC CTTCGAAAAC
CCCGAGCATG TTCAATTCTC GTTCGGGAAG AACTCCATAA CGGTTGATGA GCATACCTAT
TTCTCCGCAA GCTCCCGCGT AATTCTCAAA GCCGCTTTCC TGGTCGGCTT TCTCAAGGCT
GCCCTCTACG ACCGACAGTT CCGCCATCCC CGCTTCCTCA TGATCGACAT CACCGAGGAC
AAGGGTATCG AGCAGGCCCG GAGCCATAAC TTCCAAAAGC AGGTGATCGA GATCTCGGAC
CAGGCGCCTG TCGAGCACCA GATCATCATA GCGACGGCAA TGCCGTGGCC CGAGATCAGA
CCGGAACTCG TTGTCGGAAG ACATTCGACG CGACAGCACG GCACACTCGC GTTTCTTCCA
AGCAACGGCA ACTGA
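
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGACGGAGC TTATCTCGCG GTCTCAATCA GCAAAGGCAA GGTGGGCTGT TCGTAGCGCT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Passing the complete 1875 bp sequence through the same two functions would give a value to compare against the GC content field of the record.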
 
Protein sequence
MTELISRSQS AKARWAVRSA HGKSTIMNFL FFGLGGDLDR SAWSEHALKC DHVWLEVEFN 
GNPAVLRREI DVSSQSAMEI FGGRYEQAVA APIEAWHRYP YARSKNQESF SQAIFRLLEM
PDVAVEGTSS ITIHQILRLL YADQLSPTES LFRFEGFDSE NLREAVGNLL CGAFDTEIYE
LQQLKRTKER EFTEANAELR SIFKVIGGGD ESMTLDWIEQ RRKSLVQERD TTSAQLLEAE
EAFYSSQSSE KITLQVQKDL YKQVQKLQGE IRQKQAEVDS VEFETSDSAI FIRTLSSRLE
ALRDADLTAS VVGSVRFGTC PACYAEVIEH DHDVEACHLC KTPLDTERAR ERVVGIINEL
SIQVKQSERL QTIRAERLEK HQTALNALVA NWRLKAKELA ELSSVPTSKA REQIRELQRK
LGYIDKSIED VEAQVKLANK IGHLSARKAQ LDQDIQSLGS RITTLQLSQT DRLRTAKNSI
EKNIIAILQG DLKRQDTFEN PEHVQFSFGK NSITVDEHTY FSASSRVILK AAFLVGFLKA
ALYDRQFRHP RFLMIDITED KGIEQARSHN FQKQVIEISD QAPVEHQIII ATAMPWPEIR
PELVVGRHST RQHGTLAFLP SNGN
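
The protein sequence can be checked against the gene sequence by translating it. The sketch below builds the codon table by hand rather than relying on a library; it assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and `translate` is an illustrative name.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence in the record.
print(translate("ATGACGGAGC TTATCTCGCG GTCTCAATCA"))  # MTELISRSQS
print(translate("AGCAACGGCA ACTGA"))                  # SNGN
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed above.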