Gene Smed_1190 details


Gene Information

Locus tag: Smed_1190
Symbol:
ID: 5322037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1270069
End bp: 1271907
Gene Length: 1839 bp
Protein Length: 612 aa
Translation table: 11
GC content: 58%
IMG OID: 640790131
Product: hypothetical protein
Protein accession: YP_001326875
Protein GI: 150396408
COG category:
COG ID:
TIGRFAM ID:
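
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1270069, 1271907)
print(length)                     # 1839
print(protein_length_aa(length))  # 612
```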


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.204201
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.1151
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCCGTC TCACGATCAA AGTCCCGTTC CACGATGGCG AGGCAACCAG CGGATATGCA 
TCGCGGCTCG CCGCTGCCAA CGGAGTTGAC GGATCCGACT TCGGCGCGCT CATGGAAATC
GGGTGGCTTC CACTTTTATA CGGAAAGGAG GATGCTTTGT TCCGCCTCTC CAGTGTCTGC
GGTATCGATG TGAGCACGCT GTCACGCGGC TTCGTCCGCC CCGACTCTTC CGGCACGGCA
GCGGCCGTCC AGATTAACGG AGAGCAGCTA ACTAAGGCTC AGGTGCTGCG ATGGGATCCC
CGCTACTGCC CCCAATGCGT CCGGGAGGAC ATCGCCACGA GTACAGGCGC GGTCGAAGCG
AGGCCCTATC AGAGGCTGGA CTCGCTGGTG ACGTTCATCC GCACCTGCGA GGCGCATGAC
GTCATGTTGC GGCGGGCCGA GCCAGTCGTT GAATATCGTC ACCGTAGAGA TTTCGCTAGG
CGGATCAAGT TCGAGATCAT TGAGGGGCAT CTCGACGCCG CCCCTGTCGT CAAGCGTCAC
ACTGCATTCG AGCGATACAT CGCGTCCCGC CTCCATGGGC GTGTAACCGA TACGCCTTGG
ATGGACGGAA TGCCGCTGCA CGTGGTGGGG AGGATGTGCG AGGTCGTCGG CGGTGCGGAG
CTCTTCGGGT TCGACTTCAA GTTCCGGGCG CTCAGCGAAG AGGACTGGGT TGATGCCGCC
GCTCGCGGTT TCGAAATCAT GTCACGGGGA GAAGTCGGCT TCGTTGAGTT TCTCTATTCG
CTTCACGGAA ACTTCTGGAA GTCGAGCGCC GACTTCGGTG GGAGGTCGGT ATACGGCACA
CTCTACGTCT ATCTGCAGGC AAATCCCGAT CCTGGCTATG ACAGCATCAG GAACATCATG
AAAGATATCG CCCTTGATAA CTTCCCGCTC GGTCCCGGCG ATGATTTTTT TGGGACGGTG
ACATGCAGGA GGGTGCACTC CGTAAGAACG GGGTCGCTGA TCAGCCGTTT CGCCGACAGG
ACATTGCACA AGCTGTTGAT CGCAGCCGGT CTGGTTGATC CCGCAACCAA AGGGACGAAC
CCGAACAAGG TCCTTATCAA TGGAGATCGC ATGGAGACCT TCATCCGTGG AGCGGATGAT
CTCCTTGTTG GTCCTGCTGC TCGGGAATAT CTTGGCATCT CGTTGAGTAC GTGGAAGAGC
CTTCTAGCGG ACGGCTTTAT CCAGCCTTGC ATTCGAAGCG CCAGTGGTGA GAATGTAAAG
TCGCTGTTCC GTCGGTCCGA CCTTGATGAT TTCGCTTCGA AGCTGGAGGC ACAGGTCACG
GCGCCACTCG CCAGCCGCAC TAGCTTCATC TCGCTTCCAA CTGCGGCCAG TCGCGTCCAC
AGGCCGATCA CGGAAGTGTT GAACTTCCTA CTCGCGGGAA AGCTCAAGTA TGTGGCGAAG
GCGGAGGTAG GCCCACGGTT TCAGGCAATC TGCCTCGATC TCGAGGAGTT GCGAGACCTC
GTTCGGCTTA CCCCCTTGCC TGGGCATAAC CTGAGGACGG TAGAAAAGCT TCTGAACACG
ACCACTCCAG TCGTGCGGAA ACTAATCTCC AACGGTCATC TTGCTGCCGA GACAGCCATC
AACCCCGGCA ACAAGACACC TCAGGTCGTC GTGCGGCCGG AAGTTCTCGA GGCATTCGCT
GGGACATTCA TTTCCCTTCA TAACATCGCG ATCGCTCAGA ACACGAACAT CAAGGCCGTG
CGCCGTGATC TCGTGGCCCG CGGGATTCTG CCCGCGATCG ATGGCAAAGA CGTTGGTGCG
ACCTTCTATC GGATGAAAGA TCTGGATCCG AAGTCCTGA
 
Protein sequence
MSRLTIKVPF HDGEATSGYA SRLAAANGVD GSDFGALMEI GWLPLLYGKE DALFRLSSVC 
GIDVSTLSRG FVRPDSSGTA AAVQINGEQL TKAQVLRWDP RYCPQCVRED IATSTGAVEA
RPYQRLDSLV TFIRTCEAHD VMLRRAEPVV EYRHRRDFAR RIKFEIIEGH LDAAPVVKRH
TAFERYIASR LHGRVTDTPW MDGMPLHVVG RMCEVVGGAE LFGFDFKFRA LSEEDWVDAA
ARGFEIMSRG EVGFVEFLYS LHGNFWKSSA DFGGRSVYGT LYVYLQANPD PGYDSIRNIM
KDIALDNFPL GPGDDFFGTV TCRRVHSVRT GSLISRFADR TLHKLLIAAG LVDPATKGTN
PNKVLINGDR METFIRGADD LLVGPAAREY LGISLSTWKS LLADGFIQPC IRSASGENVK
SLFRRSDLDD FASKLEAQVT APLASRTSFI SLPTAASRVH RPITEVLNFL LAGKLKYVAK
AEVGPRFQAI CLDLEELRDL VRLTPLPGHN LRTVEKLLNT TTPVVRKLIS NGHLAAETAI
NPGNKTPQVV VRPEVLEAFA GTFISLHNIA IAQNTNIKAV RRDLVARGIL PAIDGKDVGA
TFYRMKDLDP KS
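
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes translation table 11 (as listed in the record), whose amino acid assignments match the standard code but which accepts alternative start codons such as GTG, read as methionine at the first position. The helper names are hypothetical, and the function expects the sequence in the 10-base block layout used on this page (whitespace is ignored).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons recognised by NCBI translation table 11.
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace between sequence blocks."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    # An alternative start codon at position 1 is read as methionine.
    if codons and codons[0] in TABLE_11_STARTS:
        protein[0] = "M"
    # Drop a single terminal stop codon, if present.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate_cds("GTGAGCCGTC TCACGATCAA AGTCCCGTTC"))  # MSRLTIKVPF
```

Passing the full gene sequence to `translate_cds` should give a string that can be compared against the listed protein sequence once the spaces between its 10-residue blocks are removed.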