Gene Smed_1817 details


Gene Information

Locus tag: Smed_1817
Symbol:
ID: 5322675
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1898095
End bp: 1899693
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 61%
IMG OID: 640790755
Product: hypothetical protein
Protein accession: YP_001327487
Protein GI: 150397020
COG category:
COG ID:
TIGRFAM ID:
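
The coordinate and length fields in this record can be cross-checked against each other. The short Python sketch below is only an illustration: it assumes that Start bp and End bp are 1-based, inclusive positions and that the last codon of the unspliced CDS is a stop codon that is not counted in the protein length. The record does not state either convention, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1898095
end_bp = 1899693
gene_length_bp = 1599
protein_length_aa = 532

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# The gene is listed as unspliced, so the span should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the final codon is a stop codon and is not part of the protein.
expected_protein_aa = codons - 1

print("span (bp):", span_bp, "matches record:", span_bp == gene_length_bp)
print("codons:", codons, "leftover bases:", remainder)
print("expected protein (aa):", expected_protein_aa,
      "matches record:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the span works out to 1599 bp, which is 533 codons, and dropping the stop codon leaves 532 residues, in agreement with the Gene Length and Protein Length fields.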


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0270492
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00149176
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGCCTTT ACGGATCCGC TCCGGAAGCA CCGGACCCGA AGCAAACGGC GTCCGCGCAG 
ACTGCGACGA ACATCGGAAC CGCGGTTGCC AACAACGTCA TGGGCAACGC CAACCAGGTC
ACGCCAGACG GCAACCTGAC CTATACCTAT AACACTCAGA AGTGGACCGA TCCTCTCAGC
GGGAAAGAAT ACGACCTCAA GGTTCCGACG GCGACACAGA CACTTTCGCC CGCGCAGCAG
GCGATCAAGG ACCAGGAGGA CGCCGCCCAG CTGAACCTGG CGACGCTTGC CAACACCCAA
TCGGGAAAGC TCAACGGCCT TCTCGCCAGT AAGTTCGACA TATCCGGCGC TCCAGCGGCC
GGAAAGTCGG ACGCGATCGG GCTGCCGCAG TATCAGAGCT TCACGAGCGG TCCGAAGCTG
CAGACCAGCC TCGCAAATGC CGGCAACGTT CAAAGCTCGA TTGCAGGTGC CGGTTCCATA
CAGAGCCAGG TTGCGGACAG CGGCAAGATC CAGACTTCGC TTGGCAATGC CGGGAACATC
ACCGAGAGCT ATGATTTCGA CATCGACACG TCGAAATACG AACAGGCGCT GATGGACCGC
CTCAGCCCGC AGATCGAGCG GGACCGCGCC GCCCTTGAAA CGAAGCTGAC CAACCAGGGA
CTGCAGCCGG GCTCCGAGGC CTATGACCGT GCGATGGACG AGGCGAACCG CGCGGCGAAC
GACGCCCGGA TAGGGGCAAC CCTGAGTGCC GGGCAGGAGC AATCGCGGAT CGCCGGTCTG
GCGCAGAACC AGGCGCAGTT CCAGAATTCG GCACAGCAGC AAGCCTACGA CCAGATGACC
GGACTGGCCC AGTTCTACAA TTCGGCGCAG GCACAGCAAT ACGCGCAGAA CGCGAACGAC
ATGCAGATGG GAAACGCCGC TCAGCAGCAG CAGTTTTCGC AGAACCAGGC GCAGATGCAG
GCCAACAATG CAGGTCAGGA GCAGAAATTC AACCAGGGAC TGACCGCGGC GCAGTTCGGA
AACGACGCCC TGCAGCAGCA GTACCAGAAC CAGAACACGG CGACGGGCGG CAACAATGCT
CTGGCGGATC AGAGATTCAA TTCTCAGCAG GCGAAGTACA ACCTGCAAAA CCAGGAGCGG
GCACAATATC TGAACGAGCT TTACGCGCAG CGCAACCAGC CGATCAACGA GATCGTCGGG
CTGATGTCCG GGGCGCAGGT CGACAGCCCG AGCTTCGTGC CGACCCAGAG CAACCCCATG
CCGACCGTCG ATTATGCCGG GCTGGTGCAA CAGGACTATG CCAACAAGAT GGGCGCCTAC
CAGCAGAAGC AAAGCACGAT GCAGAACCTC TTTGGCGGCA TGCTCGGTTT CGGCGGGCAA
CTGGCCAGCC TCTCGGACAA GCGCGCGAAG AAGGACATCA AGAAAGTCGG CGGCCTCTAC
GAGTACAGGT ACAAAGGTGA AGGCAGGAAC GCTCCCAAGC GGATAGGCGT GATGGCGCAG
GAGGTGGAAA AAGTGCGCCC CGACGCTGTC GCCAAGGGCG CCGATGGCCT GCGGCGCGTG
GATTACGGAC TGCTCTTCAA CGCAGGGAGA GGCAAATGA
 
Protein sequence
MGLYGSAPEA PDPKQTASAQ TATNIGTAVA NNVMGNANQV TPDGNLTYTY NTQKWTDPLS 
GKEYDLKVPT ATQTLSPAQQ AIKDQEDAAQ LNLATLANTQ SGKLNGLLAS KFDISGAPAA
GKSDAIGLPQ YQSFTSGPKL QTSLANAGNV QSSIAGAGSI QSQVADSGKI QTSLGNAGNI
TESYDFDIDT SKYEQALMDR LSPQIERDRA ALETKLTNQG LQPGSEAYDR AMDEANRAAN
DARIGATLSA GQEQSRIAGL AQNQAQFQNS AQQQAYDQMT GLAQFYNSAQ AQQYAQNAND
MQMGNAAQQQ QFSQNQAQMQ ANNAGQEQKF NQGLTAAQFG NDALQQQYQN QNTATGGNNA
LADQRFNSQQ AKYNLQNQER AQYLNELYAQ RNQPINEIVG LMSGAQVDSP SFVPTQSNPM
PTVDYAGLVQ QDYANKMGAY QQKQSTMQNL FGGMLGFGGQ LASLSDKRAK KDIKKVGGLY
EYRYKGEGRN APKRIGVMAQ EVEKVRPDAV AKGADGLRRV DYGLLFNAGR GK