Gene Smed_5654 details


Gene Information

Locus tag: Smed_5654
Symbol:
ID: 5319956
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 619466
End bp: 620506
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 63%
IMG OID: 640777388
Product: hypothetical protein
Protein accession: YP_001314320
Protein GI: 150377725
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
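
As a quick consistency check, the gene and protein lengths listed above follow from the start and end coordinates. The sketch below assumes 1-based, inclusive coordinates and a single stop codon at the end of the CDS. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(619466, 620506)
print(length)                  # 1041
print(protein_length(length))  # 346
```

Both values agree with the Gene Length and Protein Length fields of this record.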


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.235146
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGCGTA AAGATGCCAT CTCACAAATC TGGTACACCC GCTGCCCTGT CCCGACACCA 
GTGGGCCTTG CCACCCAGCT TGGTCTTCTC GACACGGCAT TTGCCGCCGA GGGCATCACG
CTCAACTCCA TCATCGACAG CAAGGATCGC TCCATCCGGT CCAGCCACTT CGATCACCAT
CTCGACTATT CGTTTCGCCA TGGCGGCAAT GTCCCGCCGG TCCGCGCCCG CTCGGAAGGC
AACCCGACGC GGCTTGTCGG CATCACATGG ACCGATGAGT TCCAGGCCAT CATCACGTTG
CCGGGCACCG GCATCAAAAC GACACGCGAC CTTTTTGGCC GGCGCTTCGG CATTGCGCGC
CGTCCGCCAG GCATCGTCGA CTTCATGGCC GCCACCGCGC TGAAAGGCCT TGTTTCCGCA
CTGTCGCTCG AAGGGCTCGC ACCCTCCGAT GTCGAGATCG TCGATATCCC GCTTTCCGAA
AGCGTGCTCG ATGGCAGAGA GGGTCCCCAG CTCTACGGCC TGCGCAACCG TCAAGCCTAT
GGCCCCGAAA TCGCCGCGCT GCTGCGCGGC GAGGTCGACG CAATCTATGT CAAAGGTACG
CCCGGCATTG CCGTGGCCAA TCTCTTTGCG GCCCACATGG TCGCGGAATT CGGCTTTCAC
CCCGACCCGA AGATCCGCAT CAATTCCGGC TCCCCACGGG TGTTGACCGT CGATGAACGG
CTGGCGCAAG ACCGCCCCGA TCTCGTCGCC AAGCTGATCG CGACTTTGAA GCAGGCTGGC
GCCTGGGCCG AAGAACATCC GGACGAGGTG CGCCGCTTCG TTGCCCGCGA GGTCGGCGCA
TCCGAAGAGG TCGTGGCTGC GGCCAACGGT CCGGATCTCC ACAAACATCT CGGCATCGGC
CTTGAACCGA CACTCGTCGA GGCGATCGGG CACTACAAGG ACTTCCTGCA TGAATGGGGT
TTCCTGGCGA GCAACTTCGA CATCGACACA TGGGTCGACC ACCGCCCCTG GGCGGAACTC
GACATCCGCG CTGTCGCTTG A
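
The GC content field in the Gene Information section can be recomputed from a sequence printed in this layout. The following is a minimal sketch, assuming the sequence is pasted as a string with the ten-base blocks separated by whitespace. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First ten-base block of the gene sequence: 4 G + 2 C out of 10 bases.
print(gc_percent("GTGACGCGTA"))  # 60.0
```

Passing the full gene sequence to the same function gives the value for the whole CDS.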
 
Protein sequence
MTRKDAISQI WYTRCPVPTP VGLATQLGLL DTAFAAEGIT LNSIIDSKDR SIRSSHFDHH 
LDYSFRHGGN VPPVRARSEG NPTRLVGITW TDEFQAIITL PGTGIKTTRD LFGRRFGIAR
RPPGIVDFMA ATALKGLVSA LSLEGLAPSD VEIVDIPLSE SVLDGREGPQ LYGLRNRQAY
GPEIAALLRG EVDAIYVKGT PGIAVANLFA AHMVAEFGFH PDPKIRINSG SPRVLTVDER
LAQDRPDLVA KLIATLKQAG AWAEEHPDEV RRFVAREVGA SEEVVAAANG PDLHKHLGIG
LEPTLVEAIG HYKDFLHEWG FLASNFDIDT WVDHRPWAEL DIRAVA
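
The protein sequence corresponds to the gene sequence translated with translation table 11. Note that the gene begins with GTG, which is read as methionine (M) when it serves as the start codon. The sketch below is an illustration under stated assumptions rather than the tool used to produce this record: it builds the codon table from the standard NCBI base ordering (TCAG), and the set of alternative start codons is taken from the usual description of table 11 and should be checked against the NCBI genetic code listing.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed start codons for table 11; an initiating codon is translated as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(sequence: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three ten-base blocks of the gene sequence.
print(translate("GTGACGCGTA AAGATGCCAT CTCACAAATC"))  # MTRKDAISQI
```

The output matches the first ten residues of the protein sequence shown above.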