Gene Smed_5641 details


Gene Information

Locus tag: Smed_5641
Symbol:
ID: 5319943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 606459
End bp: 607856
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 59%
IMG OID: 640777377
Product: hypothetical protein
Protein accession: YP_001314309
Protein GI: 150377714
COG category: [S] Function unknown
COG ID: [COG5361] Uncharacterized conserved protein
TIGRFAM ID:
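
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record, but the record does not state them. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 606459
end_bp = 607856

# Assumption: 1-based, inclusive coordinates.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1398 465
```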


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.617076
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCACT CGCCGATTTT CCTCGCGCTC GCCCTATCGA TTTCGACGGC CGACACTGCA 
GCTGGTTTCA CGACGGACGA GCTTCGAAAC CGAACCATTG AGAGGCGGGC CGTCGAAGCC
GTCAACTGGG GCATTCCGGT GGTGAACTTC GACCGGATGC TGCAAGCGTT CAAGGAAAAA
GGCGGCGATT TCAACCAGAT CGTCTACTGG GGCGGACTGT TCGACTGGAG GAACCAGACA
CTCACACCCA ACCCGGACAC GATCTACTTC AAGCCTTTCT GGGACACGAA GATGGCCGGA
CCGATTGTGA TCGAAATCCC TCCTGCGGGA GAAGATGGAT CGATCACGGG AACGCTGATG
GACATGTGGC AGGCGGCACT GGAGGACGTC GGCCCGGCGG GCGTGGACCA GGGTAAAGGC
GGGAAGTATC TGATCCTGCC GCCCGACTAC AAGGAGAAGC CTCCCGAAGG CTTTATCGTC
CTGCCGTCTT CGACCTATGA GGGCTTCGGT CTGCTCCGGT CCGTGATCAA TGGCAGTGGG
CCAGATGCGG CCAGGCGCGC CGTCGATTAC GGCTTGAAGG TCAAACTCTA TCCGCTGGCA
CAGGAAGCCA GCCCGTCGAA GACGAAGTTC ATCGACGTTC TCGGACAGAT GTTCGACTCC
ACCATCCGCT ATGACCTCAG CTTTTTCCAG TCGCTCAACC GGGTCGTCCA GTACGAACCA
TGGTTGTCTC GGGACAAAGT CATGGTCGAC ATGCTGAAGA CCATCGGCAT CGAGAAGGGA
AAGCCGTTCA ATCCAGACGA GTCTGCCCGA AAGGTGCTTG AAAGCGCCGT TGACGAAGCG
CATGCCTGGT TCGACTTCCG TTATGAAACG ACATTCGCTC CGTACTTCAA GGACACGCAC
TGGGCCGTGC CCGCATCTCC GGAATTGATG GAGGTGTCCG ACAGCTTCTA CGAGTCGCCA
GACAGCTACG CCATCGACGC CAGGGGCGTC ACAGATTATT GGGCTTTCAG CACGGTTAAG
CACCTCGGTG CGGGGCAATT CTACCTGATG TCGACAAAGG ACAAGCATGG GGCGCCCCTC
GACGGAGGAA AGGCGTACAA GCTCACCATC CCCGCAAACG TGCCAGTCAC GCAGTACTGG
TCGGCCGTGG TCTATGACAG GGCTACTCAC GCGCTGATCC GCGATGTCGC GAGCCCCAGC
AAATCCTCGC AGACGCCGGG GCTTCAGGTG AACGAGGACG GGACTGTTGA TCTATACTTC
GGCCCCGACG CGCCGTCCGG CAAAGAATCG AACTGGACAC CCACGAAAGC CGGGGGCCGC
TTCGAAGTGC TTTCCGCCTC TACGGCCCGC AAAAGCCTCT CTTCGACAAG ACGTGGACCC
TTCCCGATAT CGTGGTAG
 
Protein sequence
MQHSPIFLAL ALSISTADTA AGFTTDELRN RTIERRAVEA VNWGIPVVNF DRMLQAFKEK 
GGDFNQIVYW GGLFDWRNQT LTPNPDTIYF KPFWDTKMAG PIVIEIPPAG EDGSITGTLM
DMWQAALEDV GPAGVDQGKG GKYLILPPDY KEKPPEGFIV LPSSTYEGFG LLRSVINGSG
PDAARRAVDY GLKVKLYPLA QEASPSKTKF IDVLGQMFDS TIRYDLSFFQ SLNRVVQYEP
WLSRDKVMVD MLKTIGIEKG KPFNPDESAR KVLESAVDEA HAWFDFRYET TFAPYFKDTH
WAVPASPELM EVSDSFYESP DSYAIDARGV TDYWAFSTVK HLGAGQFYLM STKDKHGAPL
DGGKAYKLTI PANVPVTQYW SAVVYDRATH ALIRDVASPS KSSQTPGLQV NEDGTVDLYF
GPDAPSGKES NWTPTKAGGR FEVLSASTAR KSLSSTRRGP FPISW
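
The relationship between the two sequences above can be checked by translating the gene sequence. The following is a minimal sketch using only the Python standard library. It relies on the fact that NCBI translation table 11 assigns the same amino acid to each codon as the standard table 1 (the two differ in permitted start codons). The function name `translate` and the variable names are illustrative, and only the first and last lines of the gene sequence are translated here. Each 60-base line starts in frame because 60 is a multiple of 3.

```python
# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCAGCACT CGCCGATTTT CCTCGCGCTC GCCCTATCGA TTTCGACGGC CGACACTGCA"
last_line = "TTCCCGATAT CGTGGTAG"

print(translate(first_line))  # MQHSPIFLALALSISTADTA
print(translate(last_line))   # FPISW
```

Both results agree with the start and end of the protein sequence listed above.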