Gene Smed_2587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2587 
Symbol 
ID5323455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2688482 
End bp2689807 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content62% 
IMG OID640791530 
Productnucleoside recognition domain-containing protein 
Protein accessionYP_001328252 
Protein GI150397785 
COG category[S] Function unknown 
COG ID[COG3314] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.577285 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.214795 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCACA TCACTACCGA GGCAAAGGGC TCGCGCGGCG GCGCCGCATT GCGGCTCGTG 
CTCTGCAGCT TCATCGGCAT ATTCCTGTTC TTCGTGCCGG TCGATATCAA CGGCAAATCG
ACGATCCTGC TCGACCATGC GGCCACCGCC ATATCGACGC AGGCGCGCCC GGTTGCGATC
GGCTTCGTAT TGTTGCTGAT GGCCTATGGC GCCTTCGCAC CCTTCCTGCG CGGTACCTGG
CGCAAGACGG TGACCGATGC CGTGTTTTCG GTCCTGCGGG TCCTCGGACT GGCGCTCGCC
GTCCTTTATC TCGCAGGCAT CGGGCCCGAG GTCCTCTTCG CACCCGACAT GCTTCCCTTT
CTTTTCGACA AGCTGGTGCT CTCGGTCGGG CTGATCGTGC CGATAGGCGC TCTGGCACTC
GCGTTTCTGA TCGGCTACGG CCTTCTCGAA TTCACCGGCG TGCTCGTCCA GCCGGTGATG
CGGCCGATCT GGCGTACGCC GGGTTGGTCG GCTATCGACG CCGTCGCTTC CTTCGTCGGC
AGCTATTCCC TTGCGCTCCT GATCACCGAC CGGGTCTTCC GTGAAGGCAA GTATACGGTA
CGCGAAGCCG CGATCATCGC CACGGGCTTT TCCACCGTGT CGGCTACGTT CATGATCATC
GTGGCCAAGA CGCTGGGGCT CATGGACGTC TGGAATTTCT ACTTCTGGAC CACGCTTGTC
GTCACGTTCA TCGTTTCTGC GATCACGGCC CGGATCTGGC CGCTTGCCGG GCTTGCGCAC
GAGGGCGATC GGGACCAGCC GCTGCCCGCC GGCCGCAGCC GCCTGAGGTT CGCCGTCGAG
ACCGGCCTCG AACAGGCGGC TACTGCCAAG AGCCTGCCGA CACTCTTGCG AGAAAGCTTC
CTTGATGGCC TGCGCATGGC CGCGATGATC CTGCCGAGCA TCATGGCTGT CGGACTCCTC
GGGCTTCTTG CCGCAAAATT CACGCCGATC TTCGACATTC TCGGCCTGAC GCTCTATCCC
TTCACCTGGA TCGTGCAGTT CGGCGAGCCG ATGCTCGCGG CAAAGGCACT CGCCTCCGGC
CTTGCCGAGA TGTTCCTGCC TGCCATACTG CTCAAGGAGG CAGCACCGGA CATGAAGTTC
GTGGCGGCCG TGGTGTCTGT CAGCCAGGTC CTGTTTCTGT CGGCCTCCGT TCCCTGCATG
CTGGCGACCT CTATCCCGCT CAGCTTCCGC AACCTTCTGG TGATCTGGTA TATTCGCGTC
GTATTGAGCA TACTGGTGAC GGCACCGATC GTTTGGATCG GAACATCCAT GGGATGGCTC
GGCTGA
 
Protein sequence
MTHITTEAKG SRGGAALRLV LCSFIGIFLF FVPVDINGKS TILLDHAATA ISTQARPVAI 
GFVLLLMAYG AFAPFLRGTW RKTVTDAVFS VLRVLGLALA VLYLAGIGPE VLFAPDMLPF
LFDKLVLSVG LIVPIGALAL AFLIGYGLLE FTGVLVQPVM RPIWRTPGWS AIDAVASFVG
SYSLALLITD RVFREGKYTV REAAIIATGF STVSATFMII VAKTLGLMDV WNFYFWTTLV
VTFIVSAITA RIWPLAGLAH EGDRDQPLPA GRSRLRFAVE TGLEQAATAK SLPTLLRESF
LDGLRMAAMI LPSIMAVGLL GLLAAKFTPI FDILGLTLYP FTWIVQFGEP MLAAKALASG
LAEMFLPAIL LKEAAPDMKF VAAVVSVSQV LFLSASVPCM LATSIPLSFR NLLVIWYIRV
VLSILVTAPI VWIGTSMGWL G