Gene Smed_4519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4519 
Symbol 
ID5318494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1004915 
End bp1006273 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content65% 
IMG OID640776320 
Producthypothetical protein 
Protein accessionYP_001313252 
Protein GI150376656 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1030] Membrane-bound serine protease (ClpP class) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.829865 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.308497 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCGCA TCCTCTTCCT GTTGTTTAGC CTCCTGTCGG CCTTCATGCT TCCTGTCTCG 
CCGGCTCCGG CAGCCGAGCG GAAAGCAATC GTCCTGCACG TAAACGGCGC CATCAGTCCG
GCCACGGCGG AATATGTAAC GCGCGGCCTG CGGCGGGCCA GGGACCGCGG TGTCGCGCTG
GTGGTCCTGC AGATGGATAC GCCCGGCGGA CTGGATACGT CGATGCGCGA AATCATACGC
GCCATCCTCG ATTCACCCGT GCCGGTCGCG AGTTTCGTCG CGCCCAGCGG AGCGCGGGCG
GCAAGCGCCG GCACCTATAT TCTTTATGCG AGCCACATCG CGGCCATGGC GCCCGGAACC
AATCTGGGCG CCGCCACGCC GATCGCCATC GGCGGTGGAC TTTTTGGCGA TGACGAGCGG
GACGGCGAGG AAACGCCGGG CGAACCGGAC AAGCAGGAAC CCCGCAAGCC GGCCGATGCC
GGCGAGGCGA AACTTATCAA CGATGCAATC GCTTATATCC GCGGCCTTGC GGAGTTGCGC
GGTCGCAATA TCGACTGGGC GGAGCGCGCC GTGCGCGAGG CTGCCAGCCT TTCCTCGGCC
GCAGCCGCGC GCGAACAGGT CATCGACTTC ACCGCGATCA ACCTCAATGA CCTCCTCAAG
CAGGCACACG GTCGCTCCGT TCGCATCGGT CAATCCGACG TCCGGCTCGA TACTGCAGGG
CTCTTCATCG AGGACTTGCC GCCCGATTGG CGCACGCAGC TCTTATCGGT GATCACCAAT
CCCAATGTCG CTCTCCTTCT GATGATGGTC GGCATCTATG GGCTCATCTT CGAGTTTCTC
TCACCCGGCA CCGTTGTGCC CGGGACCATT GGCGGCATAA GTCTCCTGCT CGGTCTCTAC
GCCCTGGCGG TGCTGCCTGT GAGCTATGCC GGCGTTGCCC TCATCCTGCT CGGAGCCGGG
CTGCTGGTCG CGGAAGCGCA TGCGCCGTCT TTCGGCGTTC TCGGCCTTGG CAGCGCCGTC
GCGCTGGTGC TCGGTGCCGC AATTCTTTTC GACACGGACG TACCGGGACT GCAGGTGTCC
TGGCCGGTTC TGAGCGGCAT CGGGTTCGCA AGCCTGGCTT TCGGCCTGCT GGTCGCCCGT
CTCGCTCTTC TCTCGAGCCG ACACAAGATC CTCACCGGAG CGGAGGAGAT GATCGGCATC
TCCGGAAAGG TCGACAGCTG GGAGGGAGCG GGCGGCTACG TGATTGCCCA CGGCGAGCGG
TGGAGCGCAG TCAGCAATGA ACCGCTCGGT CCGGGAGAGG ACGTCATGGT CGTCGGCCGT
CAGAGTTTGA CGCTGGAGGT GGCGCGCAAG CCAACTTGA
 
Protein sequence
MARILFLLFS LLSAFMLPVS PAPAAERKAI VLHVNGAISP ATAEYVTRGL RRARDRGVAL 
VVLQMDTPGG LDTSMREIIR AILDSPVPVA SFVAPSGARA ASAGTYILYA SHIAAMAPGT
NLGAATPIAI GGGLFGDDER DGEETPGEPD KQEPRKPADA GEAKLINDAI AYIRGLAELR
GRNIDWAERA VREAASLSSA AAAREQVIDF TAINLNDLLK QAHGRSVRIG QSDVRLDTAG
LFIEDLPPDW RTQLLSVITN PNVALLLMMV GIYGLIFEFL SPGTVVPGTI GGISLLLGLY
ALAVLPVSYA GVALILLGAG LLVAEAHAPS FGVLGLGSAV ALVLGAAILF DTDVPGLQVS
WPVLSGIGFA SLAFGLLVAR LALLSSRHKI LTGAEEMIGI SGKVDSWEGA GGYVIAHGER
WSAVSNEPLG PGEDVMVVGR QSLTLEVARK PT