Gene Smed_3724 details

Gene Information

Locus tag: Smed_3724
Symbol:
ID: 5318590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 165777
End bp: 167033
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 63%
IMG OID: 640775537
Product: ribulose-bisphosphate carboxylase
Protein accession: YP_001312470
Protein GI: 150375874
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1850] Ribulose 1,5-bisphosphate carboxylase, large subunit
TIGRFAM ID:
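
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function name `expected_lengths` is hypothetical and not part of the source database.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive range
    codons = gene_length // 3             # whole codons in the CDS
    protein_length = codons - 1           # drop the stop codon
    return gene_length, protein_length


# Values copied from the record above.
gene_len, protein_len = expected_lengths(165777, 167033)
print(gene_len, protein_len)  # 1257 418
```

Both results agree with the Gene Length and Protein Length fields of the record.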


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.630821
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCCACA TAACCTACCG GATAGAGACA CCAGGAGACG TCGAGGCGCT GGCGAGAAAA 
ATCGCAAGCG ACCAATCAAC GGGGACGTTC GTTGCTGTGC CGGGGGAAAC GGAGGAGCTG
AAGCGGCGGG CAGCTGCACG CGTCGTGGCC ATTCGCCACC TCCCACCGGC CGACCGGGCG
TCGTTGCCGG ACGAGGCTGG AGACGCAACG CGTTTCAACC GGGCCGATGC GGAGATCGCA
TATCCTCTTG AAGCGGTGGG AACGGACCTG TCCGCCCTGA TGACCATTGC AATCGGCGGC
GTCTATGCCA TCAAAGGCAT GACCGGAATC CGTGTTGTCG ACATGAAACT CCCGCCGGAG
TTCGCGGCGG CGCATCCGGG TCCGCAATTC GGGGTCGCCG GCAGTCGCCG CCTGACCGGT
GTCGAAGGCC GCCCCATTAT CGGCACGATC GTCAAGCCGG CGCTCGGCCT GCTGCCGGAC
GAAACGGCAG CGCTCGTAGG CGACCTGCTT TCTTCCGGCG TCGACTTCAT CAAGGACGAC
GAGAAGCTGA TGAGCCCGGC CTATTCGCCG CTGAGTGCGC GCATCGCCGC CATCATGCCG
AAGATACGCG ACCATGAGCA GAAGACCGGC AAGAAAGTCA TGTATGCCTT CGGCATATCC
CATACCGATC CCGATGAGAT GATGCGGAAC CACGATCTCG TGGTCGCCGC TGGTGGCAAT
GCGGCGGTCG TCAATATCAA TTCGATCGGC ATGGGCGGTG TCGCCTTCCT GCGCAAGCGC
TCGAACCTCG TGCTCCACGC CCACCGTAAC GGCTGGGACA TTCTCACGCG CCATGGCGGC
CTGGGGATGG AGTTTTCGGT GTGGCAGCAG TTCTGGCGCC TCCTCGGAGT GGATCAGTTC
CAGATCAACG GCATCCGCGT CAAATACTGG GAGCCGGACG ACAGTTTCGT GAAGTCCTTC
AAGGCGGTAA GCACGCCGCT GTTTTCCAGG GAGGATTGCC CGCTGCCGGT CGTGTGCTCC
GGCCAATGGG GAGGACAGGC GCCCGAGACC TTTGTACGCA CCGGACGTAC GACCGATCTG
CTCTATCTCT GCGGTGGCGG AGTCGTAAGC CACCCGGGCG GAGCCGGTGC CGGCGTGCGA
GCCGTGCGTC AGGCATGGGA GGCGGCAGTC GCCGGGATAC CGCTCTCGGA TTATGCTAAG
GAGCACCCCG AACTGGCGCA ATCGATCGAG AAATTCGCCG ACGGAAAGGG CGCTTGA
 
Protein sequence
MIHITYRIET PGDVEALARK IASDQSTGTF VAVPGETEEL KRRAAARVVA IRHLPPADRA 
SLPDEAGDAT RFNRADAEIA YPLEAVGTDL SALMTIAIGG VYAIKGMTGI RVVDMKLPPE
FAAAHPGPQF GVAGSRRLTG VEGRPIIGTI VKPALGLLPD ETAALVGDLL SSGVDFIKDD
EKLMSPAYSP LSARIAAIMP KIRDHEQKTG KKVMYAFGIS HTDPDEMMRN HDLVVAAGGN
AAVVNINSIG MGGVAFLRKR SNLVLHAHRN GWDILTRHGG LGMEFSVWQQ FWRLLGVDQF
QINGIRVKYW EPDDSFVKSF KAVSTPLFSR EDCPLPVVCS GQWGGQAPET FVRTGRTTDL
LYLCGGGVVS HPGGAGAGVR AVRQAWEAAV AGIPLSDYAK EHPELAQSIE KFADGKGA
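
The relation between the two sequences above can be illustrated with a short translation sketch. It builds the codon table shared by the standard code and translation table 11, and treats a leading GTG as an initiator that is read as methionine, which is why the protein begins with M although the gene begins with GTG. The names `translate`, `CODON_TABLE` and `START_CODONS` are hypothetical, and the start codon set is only a common subset of the initiators allowed under table 11.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third base fastest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip([a + b + c for a in BASES for b in BASES for c in BASES], AMINO)
)
# Assumption: a common subset of table 11 initiators, not the full list.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "GTGATCCACA TAACCTACCG GATAGAGACA CCAGGAGACG TCGAGGCGCT GGCGAGAAAA"
print(translate(first_line))  # MIHITYRIETPGDVEALARK
```

The output matches the first twenty residues of the protein sequence. A maintained library routine would be preferable for real analyses; this sketch does not handle ambiguous bases or partial codons.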