Gene Smed_4453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4453 
Symbol 
ID5318605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp937679 
End bp938869 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content62% 
IMG OID640776255 
Productcarboxymuconolactone decarboxylase 
Protein accessionYP_001313188 
Protein GI150376592 
COG category[S] Function unknown 
COG ID[COG0599] Uncharacterized homolog of gamma-carboxymuconolactone decarboxylase subunit
[COG1917] Uncharacterized conserved protein, contains double-stranded beta-helix domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0765828 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTACA TTGCCGCAAG CATTGCTATC TCGGCTCTCG CCGCCACCGC TGTGGAGGCG 
CAGGAAGAGC GGCGAAAGAT CGCTCCACCG GCCGTCTATG ACGTCGCGCC GGGTCTCGGC
CACTTCACCG ACGATGTTCT GTTCGGCGAA GTCTGGGAAC GAACAGAGCT CCCGTCTCGC
GACCGGAGCC TCGTAACCCT TTCGGCGATC GTCTCGACGG GCAAGACGGC GCAGATTGGC
GCCCATGTGA GCCGGGCCCT GGACAACGGT GTGAAGCCTG GAGAGATCGG CGAACTCATC
ACTCATCTTG CATTCTACTC CGGTTGGCCA AACGCGATCT CCGCCGTGAC GGAGACGAAG
AAGGTTTTCG ATGAGCGCCA GATTGCACCC GTCAAGAACA GCGAGGCGGC GCGCATAGAA
TTGGAAGCCG CAGCCGAGGC GGCTCGGAGC GAGACGGTCA GCACCACGGT TGCACCAACG
GCGGCAGCAC TGGCCGACCT TACCAACCGC GTGCTCTTCG GCGATCTGTG GCAGCGCCCG
GATCTGTCGG CGCGCGACCG TAGCTTGGTG ACGATCGCCG CTCTGATCGC GGTTGGTCAG
CCGGAACAAC TGCCGTTTCA TGCCAACCGC GCGATGGACA GCGGCTTGAC GCCGTCAGAA
GCTTCAGAAG TACTGGCGCA TGTCGCTTTC TACGCCGGTT GGCCGAGAGC CATGTCCGCC
GTGCCCGTTC TCAAGCAGGT TCTCAATAAC AGGCAAGGAA CTCAGGTGAG CGCTTCCCAG
GCAGATCTGA AGATTACTCC AGCCGGAATT GGTTCTGCGT CAGCTCCGGA GGAGTACTTC
ACAGGTACCG TCCAGATCTC GGGCCGTTAT CAAGCCGACG CTCCCGCGCG CATTGGCGGG
GCAACCGTCT CCTTCTCCGC CGGCGCTCGC ACGGCCTGGC ACACACATCC TCTCGGCCAG
ACCTTGTTCA TCGTGAGCGG GCGCGGCCTG GTTCAGAAGG AAGGTGAGGC AGTTCAGGAA
GTAGGTTCGG GAGACGTGGT ATGGATCCCA CCGCTGATCC GGCACTGGCA TGGCGCCTCC
AGCACCGGGC CGATGACGCA TTTCGCAGTG GCCGAGGCGC TCGATGGAAG CTCAGTCACG
TGGATGGAAA AGGTGTCCGA CGAGGACTAC GGCAAGGGTG TTCGAGAGTA G
 
Protein sequence
MKYIAASIAI SALAATAVEA QEERRKIAPP AVYDVAPGLG HFTDDVLFGE VWERTELPSR 
DRSLVTLSAI VSTGKTAQIG AHVSRALDNG VKPGEIGELI THLAFYSGWP NAISAVTETK
KVFDERQIAP VKNSEAARIE LEAAAEAARS ETVSTTVAPT AAALADLTNR VLFGDLWQRP
DLSARDRSLV TIAALIAVGQ PEQLPFHANR AMDSGLTPSE ASEVLAHVAF YAGWPRAMSA
VPVLKQVLNN RQGTQVSASQ ADLKITPAGI GSASAPEEYF TGTVQISGRY QADAPARIGG
ATVSFSAGAR TAWHTHPLGQ TLFIVSGRGL VQKEGEAVQE VGSGDVVWIP PLIRHWHGAS
STGPMTHFAV AEALDGSSVT WMEKVSDEDY GKGVRE