Gene Smed_4151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4151 
Symbol 
ID5319200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp622883 
End bp623959 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content62% 
IMG OID640775956 
Productphenylacetate-CoA oxygenase/reductase, PaaK subunit 
Protein accessionYP_001312889 
Protein GI150376293 
COG category[C] Energy production and conversion 
COG ID[COG0633] Ferredoxin
[COG1018] Flavodoxin reductases (ferredoxin-NADPH reductases) family 1 
TIGRFAM ID[TIGR02160] phenylacetate-CoA oxygenase/reductase, PaaK subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.522682 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.83112 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGTT TTCACCCCCT ACAAGTCACC GAAGTCCGGC GCGAGACGCG CGATGCGGTC 
GTCGTCACGC TCGAGCCGCG CGATGAGGAC CGCGCCGCTT TCGATTTCAC GCAGGGGCAA
TACCTGACCT TCCGCCGCAT ATTCGACGGC GAGGAACTGC GCCGTTCCTA TTCGATCTGC
TCCGGCCTCG GCGAGGGCGC CTTGAGGGTA GGCATCAAAC GCGTCGACGG AGGTTGCTTT
TCCAACTGGG CGAATGAGGT GCTCAAGCCC GGCGACACGC TTGAAGCGAT GCCGCCGATG
GGGACTTTCT TCGTGCCTGT CGAACCGGAG GTGTCCAGAC ACTATCTCGG TTTCGCCGGC
GGCAGCGGCA TCACGCCGGT GCTTTCGCTC GTCAAAACGG TGCTCGCGCG CGAACCGCGG
TCCGCATTCA CGCTGGTCTA TGCCAATCGC CACTTCAGCT CGATCATGTT TCGCGAGGAA
CTGGACGACC TCAAGAACCT CTATCTCGGC CGCCTCTCGG TGCTGCATAT TCTCGAGAGC
GAAGCCCAGG ACATCGATCT TTTCAGCGGG CGGCTCGATT TGGAAAAATG CACTGCCCTG
TTCCGGCACT GGATCGACGT GAAGTCAGCC GATATCGCCT TCATCTGCGG CCCCGAACCG
ATGATGCAGG CGGTCGCCGC AACCCTTCGC GCGCACGGTG TGAGCGACAG CCGGATCAGG
TTCGAACTGT TCGGTTCGTC CCAGCCTGGC CGCGCCCGCC GAAGGACGGC AAGCCCCGCC
GGCACCGATG GAGGGTCGCG CTGCGAAGCG ACCGTGACTC TCGACGGAGC CACGCGCAGC
TTCACCCTTC CGAAACGGGG GCAGAGCCTC CTCGAAGCGG CGCTCGAAAA CAGGATGGAT
GCACCTTATG CCTGCAAGGC TGGGGTCTGC TCGTCATGCC GCGCAAAGGT GCTCGAAGGC
GAGGTGGAAA TGGAGAGCAA CAACGCGCTC GAGGATTACG AGGTAGAGCA GGGCTATGTG
CTGATGTGCC AGTCCTATCC GCTGAGCGAT CGCGTCGTCG TCAGCTACGA CGAGTGA
 
Protein sequence
MARFHPLQVT EVRRETRDAV VVTLEPRDED RAAFDFTQGQ YLTFRRIFDG EELRRSYSIC 
SGLGEGALRV GIKRVDGGCF SNWANEVLKP GDTLEAMPPM GTFFVPVEPE VSRHYLGFAG
GSGITPVLSL VKTVLAREPR SAFTLVYANR HFSSIMFREE LDDLKNLYLG RLSVLHILES
EAQDIDLFSG RLDLEKCTAL FRHWIDVKSA DIAFICGPEP MMQAVAATLR AHGVSDSRIR
FELFGSSQPG RARRRTASPA GTDGGSRCEA TVTLDGATRS FTLPKRGQSL LEAALENRMD
APYACKAGVC SSCRAKVLEG EVEMESNNAL EDYEVEQGYV LMCQSYPLSD RVVVSYDE