Gene Smed_0110 details


Gene Information

Locus tag: Smed_0110
Symbol:
ID: 5320938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 121776
End bp: 123476
Gene length: 1701 bp
Protein length: 566 aa
Translation table: 11
GC content: 60%
IMG OID: 640789042
Product: Long-chain-fatty-acid--CoA ligase
Protein accession: YP_001325805
Protein GI: 150395338
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
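
The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the helper names `span_length` and `expected_protein_length` are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 121776
END_BP = 123476
GENE_LENGTH_BP = 1701
PROTEIN_LENGTH_AA = 566


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1701
print(expected_protein_length(GENE_LENGTH_BP))  # 566
```

Under these assumptions both derived values agree with the Gene length and Protein length fields reported in the table.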


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.963067
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGAAG CAAGTACGCA GCAGGCAGGT TCCAGCACGG CGAAGATCTG GCTCGCATCA 
TACCCGCCGG GCGTGCCTGC CGAGATCGGC CCTCTCACCT ATCGTTCCAT CGGAGAATTT
TTCGATCATG CGGTCGCGCA ACATTCCTGG CGGCCCGCAT TCACCTGCAT GGGAAAGGCG
CTGACCTTCT CGGACCTCAA TGCGTATTCG GCAAAGATCG GCGCCTGGCT GCAGTCGCTG
GGTCTTGCGA AGGGCGACAG GGTCGCGGTG ATGATGCCCA ATGTCCTGCA AAATCCCGTC
ATCGTCTATG GAATCCTGCG AGCCGGCTTC ACCGTCGTCA ACGTCAATCC GCTCTATACG
CCGCGCGAAC TGGAGCATCA GCTGGTCGAC TCCGGCGCCA AGGCGATCTT CGTGCTTGAG
AACTTCGCGC ATACTGTCGA GCAGGTTCTT GCCCGCACGG CGGTCAAGCA TGTGGTGGTC
GCCAGCATGG GCGACATGCT CGGCGCAAAA GGGTTGATCG TCAATCTGGT GGTGCGCCGC
GTCAAGAAGC TCGTCCCCGC CTGGTCGATT CCCGGACATC TTTCGTTCGG GGCGGTTCTT
GCCAGGGGAG CGAAACTCGG CTTCAAGCGG GCGAACGTGG CACCGTCGGA TATCGCCTTC
CTGCAATATA CCGGTGGCAC CACCGGCGTT TCCAAGGGCG CAACGCTGAC GCACGCCAAT
CTTCTTTCCA ACATGGCCCA GATGGAACTG TGGCTGAACA CGGCCTTCCT GCGCAAGCCG
CGCCCGGAAA GCCTCACCTT CATGTGCGCG CTGCCGCTCT ATCACATCTT CGCGCTTACG
GTGAATTCGC TGATGGGTCT TGCGACCGGC GGCAACAATA TCTTGATACC GAATCCGCGC
GACATTCCCG CCTTCGTCAA GGAACTCGGA AAGTACCGGA CGAACATTTT CCCGGGCCTG
AACACACTGT TCAATGCGTT GATGAACAAT GCCGAGTTCC GCAAGGTGGA CTTCTCGTCG
CTGATCCTGA CCTTCGGCGG CGGGATGGCA GTGCAGCGAC CGGTCGCCGA ACGCTGGCTG
GAAATGACGG GCTGCCCTAT CCACGAGGGC TACGGGCTTT CGGAGACATC GCCCGTCGCG
ACCGCCAATC GCCTCGATAC CGACGACTTC ACCGGCACGA TCGGCATACC GCTGCCCTCG
ACGGAGGTGG AGATCCGCGA CGAGGACGGA AATACCCTCC CCTTGGGTGA AGTCGGCGAA
ATATGCATCC GCGGGCCGCA GGTGATGGCA GGTTACTGGC AGCGCCCCGA GGAAACGGCG
AAAGCCATTT CGCCGGACGG CTTCTTCCGG ACCGGCGATG TCGGCTTCAT GAACACCGAG
GGGCTCACCA AGATCGTCGA TCGCAAGAAG GATATGATTC TCGTCTCGGG GTTCAACGTG
TTCCCGAACG AGATCGAAGA GGTGGCGGCG ACCCATCCCG GCATTCTCGA ATGTGCCGCA
ATCGGTATTG CCGATCCGCA TTCCGGCGAA GCGGTCAAGC TCTTCGTCGT GCTTAAGGAT
CCGAATCTCA CTGAAGAAGA GATCAAGCGC CACTGCGCAG CCAGTCTTAC CAACTACAAG
CGGCCGCGTT TCGTGGAAGT CCGCACCGAA CTGCCGAAAT CGAATGTCGG CAAGATCCTG
CGCAAGGACT TGCGCGGCTA G
 
Protein sequence
MAEASTQQAG SSTAKIWLAS YPPGVPAEIG PLTYRSIGEF FDHAVAQHSW RPAFTCMGKA 
LTFSDLNAYS AKIGAWLQSL GLAKGDRVAV MMPNVLQNPV IVYGILRAGF TVVNVNPLYT
PRELEHQLVD SGAKAIFVLE NFAHTVEQVL ARTAVKHVVV ASMGDMLGAK GLIVNLVVRR
VKKLVPAWSI PGHLSFGAVL ARGAKLGFKR ANVAPSDIAF LQYTGGTTGV SKGATLTHAN
LLSNMAQMEL WLNTAFLRKP RPESLTFMCA LPLYHIFALT VNSLMGLATG GNNILIPNPR
DIPAFVKELG KYRTNIFPGL NTLFNALMNN AEFRKVDFSS LILTFGGGMA VQRPVAERWL
EMTGCPIHEG YGLSETSPVA TANRLDTDDF TGTIGIPLPS TEVEIRDEDG NTLPLGEVGE
ICIRGPQVMA GYWQRPEETA KAISPDGFFR TGDVGFMNTE GLTKIVDRKK DMILVSGFNV
FPNEIEEVAA THPGILECAA IGIADPHSGE AVKLFVVLKD PNLTEEEIKR HCAASLTNYK
RPRFVEVRTE LPKSNVGKIL RKDLRG
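
Both sequences in this section are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a rough sketch of that cleanup plus a simple GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, copied as shown above.
first_line = "ATGGCGGAAG CAAGTACGCA GCAGGCAGGT TCCAGCACGG CGAAGATCTG GCTCGCATCA"

seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))
```

Applying the same two helpers to the full gene sequence would give a value that can be compared with the GC content field in the Gene Information table.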