Gene Smed_4336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4336 
Symbol 
ID5318094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp834375 
End bp835922 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content63% 
IMG OID640776141 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001313074 
Protein GI150376478 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0623029 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00463044 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGGTTCG AGCAATTCCT CATCAGGAAC GCCGCGGCGA ACGGGGCAAA AACGGCGCTG 
GTCACCGATC GCCGGCGGCT CAGCTATGCC GAACTGGACG ATCTTTCAAC GCGTCTCGCG
GCTGCTCTTG CCGCAAACGG CGTGAAGCGG AACGATCGCG TTCTGGCGTT CATGGATAAT
TGCTGGGAGG CGGCGGTCGC AATCTTCGCG ATCCTCAAGG CCGGAGCCAC CTTCAGTCCG
ATCAACGCTT CGACCAAAGC AGACAAGCTT GCCTACGTAA TCGCGGATTG CGAGGCGGCG
GCAATCCTGA CGCAGGCGAA ACTGATGCCG GTCGTTACCG AGGCGCTTGC GCTTGCTCCC
GGTTATCGGC CTTTCATTGC CTCGGCCGCG GCGCCAGGCG GGCGCATGCC CGACGGTGCC
GCTTCCTTCG AGGAATGCCT GACAGCCGTA CCCGCCGCTG TTTCGCACGG GGGTATCGAC
ATCGATCTCG GCATGCTGAT TTATACCTCG GGGTCGACGG GACGTCCCAA GGGCGTGATG
ATGACGCATC GCAACATCGA CGCTGCCTCA GAATCGATCA CTACCTATCT CCGCAACACG
CCTGAAGACA TCATTCTGAA CGTACTGCCG CTCGCTTTCG ACTATGGTCT TTACCAGTTG
CTGATGGCGG TCCGGCTCGG CGCGACGCTC GTGCTCGAAA AATCATTCGC CTTCCCGCAG
GCGATTTTCG ACCGGATTCG GGCCGAGGGT GTCACCGGCT TCCCACTCGT GCCGACCATG
GCGGCGATGA TCCTTCAGAT GCGCGATCTC GAGCCCGGCT TCCTGCCAAG CCTTCGCTAT
CTCTCCAACA CCGCGGCAGC TCTCCCGCCG GCCCATATTG CGCGCCTGAG GGAGCTTTTT
CCCGGCGCCC GGCTCTATTC CATGTACGGC CTGACGGAGT GCAAGCGCTG CACCTATCTG
CCGCCGGAGG AGCTGGATCG CCGGCCGGGT TCCGTGGGGA TCGCGATACC GAACACGGAA
GCCTTCGTGG TCGATGACGA GGGAAACCGG CTACCGCCCG GTGTGCCTGG TGAACTGGTT
ATCCGCGGCC CGCATGTGAT GCAGGGCTAT TGGCGCAACG CTGCCGCGAC CGAGCGCATG
CTGCGCTCCG GTCCTGATCC GTGGGAAAGG GTGCTTTATA CCGGCGATCT CTTCCGCACC
GACGAGGAGG GCTTCCTCTA CTTCGTCGGC CGCAAGGACG ACATCATCAA GACCCGCGGC
GAAAAGGTGG CTCCCAAGGA GGTCGAGACC GTGCTGCACG CTCATCCGGG CGTAGCCGAA
GCCGTGGTCA TCGGCGTGCC GGATCCGGTG CTCGGTGCTG CGATCGGCGC GCTCGTCGTG
CTGTCGGACC CGTCTGTGAC CGAGAGGGAG ATTATCCGCC ACTGCGCCCG CCATCTCGAG
GATTTCATGG TGCCGAAAAT CGTCGAGTTC CGGGCTGAAC TGCCGAAGAC CGATACCGGA
AAAGTCAGCC GCCGCCTCGC GGCCGAAACA TTGGAGCCAG CAGAATGA
 
Protein sequence
MRFEQFLIRN AAANGAKTAL VTDRRRLSYA ELDDLSTRLA AALAANGVKR NDRVLAFMDN 
CWEAAVAIFA ILKAGATFSP INASTKADKL AYVIADCEAA AILTQAKLMP VVTEALALAP
GYRPFIASAA APGGRMPDGA ASFEECLTAV PAAVSHGGID IDLGMLIYTS GSTGRPKGVM
MTHRNIDAAS ESITTYLRNT PEDIILNVLP LAFDYGLYQL LMAVRLGATL VLEKSFAFPQ
AIFDRIRAEG VTGFPLVPTM AAMILQMRDL EPGFLPSLRY LSNTAAALPP AHIARLRELF
PGARLYSMYG LTECKRCTYL PPEELDRRPG SVGIAIPNTE AFVVDDEGNR LPPGVPGELV
IRGPHVMQGY WRNAAATERM LRSGPDPWER VLYTGDLFRT DEEGFLYFVG RKDDIIKTRG
EKVAPKEVET VLHAHPGVAE AVVIGVPDPV LGAAIGALVV LSDPSVTERE IIRHCARHLE
DFMVPKIVEF RAELPKTDTG KVSRRLAAET LEPAE