Gene Smed_6089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6089 
Symbol 
ID5320391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp1023923 
End bp1026844 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content58% 
IMG OID640777732 
Productlanthionine synthetase C family protein 
Protein accessionYP_001314664 
Protein GI150378069 
COG category[V] Defense mechanisms 
COG ID[COG4403] Lantibiotic modifying enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACG CAAACATCCG AAAAGAGGCA CGAGACAATA CGGCGATAGT AGCCCTTTGG 
CCATTGGTGG ACGAGCTTGC CGATCCGGTG GGCAATACAG GCGCGCCTCG TGAGAAAGCC
GGTTCCGAAG ATGCGTTTCT CTCGGGTTTA TGCTTCGCGG CGAGGCAAGA CATTTCGCGA
ACGCCTTGCC TTGCGAAAGA GCTGCCTTTT GTTCACCTGT GGTTTCCTCT GGCCGAAGTC
GCAATGGGGC TTCTGCCGCA AGCCGCGTTG GCGCCAACCG TCATCGGGCA ACTGATTGTA
GCGCTGACAG AGCGCCTTTG CGCGTTGGGG GCAGATGCAC TTCAGCAAAT ACTTCAAGAG
CGGCGTGGCG CTGGCGCGTC CGTCATCGCC GCATTAGCCC CGGAGAAAGA CCCGAGCGCC
GCGAAATGCG AGATCTATGA TCGCTTGACA GACGAACTGC GGCAGACACA TCTGGCGGAA
CTGCTGGATC GCTTTCCGGT CCTTCGACGC TTGATTCCTT TCACAATTGC AGTCTGGATC
AGAAATCTTC GCGAGCTTTT AGCCCGCCTC GAGGCGGATC GGACAGCAAT CGCTCAAAAC
TTCCGCCTTC CTGAAGACGC GGCGCTTTCC GGCATGCAAT TTGCTGTCGG CGATACCCAT
AAGGGTGGAC GGAGCGTCGT GCTGCTGGAA TTCACTTGGC AGGGGCGGCA GACAAAACTC
GTCTACAAGC CCCGTAACCT CGCCCTCGAG GCGGCGTTCC AGAATTTGCT GGCGGATCCG
CGCGCTAGCA TCGGTCTTTC ACCTCTTCCC GGGCTGAAAA TCTGGTGCGC AGAAGACTAT
GGCTATATGG AATTTGTTGA GGGCGCGCCC TGCGACAACG AAGATGTACT GCAGGCATTC
TATCGCAGTG CCGGCAGACT GGCCGCTCTC CTGCATGTTT TAGGTTACAC GGATGGTCAC
CACGAAAACC TCGTGGCGCA TCACAGCCAT CTATATGTTA TAGACGCCGA AACGCTTTTG
ACGCCCTTCG AGAAGCCAGC AATTCGCCGT CCACGGCCGG GGGACAGCCT GGGTGCGGCC
CAGACGGCAT GGTCTTGGTT CGACACCACC GTTGCACGAA CCGAGCTATT TCCAGCCTGG
ACCTTACTGG GCCTCGCACG CGATGCCGTC GATACGAGCG CTTTCGGCAT CGATCCTCAG
GAATGGCCAA GGCAAATTCC CGGCTGGCAC TCCATCAATA CGGATAACAT GGCCCGCACC
GTCATGATCG GGGAGCTCGT TCGCACGCAA TCATTGCCCG TAACGTCAGA CAGGAAGAAT
CCATTTTTCC GTCATCGTAA GCTCTTCATG GCCGGATTCC ATGCGCAGTG CAGGTCCATC
ATCGCCAACA AACCCGTTTG GCTGGGGGAA GATGGCCTGT TGCAGGCCTT CAACGGCTGC
AAGGGGCGCG CTGTCATTCG CGCGACACGG ATCTATGCGG CGCTTGCACG CCGTCAGTTG
ACGGCGGAAG CATTGCGAAG CGAGCAGTCC CAAGCAGACG TTCTTTATCC GCTGGAGCGC
CTTTTTGCGC CGCTTGCAGA TCGTCAGGAC GTTTCAAGCC TATGCGCCTC TGAAAAGCTG
CAAATGCGCC AACTGGATAT TCCCGTATTC ACGCACACGG TCTCGCAAAC GGCCCTGAAC
CTGACCGGCG ATCAACGAAT CGATGACTAT TTCGAGACGA GTGGCTTGGC GGCAAGCCGG
GAGCGAATAG AGGCATTGTC CTGCGAAAGG ATCGAGCGTG AGATCGAAAC AATAGACGCT
TTGGCATCGG CCAAGGCATT GCGGCTCGCG CAGCACGGGC GGAGTTGCGA GAACAGCCTC
CAACCGATCA CCAATCCGAC AGACCAGCTT TTATCGGAAA TCAGGCGTCG ATCAGATGGC
TGGATCGGAT TCGATCTCGC GCCGGATGGT ATCCGGTTCT GCTACCAGCC GCTTGGAATG
TCGCTCGTGA GCGGCACACT TGGTCTTGCC GTCTATCTTG CAGCGACCGG AAGACCAGAG
GCCTGCCATA TTGCAGAAAA TTTGGCTGAG CCGCTTCTGA GGCTGGCGGA GAGCGGAACA
TCGGCTGAGC GCCTGCGGTG GTGGCGAGAC CTGCCCATCG GGTTGCGTGG CAGCGGCGGA
CAGCTTCTGG CATTGTCGGC ACTGCAGAGG CTTAGCATAA TGGCCGATCG AATCCCGCAC
GCCATGCAGC AGTTGGTTGC AGTTCTCGAT CGGGCCTTGC TCGAAGATAA GCCTTGGGAT
GTCTGGTCCG GATATGCGGG CTTGCTGGGT CCGCTCCTAC TCGTGAATTC CGACACAGCG
GACACTCAGG CCGAGATGGT TGCCTGCCGT CTGGCCGACT GGACCAGCAG AGAGGCGGAA
ACGGCACCGT CTGGCTTTGC GCAAGGTGCT TTTGGTGTAT TGGCTGCGCT CAGTCAGTTT
GCGCATGCAT CAGGGAACGC AAACATTCGC GCTTCGGTTC AATCCCTGCT TCAACACAGC
TCGGAAGCCT TCTGGCGCTT CGGCCCTCGC GCCGGTTCTG AATGGAGTTG CGGCATTGCA
GGTCACATCA TCACATGCCT GTGCCTGCGT CATGACCAAG AATGGCGAGC GAATGCGGAT
GCTGCAATCC GGCGCGGGGT GATCGGATTG GCGTCCCTGC CTAACCAAGG AGCCGGTATC
CACAGCGGCA CCGCCGGCGT CGAGATGGCC TGCGCGTCAT CAGCACAGGC AACTGGCAAT
TTGCCATGCT CTTCAGACAC CCAGAAGTAC TGCGATCCAC GCCATCTCGG TCTGTTCACT
GGCGTTGCTG GTATAGGGTT CGCACAGATC CGGGACGAGC GGTCCAACGC GGCTCGCGAC
CTTATCATCT CTGCTGGTCT TCTTCCGACC ATAGGCCATT GA
 
Protein sequence
MIDANIRKEA RDNTAIVALW PLVDELADPV GNTGAPREKA GSEDAFLSGL CFAARQDISR 
TPCLAKELPF VHLWFPLAEV AMGLLPQAAL APTVIGQLIV ALTERLCALG ADALQQILQE
RRGAGASVIA ALAPEKDPSA AKCEIYDRLT DELRQTHLAE LLDRFPVLRR LIPFTIAVWI
RNLRELLARL EADRTAIAQN FRLPEDAALS GMQFAVGDTH KGGRSVVLLE FTWQGRQTKL
VYKPRNLALE AAFQNLLADP RASIGLSPLP GLKIWCAEDY GYMEFVEGAP CDNEDVLQAF
YRSAGRLAAL LHVLGYTDGH HENLVAHHSH LYVIDAETLL TPFEKPAIRR PRPGDSLGAA
QTAWSWFDTT VARTELFPAW TLLGLARDAV DTSAFGIDPQ EWPRQIPGWH SINTDNMART
VMIGELVRTQ SLPVTSDRKN PFFRHRKLFM AGFHAQCRSI IANKPVWLGE DGLLQAFNGC
KGRAVIRATR IYAALARRQL TAEALRSEQS QADVLYPLER LFAPLADRQD VSSLCASEKL
QMRQLDIPVF THTVSQTALN LTGDQRIDDY FETSGLAASR ERIEALSCER IEREIETIDA
LASAKALRLA QHGRSCENSL QPITNPTDQL LSEIRRRSDG WIGFDLAPDG IRFCYQPLGM
SLVSGTLGLA VYLAATGRPE ACHIAENLAE PLLRLAESGT SAERLRWWRD LPIGLRGSGG
QLLALSALQR LSIMADRIPH AMQQLVAVLD RALLEDKPWD VWSGYAGLLG PLLLVNSDTA
DTQAEMVACR LADWTSREAE TAPSGFAQGA FGVLAALSQF AHASGNANIR ASVQSLLQHS
SEAFWRFGPR AGSEWSCGIA GHIITCLCLR HDQEWRANAD AAIRRGVIGL ASLPNQGAGI
HSGTAGVEMA CASSAQATGN LPCSSDTQKY CDPRHLGLFT GVAGIGFAQI RDERSNAARD
LIISAGLLPT IGH