Gene Slin_0903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0903 
Symbol 
ID8724633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1076346 
End bp1079456 
Gene Length3111 bp 
Protein Length1036 aa 
Translation table11 
GC content51% 
IMG OID 
ProductLantibiotic dehydratase domain protein 
Protein accessionYP_003385756 
Protein GI284035826 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.547709 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATCG ATTTGGCCAA TTTCTATCTA TTGCGGGTCC CTGCTCTTGC TTTTCTCAGT 
GTGTCGACGC ACAGCGAAGA GGCCGACGCT CAATTGAAGG CGTTATGGGA GGAAGGAACA
CTTAGCGAAG CACTCTTCGT AGCCTCACCT TCACTTTACC AGCAAACCAT AGACCACCTG
CGAACAACTG GCTGGCCAGT GCCACCTGCC TTCCGCCGAA CGCTCTGGAA ATATGCGCTG
CGTATGAGTA CCCGAGCCAC ACCATTCGGG ATAATGGCGG GCTGCTCAGT AGGGGCAATA
GCCAAGGCAA CGGAGGTGAC CTTTGGTAAA GAGCTGCGTT TCGTTCCTCA CCATCGACTG
GATATGAGCT GCATTGCCCA GCTGGTAAGG GAACTTACAA ACGACCCTCT TATCCGGCCG
CAGCTACGGT TTTCCCCTAA TAACAGTCGC TACATCGTTG GTGATGAATT GCGCTACGCC
GAACACGACG ATAACGGACA GGAGCGGCAC TATTTTACGG CTTCTGTACT AATTTCTACT
CCATTGGAAG CCGTGCTACA ACGGGCACAA AGTGGGGCGA CATCTGCTCA ATTAGCCGAG
ACATTAGTAG CTATCGGAAT TACGTTGGAT GCTGCCACCA CATTTGTGGA TCAACTGATT
GAACAGCAAT TGCTAATCAG TGAGCTTGAA CCGATGCTGA CCGGACCTCC CATGCTGATA
CAACTGCAAA GCCGGTTACG GGAGTTTACC GGAACCGATG CCATACTGGA GGTTTTGACA
GCTATAGGCT CTCTGCTTGC CTCAACAAAC ACAGACGCTT GCACGACCCA TCAAAGGGTT
TTGTCCACAT TGACCCGTCA TTTTTCTCCC CCCCTATCTC AACATCTGAT CCAGACCGAC
TTATTCGTTA ACACACCTGT CAATCAACTC AACCGGCGTG TCGTGCAAAC CATTCTTCGG
CAAATAGAGC AGTTACTCCC GTTGCACCGC CCCCGTCCAA ATGCACAGTT AGTCAGCTTC
GCCCAGCGGT TTCGAAGTCG GTACGAAGAC CGTACTATAC CGCTGCTGAA GGCGTTGGAT
GCTGAGTTTG GGGTTGGCTA TGGCGACGCT GTCGGCACCG AATCGGATTA TTCTTCTCTA
CTGGATGGCT TATCGTTCGT GGCAGCTTCT GAATCGAACG CGTTCACTTG GGAGGCTTAT
GAGAACCTTT TAGTGGCAAC GTTTAGTCGG TCACTACGGG AGCATCAACT GGCGGTCGAA
TTGACCGACG ACGATTTACA TTCGTTAGGT ACGGCTAATG CACCAACCCT ACCCGATAGT
TTTTATTTAT TCGGCAACCT ACTCGGCACA TCATCCATGG CCGTGGATCA AGGAGATTTT
AAATTTAATC TGCTAGCTGC GCAAGGCCCA TCGGTCGCTA ACTTGTTGGG GCGTTTTTGT
GCACATTCGA GGGTATTGAC GAATCACGTT CGCGCATGCC TGCACCGGGA AGAACAGCAG
CGACCCGATG CTATTTTCGC CGAAATTATT CACTTGCCTG ACGACAGGGT GGGTAATATT
CTCCAGCGGC CCGTCCTTCG CCGGTACGAG ATACCCATCG TTAACCAAGC ATCGGTTGAT
GAATCGGACC AACTGTATCT AAACGATTTA CATGTCAGTG TACGGACAAA TGGCCGAATT
AGTTTGTGGT CAGAACGACA CCAGAAGGAA GTTATTCCCC GGTTGTCTAC GGCCCATAAC
TACCGCTTTG GCCCAAGCAT TTATCAATTT CTGGCCGACT TGCAACACCA GGACAGTTCG
CTGAATATAT ACTGGGATTG GGGGCCACTC CGCGAACAGC CCTTTCTCCC CCGGATTAGT
TATCGGAACG TAATCCTAAG TCGGGCACGG TGGCTCCTGC GTTCGGTATC CCTACCCCTT
GGTTCGACTG ACGCGCTTAA AAGTCACCTT CGCACAACCT ATCAATTACC CCGCTGGATT
GCCGTTGCTG ACGGAGATAA TGAATTAGTC CTTGATCTGG ATACGGATCT GGGGTGGCAA
CTATTGGCCG AAGAAGTCCG GCGGCAACCT ACCGTTCATT TAGTAGAATG GGTAGCGACG
CCAGAGCAGT GCTGGCTTCG CGATGATAAG GGAGCTTACG TGAACGAGAT TGTCATCCCG
TGCCAAACGC TAGCTCCCTC ACCAAGACAT CCTTCTCCCA TTGAGCAGCC CGCGTTTTCG
CGGCTGATAA ACCGGCAAAA TGGAGCCGAC GGGTGCCCTG CCGATGAACA GGTTATGCGG
TCTTTTGATC CCGGTAGTGA ATGGTTATAC GTCAAATTAT ATTCTGGGGT GCAGATTGCC
GATGAACTGC TCAAGAAGAT GATTTTCCCG TTTGTTCAGG AACTACTGAC GGCAGGAACG
ATTCAAAATT GGTTTTTCGT TCGGTATGCT GATCCAGAAA CACACCTTCG TTTACGGCTG
CAATGCACGA TTCCTAAATT CTACGGTACA ATTCTGGACC GGCTGTCGGA GTGGACAGCA
CCGTACCGTA AGTCAGGGGT CATATACCGG GTGCAGATTG ATACGTATGA ACGTGAGCTT
GACCGCTACG GAGCCATTAC TATAACTGAA ACGGAACGCC TGTTCGCGGC CGATAGCTGG
GCCATACTGC GTTACTTAGT GCAAGAGACT AATTCCGCTG ATCGCTGGCG GTTTGCAATG
CAAAGCTGCG ATAGCCTATT AGCTGATTTT AACGTAGACA CAAGGGAAAA AGTCAACTTA
TTGCGACACT TGCAGGAACA GTTCCTGGCT GAACACCAAG CCGACCGAAC CCTGCGGCAA
CAGCTTAACG CCCGCTTCCG GTCGGAACAT GACCGGATCG CCTATGACCT TTCCGGTCAT
CAGTCTACAG GAAACTCTAC GACCAGTATC CTTGTAGAAA GGTCGACCCT TTTACGGCCT
TGCGTTCGGG ACATCACCTC AAAGTGTCCC AGTTCATCAA TACCAGACCT GCTGGCGAGC
TACATGCATA TGAGTTTAAA CCGCTTATTT GTCTCCCAGC AGCGAACGCA GGAGATGGTG
ATCTATCATT TTCTGGCTCG TTATTACGAA TCGCAGCAGG CCCGCAGGTA G
 
Protein sequence
MAIDLANFYL LRVPALAFLS VSTHSEEADA QLKALWEEGT LSEALFVASP SLYQQTIDHL 
RTTGWPVPPA FRRTLWKYAL RMSTRATPFG IMAGCSVGAI AKATEVTFGK ELRFVPHHRL
DMSCIAQLVR ELTNDPLIRP QLRFSPNNSR YIVGDELRYA EHDDNGQERH YFTASVLIST
PLEAVLQRAQ SGATSAQLAE TLVAIGITLD AATTFVDQLI EQQLLISELE PMLTGPPMLI
QLQSRLREFT GTDAILEVLT AIGSLLASTN TDACTTHQRV LSTLTRHFSP PLSQHLIQTD
LFVNTPVNQL NRRVVQTILR QIEQLLPLHR PRPNAQLVSF AQRFRSRYED RTIPLLKALD
AEFGVGYGDA VGTESDYSSL LDGLSFVAAS ESNAFTWEAY ENLLVATFSR SLREHQLAVE
LTDDDLHSLG TANAPTLPDS FYLFGNLLGT SSMAVDQGDF KFNLLAAQGP SVANLLGRFC
AHSRVLTNHV RACLHREEQQ RPDAIFAEII HLPDDRVGNI LQRPVLRRYE IPIVNQASVD
ESDQLYLNDL HVSVRTNGRI SLWSERHQKE VIPRLSTAHN YRFGPSIYQF LADLQHQDSS
LNIYWDWGPL REQPFLPRIS YRNVILSRAR WLLRSVSLPL GSTDALKSHL RTTYQLPRWI
AVADGDNELV LDLDTDLGWQ LLAEEVRRQP TVHLVEWVAT PEQCWLRDDK GAYVNEIVIP
CQTLAPSPRH PSPIEQPAFS RLINRQNGAD GCPADEQVMR SFDPGSEWLY VKLYSGVQIA
DELLKKMIFP FVQELLTAGT IQNWFFVRYA DPETHLRLRL QCTIPKFYGT ILDRLSEWTA
PYRKSGVIYR VQIDTYEREL DRYGAITITE TERLFAADSW AILRYLVQET NSADRWRFAM
QSCDSLLADF NVDTREKVNL LRHLQEQFLA EHQADRTLRQ QLNARFRSEH DRIAYDLSGH
QSTGNSTTSI LVERSTLLRP CVRDITSKCP SSSIPDLLAS YMHMSLNRLF VSQQRTQEMV
IYHFLARYYE SQQARR