Gene Slin_2131 details

Gene Information

Locus tag: Slin_2131
Symbol:
ID: 8725869
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2584570
End bp: 2587611
Gene Length: 3042 bp
Protein Length: 1013 aa
Translation table: 11
GC content: 46%
IMG OID:
Product: Lantibiotic dehydratase domain protein
Protein accession: YP_003386960
Protein GI: 284037030
COG category:
COG ID:
TIGRFAM ID:
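
The coordinate and length fields above are consistent with one another, which a short check makes explicit. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2584570
end_bp = 2587611
listed_gene_length_bp = 3042
listed_protein_length_aa = 1013

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, leftover_bases, protein_length_aa)  # prints: 3042 0 1013
```

Under these assumptions the computed values agree with the listed gene length (3042 bp) and protein length (1013 aa).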


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0879912
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.662037
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCAGC TTGCTTTTTA TTACCTGCGT ACGCCTTTAA TTTCTTTCTC TCAACTGCTG 
GATCTGCTCG ACAGGGGAGA TTTAGTGACC TTTGTCCAAC AACCCGTGGT GGGCGAAGCC
ATCTTCTTAG CTTCGCCCAC GCTTTATGAA CAGATTCAAT CCGATCCGCA GTTACGAAAT
GATAAATTAC GGCAAACTGT CCTCAAGTAT GTGTTACGAA TGGGCAGTCG GTCGACGCCC
TTTGGCCTGT TTGCTGGGTG TTGGTTAGGT CATGTAGGAC CAGTAACCAG CTTGGAGGTA
TTGGACCGTC AGGTAAACTG TCATTACCGC TTAAGTGCAC CGATACTGGG TGAGTTGTTA
AACTATTTGA ATACGAACTC TTCGCTTCGG CATTCCCTTA GGTATCAGGT GAACACGAGC
CTGTACCCGG TTGGTAAGGT ATATCGCTAT CTGGAAGTTG ATGCGTCCGA TTCCCCGGCC
CGTTTCTTTA CCAGCCAGCT ACAGGGCAAC CCCGCCTTAC GCTGTGTGCT TCGACTGGCC
CGTCAGTCAA CTCCCTTTCA GACTTACGTA AAGGAATTAA CCCAACTGGG GTATGAGGCA
TGCGAAAGCC AGGCTTTTGT TGATCAGTTG ATTGCTGACC AGGTTCTGGT TAGTGAGTTG
TCACTGAATG TTACCGGGGC TGATGCACTC GATCGACTCA ACGAGGTTCT TCAACATCTT
CCAGAAGGAA AAACCATTTG GGCAAGCCTC AATCAACTAA GCAGCCTACT TGCTCAAAGG
GCCTGTCTTA AAACGAAGAA TGCGGCCATT CGTACCTTGT TTCAAGAGCA GTTTGGACTC
GTGCTGCCCC CGATACCCCT CCTACAAGGT GATACCCTGT TCACTGGTCA GGGGCATCAG
TTGAGCGCTT CCCTGCTGCG CGGTCTTCAA AAAAGTCTAC AAAACCTATT TTGCCTTTAC
GCGGCATTAC GCCCAGCAGA GGCCCTTGTT TCTTTCAAAA AAGCTTTTTA CGCCCGGTAT
GGAGACCAGG AAATTCCTTT ATCCGTAGCC TTGGATGCCG AGTCGGGCAT TGGCTACGGA
AACCAGCCCG TAGCCGGTGA TACACCCATT ATGGAAACGC TAATGGCAGC CAGCTTATTG
GGTTTAAACC AGCAGGAATC GACCCCGCTT AATCCTGCTT GGGACAAATG GCTACTAAAA
CGATACGAGA GTTGGCAAAC GCAGAGCAAG CCTGTTCTTG AGTTGACCAA TGCTGACCTA
ACTGGCTGGT CCGATTCTCC CATCTCAATT CCGGAAAGCT ATTATGTACT GGGCTACTTT
CTGGCAGCTT CGTCAAAAGA CCTAGATACA GGTCGCTATA AATTTCGAAG TAAAGTTATG
GCAGGCCCTT CTGCTTTTTC GTTGCTGGGT CGTTTCTGCA GGGCCGATAA AGAGTTAAAT
AAACAGGTTC AAGGGGCTTA TCAGCAACTA CAAAAACAAA ACACTGACCG CATTTATGCA
GAAATTGTGC ACTTACCATC AATTGCTGTG GGCAATGTCG TGCAACGTCC TCACTTGAGT
GAGTATGAGA TTCCTTATCT GGGCTTGTCG ACTTTACCTC CCGAAAAGCA AATACCAATC
ACTGATCTAT GGGTTTCCGT ACCCAACGGC GAGCGGGTCA TTCTCCGCTC CAGGCGCTTA
AATAAGCAAA TTGTACCCCG CCTTACTACG GCCCATAATT ACCTGTCCGG CTTGCCCACG
TACCGATTTC TTTGTGATCT CCAACAGCAG GAAAGCCCGC TGATGGTACA GTGGCCCTGG
GGCAGTTTGA GTACCTTTCG TTACTTACCA AGGGTTGAGT ATCGCCAAAT TATTTTGCAG
GAAGCACGGT GGCGACTCGA TCAGGCAGAC ATTGACAACT CAATAACCGA TGCTGAAAAT
GTGGCCCGCT GGCGCCAGTT ATGGCAATGG CCTCGCTTTG TCGCCTTAAT TCAGGCCGAT
CAGGAGTTGT TTTTGGATAT GGATAATTCG GACTGCCAAA AACTACTGGT CAGTACGCTT
CGTCGATTAA CCCCCCTATA CGTATTCGAA TGGCTGCAAA CGCCCGATCA GTGTTGTGTG
CATGGACCTC AGGGCCCATT AACTCATGAA GTCATTCTAC CCTTTGTCCA ACGCAGATCC
TCTCTCGCAT CCCCTCCCGT CAAAATTAGT AACCTTTCTA TAGAGCGAAC GTTTATACCC
GGTTCGGATT GGCTTTTTCT TAAAGTTTAT GGCGGACCGC AGGTGTGTAA ACAATTGCTA
ATAAAACTAG GCAAACTTGC CCGCTCATCG ATTCGAGCGG GTACAATCAG TCATTGGTTT
TTTATACGAT ACGAAGATCC TGAACCCCAT CTTCGATTTC GTTTCCATTT AACGGATAAG
AACAACTATA CAAGTCTTCT TTCGGCCTGT CAACGGCAGT TGCAAGTCTG GATCAATACT
GGTGAAATAC ACCGAGTTCA GTTAGATACA TACCAGCGAG AACTGGAACG CTACGGTGTG
GAACGTATAG AAAACACAGA GTGGATTTTT TGGACTGATA GTGATGCTGT TTTAACCATA
CGTCAAAACG AGGATGCTGA GCAAGCCCTA CTTGGGGCTG CACTTTTAGG GACTGACCGC
TATCTTACAG ATTTTGGTTT AAATCTTGTA GATAAAAGTA CTTTTTGTCT AAAGGGGTTT
CAGGCCTTAT TTAACCAAGA AGGTGCACAA TCCAGTTTAA AAAAACAATT GGCTAATCTT
TATCGGCAAA ACCAGTCAAT ATTGATCAAG CTAATGACTA AGGCAGAATC ACCAAGCCTG
ACAGATGAAC TTCACCATTT AATATTTTAT AATCGTAGTC ACCGGGCTAA GCCCTTTATG
CTGTCTCCCA TTTTTACAAA CGATCAATTG CAACCTTATA TCGCGAGTTT GATACACTTG
TTTATAAACC GATTATTCGA TCAACACCAA CGAAGTTACG AAGTACTTGT TTATCATCAT
CTAGCTCGTT TCTATAAATC CCAACTAGCG CAAAACAAAT AA
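
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be joined before any computation such as the GC content reported in the record. The following is a minimal sketch of one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the record does not say how its 46% figure was computed or rounded.

```python
def clean_sequence(text):
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no argument splits on any run of whitespace,
    # so spaces between blocks and line breaks are both removed.
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc_count = seq.count("G") + seq.count("C")
    return gc_count / len(seq)


# Example input: the first displayed line of the gene sequence.
first_line = "ATGCAGCAGC TTGCTTTTTA TTACCTGCGT ACGCCTTTAA TTTCTTTCTC TCAACTGCTG"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Passing the full gene sequence text instead of `first_line` would give a value to compare against the listed GC content.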
 
Protein sequence
MQQLAFYYLR TPLISFSQLL DLLDRGDLVT FVQQPVVGEA IFLASPTLYE QIQSDPQLRN 
DKLRQTVLKY VLRMGSRSTP FGLFAGCWLG HVGPVTSLEV LDRQVNCHYR LSAPILGELL
NYLNTNSSLR HSLRYQVNTS LYPVGKVYRY LEVDASDSPA RFFTSQLQGN PALRCVLRLA
RQSTPFQTYV KELTQLGYEA CESQAFVDQL IADQVLVSEL SLNVTGADAL DRLNEVLQHL
PEGKTIWASL NQLSSLLAQR ACLKTKNAAI RTLFQEQFGL VLPPIPLLQG DTLFTGQGHQ
LSASLLRGLQ KSLQNLFCLY AALRPAEALV SFKKAFYARY GDQEIPLSVA LDAESGIGYG
NQPVAGDTPI METLMAASLL GLNQQESTPL NPAWDKWLLK RYESWQTQSK PVLELTNADL
TGWSDSPISI PESYYVLGYF LAASSKDLDT GRYKFRSKVM AGPSAFSLLG RFCRADKELN
KQVQGAYQQL QKQNTDRIYA EIVHLPSIAV GNVVQRPHLS EYEIPYLGLS TLPPEKQIPI
TDLWVSVPNG ERVILRSRRL NKQIVPRLTT AHNYLSGLPT YRFLCDLQQQ ESPLMVQWPW
GSLSTFRYLP RVEYRQIILQ EARWRLDQAD IDNSITDAEN VARWRQLWQW PRFVALIQAD
QELFLDMDNS DCQKLLVSTL RRLTPLYVFE WLQTPDQCCV HGPQGPLTHE VILPFVQRRS
SLASPPVKIS NLSIERTFIP GSDWLFLKVY GGPQVCKQLL IKLGKLARSS IRAGTISHWF
FIRYEDPEPH LRFRFHLTDK NNYTSLLSAC QRQLQVWINT GEIHRVQLDT YQRELERYGV
ERIENTEWIF WTDSDAVLTI RQNEDAEQAL LGAALLGTDR YLTDFGLNLV DKSTFCLKGF
QALFNQEGAQ SSLKKQLANL YRQNQSILIK LMTKAESPSL TDELHHLIFY NRSHRAKPFM
LSPIFTNDQL QPYIASLIHL FINRLFDQHQ RSYEVLVYHH LARFYKSQLA QNK
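
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is an illustration rather than the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons), and `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G,
# third position varying fastest). Table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First displayed line of the gene sequence (60 bases, 20 codons).
first_line = "ATGCAGCAGC TTGCTTTTTA TTACCTGCGT ACGCCTTTAA TTTCTTTCTC TCAACTGCTG"
print(translate(first_line))  # prints: MQQLAFYYLRTPLISFSQLL
```

The printed residues match the first 20 residues of the protein sequence above (MQQLAFYYLR TPLISFSQLL).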