Gene Slin_4111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4111 
Symbol 
ID8727870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4950224 
End bp4953193 
Gene Length2970 bp 
Protein Length989 aa 
Translation table11 
GC content56% 
IMG OID 
ProductRhs family protein-like protein 
Protein accessionYP_003388897 
Protein GI284038967 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.217652 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0101653 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATCAC CCGCGTTTCG GTCCGGTTTA CGACGGTTAC TTTACCCGGC CATTAACGGT 
GCGTCGCCTG CTGGTGAAGA AAACCCGGAA CAGCTTGCCC TACTGCTGAA TCGACTGGCC
TTAACGCCCG ACGATTGCCA GGTGCTGCCC TGGGAACGAC GGACGTTCAA TGGTCGCTTT
GTCGATGTGG CATCGGGAGA AGTGCTGGTT GAGGAAGTCG ACGTAGAGAT TCCGGGGCCT
GTTCCCTTCC GATGGAAACG GTTTTACCGC TCCCATAAAT CAACAACCGG CTCTTTGGGT
GGCGGGTGGC GTCACGCCTA CGATTTTGCC CTGCTGGAAG ACCAGGTCAG CGGTCGGGTG
GTGGTCCGGT TGCCCGATAA TCGGGGTATA CTGTTTCCAC AACTGGCCAA AGGTGAGCAA
TATCTGAACC GAGCCGAAAA GCTTCAGCTT AACCACGACG ATCACGGCTA CCGGCTCCGG
GGAGCCGACG GGCTGGGCTA CCGTTTTGCC CGAAATCCGG GCGACAGTCT TTATCGGCTG
ATGGCGGTCG ATAGCCCTCA CTTTCCCCAC CGTCTTCAGT TCAGTTATAC CCGTGCGGGA
CATTTACGGC GCATAAGCGA TGATTTTCAG CGCGTGATCG ATGTAACTAC CGATCCGCAG
GGAACTATCA TACGGCTGAC CATCACGGCC CCCGACCAGT TACTTAAAGA GTTTACGTTG
GTAGCCTACG GGTACGACGA AACCCATAAC CTGACCGATG TGCATGTGGC GGGTAGCCGT
ACGGCTCAAT ACAGTTATCG CCAGAATCGA CTCGTCCGGC TAACGGACCG TTTCAACCAA
AATACATTTT TCACCTACGA AAAAATCGAC AATACTTACC GGTGCAACAC GCTGAAATGG
AGCGGAAGCC CGATTTCACT CCGCTTTGAT TACCTGTCCG ATGAAGGGCG AACGCTGGTT
ACGGACTCCG CCGACCGGGT CAGGCAATAC ACGCACGAAG CCGGAGCCGT TCAGCGGTTT
ATCAGCGAAG GGGGGCAACA ACGTGTTTGG CTCTTTAACG AACACGGCGA ACTGGAGAGT
GAGCAGGATA CGCTTGGTAA CACCACGTTC TATACCTACG ACCCAAAAGG CAATAGGATA
GAGATAGATT GGCCAGACGG TGGACAGATG CGGATGCGCT ATAACGACGA CGGCCAGTTA
CTGGAACTGG TAGACCGGGT CAATGGCGTG TGGCTGTGGT CGTATACCGA AACCGGGCAA
CTCCTATCCT GCCTGAATCC GGTTGGTGCC GAAACGACCT TTAGCTATAA CCGGGACGGC
TGGCTTCGCG AACGCAGAAA CTTAAGAGAA GGCTGGACCC GCTGGGCTTA CGATCCGTAC
GGTTACCCGG TAGAGGTAAC AACGGAAACC GGCCGAAAAA CGACATTATC GTTCAACGCC
CTGGGGCAAT TAATAAACTC GAACCCCGAC CCGAGCACGG CAACCAAACC CGTACCCGCA
AACGAGCCGA AAGCCGTTTC TGAATACCAG CCGGTTTACA CAAACGACGG TAAACTCATT
GAGCTTCGGA GCAAAGATAA ACTCAGTTGG CGGTTCGTCC GGGATACCGC CGGGAGAGTG
CGTGACTATT GCCGACCCGA TGGCCGTTCA ACCCGTTTTC ACTACGATGC TGCCGGGCGT
ATGACCGAAG TACTGTTCTC CGACGGAAGC TGGCACCATT ATACCTATCG CCCGGATGGC
TGGCTGATGG AAGCCAGCAC ACCCACCACC CTGGTTCAGT TCGAGCGCGA TCCGCTGGGG
AAAATCATTA CCGAAACAGC GGGTAATACC GTGGTGGAAA CGGTGTACGA CAAGGCCGGT
AACCGAATAA ATCTGCAATC GTCAGCCCAG GCTACTGCGG CCTACACTTA TGACAACCGC
AGCCTACTGA CCCGGCAACA ACACGGACCG TGGCAACTCG AATTTACCCA CGACCGCCAG
GGGCGTTTGG CAGAGTGCCT TATGCCCGGC AGTTTGCGCA GTCGCTGGCA GTATGACCAG
GGGCCATTGC CAACCAGTCA TCAGCTCTTT TGGGGAAGCC GTCTGCAGGC CGCCCGTTCG
CAAACCTACC AATGGCAGCA GAACCAAGTC ACCCGGCTAC AGGATTCGCG TTTCGGCACG
GCCACGCTGC TTTATGATTC AGCCAATGAA CCGATAGAGG CCGTCGGCTC TGCCGGTGCC
GGATGGGTCG ACCGCTGGCT TCCGCAGCGG TCGCGCTATC AGCAGGTGTT ACTGAAGCCA
GCCGCCAGAG CCAGTGAAGT CGGCTGGCAG CTCATTCTGG CGGGACCGGC CCGATTTTAT
TACGATCCTG AGGGGTATCT GCGCGAAAAA CAGATTGCCG GAAAGGTGTG GCAGTTCGTG
TGGCATGAAT CAGGCTGTTT ACATCAGGTC ATCTGTCCGG ATGGCAGCGT GGTTGCGTTC
GAATACGATG CCCTCGGGCG TCGCATCCAG AAAACTGTCA ACGATTACAA GGTCTGCTGG
GCGTGGGACG GCAACCGACT GCTGCACGAA TGGCATGAGC GCACGGGCAG CGAACCGGTT
CAACTTACCT GGTATACGGC GCGGGGAGCC GAAGCCACCA TGCTTCAGGT GGGTAAAAAC
AGCTATAGCG TGGTCTGTAA CTACCTGGGA CAACCCCTGT CCATGCATGA TGAGCAGGGC
GATCCGGTAT GGGAATGGCG CTGGTGTCTG TTCGGCAAAA AGCGCAGCCT AACCGGCCCC
CAACGCTGGC ATACCTTTCT GGGATATGGG CAGTTTGACG ATCAGGAAGC CGGGTTAGTG
TACAACAACT TCAGGTATTT CGACAGTGAA ACCGGGCTGC CTATCAGTCC AGAATATTCC
AGCCCCGCCG GCTGGGTGCG ATCTGGATGG GAGCCTCCTC ATGCACCAGA ATCGTTTCTT
TCCGCTGGCC GATATATTCA AGTGTACTGA
 
Protein sequence
MESPAFRSGL RRLLYPAING ASPAGEENPE QLALLLNRLA LTPDDCQVLP WERRTFNGRF 
VDVASGEVLV EEVDVEIPGP VPFRWKRFYR SHKSTTGSLG GGWRHAYDFA LLEDQVSGRV
VVRLPDNRGI LFPQLAKGEQ YLNRAEKLQL NHDDHGYRLR GADGLGYRFA RNPGDSLYRL
MAVDSPHFPH RLQFSYTRAG HLRRISDDFQ RVIDVTTDPQ GTIIRLTITA PDQLLKEFTL
VAYGYDETHN LTDVHVAGSR TAQYSYRQNR LVRLTDRFNQ NTFFTYEKID NTYRCNTLKW
SGSPISLRFD YLSDEGRTLV TDSADRVRQY THEAGAVQRF ISEGGQQRVW LFNEHGELES
EQDTLGNTTF YTYDPKGNRI EIDWPDGGQM RMRYNDDGQL LELVDRVNGV WLWSYTETGQ
LLSCLNPVGA ETTFSYNRDG WLRERRNLRE GWTRWAYDPY GYPVEVTTET GRKTTLSFNA
LGQLINSNPD PSTATKPVPA NEPKAVSEYQ PVYTNDGKLI ELRSKDKLSW RFVRDTAGRV
RDYCRPDGRS TRFHYDAAGR MTEVLFSDGS WHHYTYRPDG WLMEASTPTT LVQFERDPLG
KIITETAGNT VVETVYDKAG NRINLQSSAQ ATAAYTYDNR SLLTRQQHGP WQLEFTHDRQ
GRLAECLMPG SLRSRWQYDQ GPLPTSHQLF WGSRLQAARS QTYQWQQNQV TRLQDSRFGT
ATLLYDSANE PIEAVGSAGA GWVDRWLPQR SRYQQVLLKP AARASEVGWQ LILAGPARFY
YDPEGYLREK QIAGKVWQFV WHESGCLHQV ICPDGSVVAF EYDALGRRIQ KTVNDYKVCW
AWDGNRLLHE WHERTGSEPV QLTWYTARGA EATMLQVGKN SYSVVCNYLG QPLSMHDEQG
DPVWEWRWCL FGKKRSLTGP QRWHTFLGYG QFDDQEAGLV YNNFRYFDSE TGLPISPEYS
SPAGWVRSGW EPPHAPESFL SAGRYIQVY