Gene Slin_1570 details


Gene Information

Locus tag: Slin_1570
Symbol:
ID: 8725304
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 1891543
End bp: 1894608
Gene Length: 3066 bp
Protein Length: 1021 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: glycosyl hydrolase BNR repeat-containing protein
Protein accession: YP_003386418
Protein GI: 284036488
COG category:
COG ID:
TIGRFAM ID:
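
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1891543
END_BP = 1894608


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the results match the listed Gene Length (3066 bp) and Protein Length (1021 aa).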


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.371729
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTATT TCGCCTTTTA CTTTCTGGCA ATTAGTTGCA GTCTGGTACT CAGCAACCCG 
CTTTGGGCTC AGCCAGCAAC GAAGCCCGAT ACGGCCATCA ACCCCGTTTT TAAAGGCATG
GCCTGGCGCA ACATCGGCCC AACCCGGGGT GGCCGGTCGC TGGGGTCTGC GGGGTCGCCG
AGCCGCAAAC AGGAATATTA TTTCGGAGCC GTTGGCGGTG GCTTATGGAA AACTACCGAT
GGCGGCCAAA GCTGGGCACC CGTTACGGAT GGTCAGTTGA CAAGTTCGTC TGTAGGGGCC
GTTGCGGTTG CTGAATCCAA TCCCGATGTG GTCTATATCG GTACAGGCGA AACACAGCTT
CGCGGTAACA TCATGCAGGG CGATGGAGTG TACAAATCGA CCAATGCCGG GAAAACCTGG
ACCAACATTG GGTTGAGAAA CACCCAGGCC ATTGCCCGTG TCAGGATTCA CCCCACCAAC
CCTGATATTG TTTACGTGGC TGCTCTGGGA CACCCCTACG GCCCTAACGA AGAGCGGGGC
ATTTTCCGGA CCACCGATGG CGGTAAATCC TGGAAGAAAG TCCTCTACAA AAGCGATAAG
GCCGGTGGTA TCGACCTGAT CATCGACCGA ACCAATCCCA ATGTACTCTA TGCCTCTCTC
TGGCAGGTGT ACCGCAAACC CTGGAAAATG TGGGGGGGCG GGGGTGACTC CGGCCTGTTC
AAATCGACGG ACGGGGGCGA AACCTGGACC GAGCTGACCC GTAAACCAGG CATGCCTAAA
GGAACCGTCG GTAAAATTGG CGTGACCGTT TCGCCCGTCG ACCCCAACCG GGTGTGGGCC
ATCGTCGAGG CCGAAGACGG GGGCGTATAC CGCTCCGACG ACGCCGGGAT GACCTGGAAA
CACGTCAACG ACGAGCGCAA GCTTCGCCAG CGGGCGTTTT ATTACTCCCG AATTTATGCT
GATCCCCTCG ACAAAAACGG CGTTTACTGC CTGAATGTCG ACTTTTTCAA ATCGTCGGAT
GGTGGGGTAA AGTTCAATAA ATCGTTAAAA GTACCGCACG GCGATAACCA CGACTTGTGG
ATCGACCCGG CCGATTCGAC CCGAATGATT ACGTCCAATG ACGGCGGTGC GGCTGTTTCG
GTCAACGGCG GTAAAACCTG GACGGATGAA AACTTCCCGA CCGCGCAACT TTATCACATT
ACAGCTACCA ACGATTTCCC CTACCATGTA GCCGGTGCCC AACAGGACAA CACCACCGTA
GCCGTAGCCA GCGAAGGCTG GGGGAATCAG ATGGCCCGCA GTAATTCCAT CAAAAAGAGC
GAATGGACTT ACGAAGTGGG TGGTGGCGAA AGCGGCTACA TTGCCCAGGA CCCCAAGAAC
CCGAACATCT TTTACGCGGG CAGCCAGGGT GCATTACTTA CCCGCTACGA CCGCACGACG
GGCCAAACCC GCGATGTGCA GGTGTACCCG CGTTTCTTCT CCGGCGAACC CGCCAGTGCT
CTGCCCGAAC GCTGGCAGTG GACGTACCCG ATTGTTTTTT CGCCCAAAGA CCCGAACCGG
CTTTACGTTT GTTCACAGCA CGTATGGGTA TCGACCAACG AGGGACAAAG CTGGGATAAA
ATCAGCCCCG ACCTTACCCT GGCCGACACG GCTACGCTGG GGAAAAGCGG TGGTGTCATT
ACGATGGACA TGAACGGCCC GGAGATTTAC GCGACCGTTT TTGCACTGGC TCCCTCCTAC
CACGACGTGA ACACCATCTG GGCGGGCTCC GACGACGGGC TGATTCACAT CACCCGCGAC
CACGGCAAGA GCTGGCAGAA AATCACCCCG CCGGATATGC CCAAACATAC CCGCGTGAGT
ATTATCGAAG CGTCGCGGCA CAAACCCGGC ACGGCCTATG TAGCGGCCAA ACGCTACCAG
ATGGACGACC GCACGCCCTA TCTCTGGAAA ACGGATGATT ACGGGAAGAC CTGGAAAAAG
ATCATTACTG GATTACGTGC GGACGATTAC GCCCATGCCA TTCGCGAAGA CATCACGCGT
CCCGGCCTGC TCTACGCCGG TATGGAACAT GGCGTTTGGG TTTCCTTCAA CGACGGCGAG
AACTGGCAGC CCATGCAACT GAAACTGCCC GATACCCAAA TCTCAGACAT TCAGGTAACG
GAGAAAGACA TTGTCATTGG TACGCACGGC CGGTCGATCT ACGTGCTGGA CGATGTAGCT
CCCGTTCGGG AGTTTACGCC CGATCTGGCC AAGAAGGCTG TTCACCTCTT CAAGCCCTAC
TATGCTGTTC GTCGGGTACA GCCAGCCGTC TTCCAGTACT ATCTGGCGAA GAAAGCGGAC
AGTGTGAAAA TTGAGATTCT TAATGCCGCC GGAACGCTGA TTCAGTCGTT TACGGGCAAC
AAACCCTCTT ACCCTAAAGA TGATGAGGAT GACGACGATT CAGGGAAACC CAAAATCAAA
CTACCCACCA CGGCCGCTGG TCTGAACCGC TACGAGTGGG ACCTGCGCTA CCCCGGTGCC
ACTTATTTCA AAGGGATGAT CATGTGGGGA GCCCGGCCTA CGTCCGGACC ACTGGCGTTG
CCCGGTCAGT ATCAAGTGCG GTTAACCGTA GGCGATCAAA CGTTCACTCA ACCCTTCGAG
ATTAAGCTTG ACCCACGACT GAAGGGCGTC TCCCAAGCCG ATGTGCAGGA GCAATTCAAG
ATGGCGATGA AACTGCGGGA CGAGACGAGC AAAGCCAACG ACGCGGTGAT TCAGATTCGG
GCGGTGAAGG AGAAGCTGGC CAAACAACCC GACAGCCCTA CCAATAAAAA GCTGAAAGAG
CAGTTGAACA TCATCGAAGA AAACCTCTAT CAGATTCGGA ATCAAAGTGG TCAGGACCCG
CTGAACTTCC CGATCAAGCT CAACAACCGG CTGGCGGCTC TCTGGCGCAG CATTGAATCC
GGCGATGCCA AACCGACCAA CGGCTCTTAC AAAGTTTACG AGGAACTCAC CGCCGACCTA
AACAAGCAAC TAGCCGAACT GGATACGCTG CTCAAAACGA AAACGGCAAA AAATATTGGT
ATGTGA
 
Protein sequence
MKYFAFYFLA ISCSLVLSNP LWAQPATKPD TAINPVFKGM AWRNIGPTRG GRSLGSAGSP 
SRKQEYYFGA VGGGLWKTTD GGQSWAPVTD GQLTSSSVGA VAVAESNPDV VYIGTGETQL
RGNIMQGDGV YKSTNAGKTW TNIGLRNTQA IARVRIHPTN PDIVYVAALG HPYGPNEERG
IFRTTDGGKS WKKVLYKSDK AGGIDLIIDR TNPNVLYASL WQVYRKPWKM WGGGGDSGLF
KSTDGGETWT ELTRKPGMPK GTVGKIGVTV SPVDPNRVWA IVEAEDGGVY RSDDAGMTWK
HVNDERKLRQ RAFYYSRIYA DPLDKNGVYC LNVDFFKSSD GGVKFNKSLK VPHGDNHDLW
IDPADSTRMI TSNDGGAAVS VNGGKTWTDE NFPTAQLYHI TATNDFPYHV AGAQQDNTTV
AVASEGWGNQ MARSNSIKKS EWTYEVGGGE SGYIAQDPKN PNIFYAGSQG ALLTRYDRTT
GQTRDVQVYP RFFSGEPASA LPERWQWTYP IVFSPKDPNR LYVCSQHVWV STNEGQSWDK
ISPDLTLADT ATLGKSGGVI TMDMNGPEIY ATVFALAPSY HDVNTIWAGS DDGLIHITRD
HGKSWQKITP PDMPKHTRVS IIEASRHKPG TAYVAAKRYQ MDDRTPYLWK TDDYGKTWKK
IITGLRADDY AHAIREDITR PGLLYAGMEH GVWVSFNDGE NWQPMQLKLP DTQISDIQVT
EKDIVIGTHG RSIYVLDDVA PVREFTPDLA KKAVHLFKPY YAVRRVQPAV FQYYLAKKAD
SVKIEILNAA GTLIQSFTGN KPSYPKDDED DDDSGKPKIK LPTTAAGLNR YEWDLRYPGA
TYFKGMIMWG ARPTSGPLAL PGQYQVRLTV GDQTFTQPFE IKLDPRLKGV SQADVQEQFK
MAMKLRDETS KANDAVIQIR AVKEKLAKQP DSPTNKKLKE QLNIIEENLY QIRNQSGQDP
LNFPIKLNNR LAALWRSIES GDAKPTNGSY KVYEELTADL NKQLAELDTL LKTKTAKNIG
M
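
Both sequences above are laid out in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, with helpers for GC fraction and for checking that the nucleotide length equals three times the protein length plus one stop codon. All function names are hypothetical, and the short strings in the usage lines are only the first and last blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def lengths_consistent(dna: str, protein: str) -> bool:
    """True if the CDS has one codon per residue plus one stop codon."""
    return len(dna) == 3 * (len(protein) + 1)


# Usage with fragments copied from the record (paste the full text to analyse it).
fragment = clean_sequence("ATGAAGTATT TCGCCTTTTA \nATGTGA\n")
print(fragment)
print(round(gc_fraction(fragment), 3))
```

Applied to the full gene and protein sequences, `lengths_consistent` should agree with the listed 3066 bp and 1021 aa, and `gc_fraction` can be compared with the listed GC content.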