Gene Slin_6264 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6264 
Symbol 
ID8730048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7597615 
End bp7600665 
Gene Length3051 bp 
Protein Length1016 aa 
Translation table11 
GC content51% 
IMG OID 
Producttype III restriction protein res subunit 
Protein accessionYP_003391022 
Protein GI284041092 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.724284 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTCCC ACACCAACGA ACAAGCACTC GAAGCGGCCA TCGAGAAAAA ACTAACCGGT 
ACTACCCGCG AAGAACTCAG AACACAAGGT ATTACCAGCG CCTGGGCCGA AGCCGCCAGC
CAATACCGGT CGGGTAACGG CTACTGGATG GGTGAAACCA CTGATTTCAA CGCCGAATAC
GCCATCGACA CCCGGAGGTT CTGGCACTTT CTGGAAACCA CACAACCCCT CGAACTGGCC
AAGCTCCAGA CCCAGCATCA ATGGGAGTTG CTGATCCTGC AAAAGGTGGA TCGGCTCATT
AAGAAATATG GCGTACTGCA TCTCTTCCGC AAAGGGCTGG ATATTGACGA TGCTCATTTC
ACCATGCTGT ACGTGCCGCC TTTGGCATCG AGCAGCCAGA GCGTTAAAGA CGCCTTCGAG
CGAAATGAGT TTAGTGTAAC CCGCCAGTTG CGCTATTCGA CCACTAACCC CCGCGAAGAA
GTGGATATGG TGGTGTTCAT CAACGGGCTA CCTGTAGCTA CGATGGAGCT AAAGAACCCC
TGGACCGGCC AAAATGCACG CGTACATGGT ATCAAACAAT ACAAATGGGA TCGGGACGCC
ACGCAACCCC TGTTGCAGTT TGGTCGGTGT GTGGTCCATT TAGCCGTTGA CCCCGATGAG
ATTTTTATGA CCACTCGCTT GAGTGGCAAG GACACGTTCT TTCTGCCCTT CAACAAAGGC
GACCTGAACC ACGGAGCCGG AAACCCGCCC AATTCGTTCG GCCATAAAAC GGCTTACCTC
TGGGAAGACA TCCTTACCCG GCAGAGCCTG AGCACCATCA TTCAGCATTT CGTTACGCTT
GATGGAAAAT CCACTGAACC GCTAGCCAAA CGAACGCTCA TCTTTCCCCG CTATCACCAG
CTGGAGGTCG TACGCCGGTT GTTAAAAAAC GTGAGCCAGA ACGGTGTGGG GCAAACCTAC
CTGATTCAGC ACTCCGCCGG TTCAGGCAAG TCGAACTCCA TAACCTGGGC CGCGTTTCAG
TTAATCGAAG CCTACCCCGA ATCGGCTACC ACGCCGGGCA ACCGGGGCAT CACCCTCCCT
CTGTTCGATT CGGTTATTGT CGTCACCGAC CGCCGATTGC TCGACAAGCA GATCCGCGAG
AACATCAAAG AGTTTTCGCA GGTCAAAAAC ATAGTGGCTC CGGCATTCAG CTCGCAGGAA
TTACGACAGG CGCTGGAGGG CGGCAAGAAG ATCATCATTA CCACCATCCA GAAATTCCCG
TACATCATCG AGGGTATTTC CGACCTAAGC GACAAAAAGT TTGCGGTCAT TATCGACGAA
GCCCACAGTT CCCAAAGCGG CACTGCTCAT GATAGCATGA ACAACGCGAT GGGTAAGACC
GAAGCCAAAG ACGACGATCA GCAGCCCGAC GCGCAGGACC TGATTTTAGC CGCCATGCAA
GCCCGGAAGA TGCGCGGCAA TGCGTCGTAC TTTGCCTTTA CAGCTACGCC CAAACCGAAT
ACGCTGGAGA AGTTCGGTAC CAGGAAAGAC GATGGAACCT TTACCCCCTT CCACTTATAC
TCCATGAAAC AGGCCATTGA AGAAGGCTTT ATTCTGGACG TGCTGGCCAA TTATACAACC
TATCGGAGTT ATTACGAGAT TGAGAAATCC ATTGAAGAAA ACCCCCTTTT CGATACCCAG
AAGGCTCAGA AAAAGCTAAA GATGTACGTG GAGCAGCACC GGCAAACTAT CGGTACCAAA
GCCGAAATCA TGGTGGAGCA CTTTACTACG CAGGTAGTCA ATACCAAAAA GCTCAAGGGC
AAAGCCAAAG GCATGGTGGT TACCCAAAGT ATTGAATCAG CTATTCGGTA CTTCTTCGCC
ATTCGCGCCA TACTGGCCGA CAAGAATGCT TCCTTCAAGG CTGTTATTGC CTTTTCGGGC
AAAAAAACCG TCGACGGAAT CGAACATACC GAAGCCAGCC TGAATGGTTT TGCCGAGAAC
GAAACCAGTG AGAAGTTCGA CAGCGATGAG TATCGCATTT TGGTGGTGGC CAACAAATAC
CTGACGGGTT TCGATCAGCC CAAACTATCG GCCATGTATG TCGATAAAAA GCTACAGGGC
GTACTGGCTG TGCAGGCTCT CTCGCGTTTA AATCGATCCG CGAACAAGCT GGGCAAGAAA
ACGGAGGATT TGTTCATCCT GGACTTCTTC AACTCGGTCG ATGACATAAA AGAAGCTTTT
GACCCGTTCT ACACAGCCAC CTCGCTGAGC CGCGCCACCG ATGTCAATGT ACTTCACGAG
TTGAAAGCGC AACTTGACGA TGTAGGCGTT TATGAATGGG CCGAAGTGGA AGACTTCGTG
GCCAAATACT TCAGTAACGT TGACGCCCAA CTATTGAGCC CTATTATCGA CGTAGCCGCC
GAACGGTTCA GCAACGAGCT AGACCTGGAA GACAACGACA AAGCCGACTT TAAAATCAAG
GCCAAGCAGT TTGTCAAGAT TTACGGCCAG ATGGCGTCTA TCCTGCCCTT CGAGGTGGTG
AATTGGGAAA AGCTATTCTG GTTTTTGAAG TTCCTGATTC CCAAACTGAT TGTCCGGGAC
CCACAGGGCG ATGCGCTGGA CGAGTTGCTT CAATCCGTCG ATCTGTCAAC CTACGGCCTG
GAACGCACAC GGCTGGGCCA TACCATCACC TTGGACGATA CCGAAACCGA AGTAGACCCG
CAAAACCCAA ATCCACGTGG AGCCCACGGC GCAGATGAAG ACCAGGACCC TCTGGAGGAG
ATTATAAGAA GCTTCAACGA GCGGAATTTT CAGGGATGGA ATGCTACGCC CGAAGAGCAA
CGCGTGAAAT TTGTGAGTGT TATCAAGTAC ATGCAGCAAC AGCCCACCTT TGAAACGCAC
GTTTTGAACA ATCCAGACGT ACAGAACCGG GAACTTGCCT TGCAAAAGTT ATTCAATGAT
GCCGTCAACC AACAGCGAAA ACTGGACATT GAGTTTTACA AGCTTTACAC CCAAGATCCA
GCCTTCAAAC AAGCCTGGCA AGACAGCGGC CGGCGTATAT TGGGAGTGTA A
 
Protein sequence
MPSHTNEQAL EAAIEKKLTG TTREELRTQG ITSAWAEAAS QYRSGNGYWM GETTDFNAEY 
AIDTRRFWHF LETTQPLELA KLQTQHQWEL LILQKVDRLI KKYGVLHLFR KGLDIDDAHF
TMLYVPPLAS SSQSVKDAFE RNEFSVTRQL RYSTTNPREE VDMVVFINGL PVATMELKNP
WTGQNARVHG IKQYKWDRDA TQPLLQFGRC VVHLAVDPDE IFMTTRLSGK DTFFLPFNKG
DLNHGAGNPP NSFGHKTAYL WEDILTRQSL STIIQHFVTL DGKSTEPLAK RTLIFPRYHQ
LEVVRRLLKN VSQNGVGQTY LIQHSAGSGK SNSITWAAFQ LIEAYPESAT TPGNRGITLP
LFDSVIVVTD RRLLDKQIRE NIKEFSQVKN IVAPAFSSQE LRQALEGGKK IIITTIQKFP
YIIEGISDLS DKKFAVIIDE AHSSQSGTAH DSMNNAMGKT EAKDDDQQPD AQDLILAAMQ
ARKMRGNASY FAFTATPKPN TLEKFGTRKD DGTFTPFHLY SMKQAIEEGF ILDVLANYTT
YRSYYEIEKS IEENPLFDTQ KAQKKLKMYV EQHRQTIGTK AEIMVEHFTT QVVNTKKLKG
KAKGMVVTQS IESAIRYFFA IRAILADKNA SFKAVIAFSG KKTVDGIEHT EASLNGFAEN
ETSEKFDSDE YRILVVANKY LTGFDQPKLS AMYVDKKLQG VLAVQALSRL NRSANKLGKK
TEDLFILDFF NSVDDIKEAF DPFYTATSLS RATDVNVLHE LKAQLDDVGV YEWAEVEDFV
AKYFSNVDAQ LLSPIIDVAA ERFSNELDLE DNDKADFKIK AKQFVKIYGQ MASILPFEVV
NWEKLFWFLK FLIPKLIVRD PQGDALDELL QSVDLSTYGL ERTRLGHTIT LDDTETEVDP
QNPNPRGAHG ADEDQDPLEE IIRSFNERNF QGWNATPEEQ RVKFVSVIKY MQQQPTFETH
VLNNPDVQNR ELALQKLFND AVNQQRKLDI EFYKLYTQDP AFKQAWQDSG RRILGV