Gene Slin_3039 details

Gene Information

Locus tag: Slin_3039
Symbol:
ID: 8726791
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 3688804
End bp: 3690408
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 49%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003387849
Protein GI: 284037919
COG category:
COG ID:
TIGRFAM ID:
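
The start, end, gene length and protein length fields above are related by simple arithmetic. The following minimal sketch checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3688804
end_bp = 3690408
reported_gene_length = 1605      # bp
reported_protein_length = 534    # aa

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1605 0 534
```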


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.117919
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATGA AAAAGATATC TTTCCTGTTA CTCACGGGGC TTTTGGTCGG AAACATCTCC 
TGTAAAGAGG CCGAATTCAC CGATGCTTAC CCAAACCCTG CCAAGATTGC GGAGACAACC
GTTGAAAAGC AATTTACCGG TGTTTTGTAT GCCAATAAAG ATTACGTGCT TCCAGGCTAT
CGGCATTATT TTGTGACCCT GAGAACTTCC CTAAACCACT ATAACCAGGC TACCGGCTGG
ATCAATTCAT CCGGACAATA TATACCGGGT TCGTCGGGGG TAGAAGATGT GTGGTACAAC
TACTACAACA TGCTGGCTCA GTACCGCGAA CTGCAAAAAG TGTACGCATC GAGACCCGCC
GATCAGCAGG CCGACCGCAA GATTTTCATG CTGGCGGCAG CGGTATATGT GTATGATTAC
ACGCAGAAGA TGGTCGACGT TCTGGGCGCA ATTCCGTTTA GTGCAGCCGG GATGCTGAGC
AGCAACGGGG GCGACTACCT TGCATCGAGC GCAAAATTCG ATACGGCCGA GGGAATATAC
ACGTTTCTGC TGGACGACCT GAAGCGGATT TCAACCGAGC TGAATGGGAT TACATTGAAT
GCTGGCTATC AGAAATCGTT TCAGACGCAG GATTTCGTTA ACAAGGGTGA TGTAACTGCC
TGGAAACGGT ACTGTAATTC GCTTCGGCTT CGGATGTTGA ACCGAGTTTC GGATGTGGAG
AGCTTTAAAT CAAGAGCAAA CACCGAGATG GCTGAAATTC TGGGTAATGC AACAACGTAT
CCAATTGTTG AAACCAACGC TCAGAATATT CAGGTCAATG TCTTCAATAT CAATACAGAC
ATTAACTCGA ACGGATTCCG GGATGGCCTG GAGTCGGCAG GCTGGTTTGG CAACACGGCT
GGTAAGGCGA TGATCGACAA CATGAATGCA AATAACGACC CTCGGTTACC GCTGCTTTTT
GAGCCAGGAG CGAACGCCGG TGGTAAGTAC ATTGGCATTG ACCCTACCGC TACGGAAGGT
GCCCAGACAA CGCTCTTCAA TGCTGGCCAG GTGGCGATCT ATAACCGCTA TACGTTGAGC
CGGAATCAGT TCTTTCCTGG TATTCTGATC AATGCCCCGC AAATGAATCT GATTAAAGCC
GAGTATTACC TGAGAACAGG CAATGATGCC TCGGCAAAAA CCGCCTACGA AACCGCTATT
ACGCAGTCTG TCCAATTCTA TAACGGTATT TTGGCACTGA CCAACGCCAC GGGTATTACG
GATTCTCCGA TACCTACAGC AGCAACGGCA GCTTCCATCA GCTCTTACAT CGCTGCTCCG
GCCGTAAGCT GGAGTAGTGC GGCAACCAAC ACCGACAAGC TGAAGCTGAT TGCTACCCAG
AAATGGCTGC ACTACAACTT GGTTCAGCCG TACGAAAACT GGGCGGATAT CCGTCGACTG
GACTACCCGG TACTGAGCTT CCAGGTTGAT AACGCAAATA ACCAGACATT GCCCCCTGTT
CGGTGGACAA TTCCGGGTAA TGAAATCACC TACAATGCCG CCAATTATGC TGCAGTGAAA
GCAACGGATA ACCTGAAAAC CAAGATTTTC TGGGATGTGA AGTAA
 
Protein sequence
MKMKKISFLL LTGLLVGNIS CKEAEFTDAY PNPAKIAETT VEKQFTGVLY ANKDYVLPGY 
RHYFVTLRTS LNHYNQATGW INSSGQYIPG SSGVEDVWYN YYNMLAQYRE LQKVYASRPA
DQQADRKIFM LAAAVYVYDY TQKMVDVLGA IPFSAAGMLS SNGGDYLASS AKFDTAEGIY
TFLLDDLKRI STELNGITLN AGYQKSFQTQ DFVNKGDVTA WKRYCNSLRL RMLNRVSDVE
SFKSRANTEM AEILGNATTY PIVETNAQNI QVNVFNINTD INSNGFRDGL ESAGWFGNTA
GKAMIDNMNA NNDPRLPLLF EPGANAGGKY IGIDPTATEG AQTTLFNAGQ VAIYNRYTLS
RNQFFPGILI NAPQMNLIKA EYYLRTGNDA SAKTAYETAI TQSVQFYNGI LALTNATGIT
DSPIPTAATA ASISSYIAAP AVSWSSAATN TDKLKLIATQ KWLHYNLVQP YENWADIRRL
DYPVLSFQVD NANNQTLPPV RWTIPGNEIT YNAANYAAVK ATDNLKTKIF WDVK