Gene Slin_3089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3089 
Symbol 
ID8726842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3747015 
End bp3748577 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content42% 
IMG OID 
Productrestriction modification system DNA specificity domain protein 
Protein accessionYP_003387899 
Protein GI284037969 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.497517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.376247 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTATTG CAGAGTTTTA TAAAAAAGTC GACACATCAG TCGCAGTTGC TAAGATTGTA 
TTCAACTACC TATCAGATCA TAACATCGTT AAGCGCATAG GTAGTATAGC TGAAACAAGC
AGTGGAGGTA CACCTACAAG GGGGAATCCT GAATTTTATA ATGGCACTAT TCCTTGGCTA
AAGTCGGGAG AATTAAATGA CGGATTAATA ACGGAGTGCG AGGAGTATAT AACTGAAAAA
GGATTAAAGA ACTCGTCAGC AAAACTTTTC CCCGAAGGCA CTCTTTTAGT AGCCATGTAT
GGCGCAACGG CAGGTAAGGT AGGTATTTTA AGTTTTGATG CTTCAACTAA CCAAGCAGTA
TGCGCCGTAT TTCCGAAGGC TGATATTGAA CGAGATTTTC TATTCTGGTA TTTTCGTCAA
CAGCGTTTTG ACTTTATTGA AATAAGTAAA GGTGGAGCAC AGCCAAATAT TAGTCAGACT
GTAATCAACA ATGCTGTAAT ACCAATTCCC GAAGTTGCAG TTCAAAAACA AGTTGTTAAA
TTTCTTAATA TACTGGAAAC CGAACAACGC ATTGATAATA ATTTAGTACT GAATGAGGAA
GTTGCGCAAC AAATCGCTCG CTACTTTAAA ATTAGAACCG AAGCCGCAGA GGTTGAAGAC
ATATATATAG AGCAAAAAAA ACTCCTTACT CAATTACGTC AGTCCATTTT GCAGGAGGCC
GTTCAAGGGA AACTGACAAA GAAGTTTAGG GAGACAGAAA AATTAGCACA ACAGGATCAT
GTTCGAGTCC TGGGTTCGAA TCCCAGCCGG ACCGCAACAC CTCAACTCGA AACCGGCGCT
GATTTACTAG CCCGTATCCG CGCTGAAAAA GCCGAACTCA TCCGGCAAGG AAAACTACGC
AAAGAGAAAC CCCTGCCCCC CATTACTGAT GCCGAAAAGC CTTTTGAGTT GCCTGAGGGC
TGGGTTTGGT GCCGGTTGGG AGATGTGTGC GAGAGTTCCT TTTACGGCCC TAGATTTAGC
AATGGCGACT ATATAAAGAA TGGCATCCCA ACTATCAGAA CTACAGACAT GACTGATGAT
GGTAGAATCG TTTTAAAAAA TACCCCAATG GTTAAAGTGT CATCGTCTAA ACTGGAATTA
TACCAAGTAC TTGATGGAGA TTTACTTATA ACTCGTAGCG GCAGTATAGG CATTATGGCG
GTATTCAGAG GTAGTTACAC AGCTATACCG AGTGCTTACC TAATACGATT CAGATTTGTT
TCGAGTATAT TCCCTGAGTA CGTTTTTAGT GTATTAAAAG CGCCTTTCTG GCAAAGGCTA
ATGGGATTAA GCACAACCTC TACTGCTCAA GTTAATATCA ACGCGAGTTC AATCAACAGT
TTCCTTATTC CACTCCCATC CTTCACAGAG CAACAAGCCA TTGTCGCTCA AGTCAAGCAA
TTATTAAACC AGGTGAGCGC GTTGGAAATT GAAAATAAAC AACAACAAGT CGAGGTTAGC
CAACTGATGC AGGTAGTGTT GAGCGAGGCT TTTGCGGGGA AAGAAACGGC GCTATCAGCG
TAG
 
Protein sequence
MTIAEFYKKV DTSVAVAKIV FNYLSDHNIV KRIGSIAETS SGGTPTRGNP EFYNGTIPWL 
KSGELNDGLI TECEEYITEK GLKNSSAKLF PEGTLLVAMY GATAGKVGIL SFDASTNQAV
CAVFPKADIE RDFLFWYFRQ QRFDFIEISK GGAQPNISQT VINNAVIPIP EVAVQKQVVK
FLNILETEQR IDNNLVLNEE VAQQIARYFK IRTEAAEVED IYIEQKKLLT QLRQSILQEA
VQGKLTKKFR ETEKLAQQDH VRVLGSNPSR TATPQLETGA DLLARIRAEK AELIRQGKLR
KEKPLPPITD AEKPFELPEG WVWCRLGDVC ESSFYGPRFS NGDYIKNGIP TIRTTDMTDD
GRIVLKNTPM VKVSSSKLEL YQVLDGDLLI TRSGSIGIMA VFRGSYTAIP SAYLIRFRFV
SSIFPEYVFS VLKAPFWQRL MGLSTTSTAQ VNINASSINS FLIPLPSFTE QQAIVAQVKQ
LLNQVSALEI ENKQQQVEVS QLMQVVLSEA FAGKETALSA