Gene Slin_4050 details


Gene Information

Locus tag: Slin_4050
Symbol:
ID: 8727808
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4867228
End bp: 4868589
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 42%
IMG OID:
Product: type III restriction protein res subunit
Protein accession: YP_003388837
Protein GI: 284038907
COG category:
COG ID:
TIGRFAM ID:
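
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4867228, 4868589)
print(length)                     # 1362
print(protein_length_aa(length))  # 453
```

Both results agree with the Gene Length and Protein Length fields listed for Slin_4050.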


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0396258
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGTCCG ATATTGAATT CCCGAAAAGT GAGGAGTATC GGACCGGTAC GGAAAATGAA 
CCACTTTCTT TCTACCTGGA ATCGCTGGTA GAAAGTACCC GTTTAGATTT GCTTTTGGGC
TATTTTAGTT CGTCGGCTAT TAGCGTGTTG GCGGTAGGAT TTGCCAAATT TATCAGTAAT
GGTGGACAGG TACGTTTAAT TATAAATCAT ATACTTTCTG AGCAAGATAA GACAGCCGTA
CTAAATGGGT TAACAGCAAC GGCAGATCAA TACCCCTTTT CGGTTGCTAA CTTCCAGTCG
CTTAGAACCG CATTGGATTC GTATGGACAT CACTTTTTTG AATGCATTGC CTGGCTTATT
GCCTCCAAAA GAATACAAAT CCGGGCTATC AAACCGAGGG GTAAGCGCGG TATTTCGCAT
TATAAATCCG GTATCTTTTA CGATGGGCTA AATAAGGTCA AGTTTAAAGG GTCGTGTAAT
TTTACGGCAT CGGGACTGCT TGAGAATCTT GAAGAATTAG ATATTAAATT GTCCTGGAAA
ACCGATAGTG ATAGCTTTTC TGAGTATGAG TACGAGTATA ATCAACTATT CGCGGGGCAA
ACCGATTATG CCGAAACTAT CCCATTTGAG CAAATTGAGG CTATAATCGT TCGCGATTTT
GGTGGGAAAG ACCTGGATGA GTTATTAGTT GATGAGCAGA AACTAGCCTC GCAGAAGGCA
AAGCAGATCA GAAGTCAGTT GTACCAAAAA GCGGTAGTCA AAATTTTGCA AAAGATAGAA
ACGTACCTGA CCACACCTCG TTTTCCTTAT GAAAGCGGCC CCCGTGATTA TCAAAAAGAA
GCCTATCAAA ATTGGGTCGA TAATAACTAT CAGGGCATCT TTGCTATGGC CACCGGTACG
GGAAAAACAA TTACGTCGCT GAACTGTGTC TTGAATGAGT CCCAAAAAAC AGGACAGTAT
CATACGGTTA TTTTAGTGCC AACAACAGTA TTGGTGGACC AATGGACACA CGAGGCTCGA
AAATTTAACT TTCGTGAAAT TGTTGCCGTT TCCTCCCATT CAAAAGGTTG GCAGACTGAA
TTAGGACGGA TTACAAATCA GCTAAGCTTT GGTATGACTA CATCATTTGT AGTCATTGTA
ACCTACAAGT CATTTACCAA AGCCCAATTT CAGACGTATT TTAAGCGGTT ACCAGCCTCT
ACAATATTAT TAGCCGACGA AGCCCATAAT ATAGCATCGC CTTCGGTAAG CAGACTACTT
GACGGTGTGC ATTTACTAAA GCGTATTGGT TTGTCGGCTA CGCCCAAGCG AGTTTATGAT
CCGGAAGGTT CAGCGCATAT GGAAGCCTTT CTTCATGTGT AA
 
Protein sequence
MLSDIEFPKS EEYRTGTENE PLSFYLESLV ESTRLDLLLG YFSSSAISVL AVGFAKFISN 
GGQVRLIINH ILSEQDKTAV LNGLTATADQ YPFSVANFQS LRTALDSYGH HFFECIAWLI
ASKRIQIRAI KPRGKRGISH YKSGIFYDGL NKVKFKGSCN FTASGLLENL EELDIKLSWK
TDSDSFSEYE YEYNQLFAGQ TDYAETIPFE QIEAIIVRDF GGKDLDELLV DEQKLASQKA
KQIRSQLYQK AVVKILQKIE TYLTTPRFPY ESGPRDYQKE AYQNWVDNNY QGIFAMATGT
GKTITSLNCV LNESQKTGQY HTVILVPTTV LVDQWTHEAR KFNFREIVAV SSHSKGWQTE
LGRITNQLSF GMTTSFVVIV TYKSFTKAQF QTYFKRLPAS TILLADEAHN IASPSVSRLL
DGVHLLKRIG LSATPKRVYD PEGSAHMEAF LHV
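
The sequences above are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before measuring length or base composition. Below is a minimal sketch of how the GC content field could be recomputed from the gene sequence. The helper names are hypothetical, and the formula used (G plus C as a percentage of all bases) is an assumption about how the database derives its value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, as displayed in the record.
fragment = clean_sequence("ATGCTGTCCG ATATTGAATT")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 35.0
```

Passing the complete gene sequence through `clean_sequence` should give a string of 1362 bases, which can then be compared against the Gene Length and GC content fields.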