Gene Slin_4089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4089 
Symbol 
ID8727848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4922326 
End bp4924605 
Gene Length2280 bp 
Protein Length759 aa 
Translation table11 
GC content47% 
IMG OID 
Productcapsular exopolysaccharide family 
Protein accessionYP_003388875 
Protein GI284038945 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.286783 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAG GAGAAGTAAG TTACGTATCA TCCGGAGAAA GCAGCTTCAG GCGATTCCTT 
TACAAATACC GGCGTTTCTG GTATCTGTTT GTAATCAGCA TTGGCGCTAG CCTGGTACTG
GCTGTCCTTT ATTTAAAATC GGCTACCCCG CAATACAATG TCAGTATAAG CCTGTTAATC
AAAGATATAG AGAAAGGGCC TGACATTCGG CCGGGTAATC CGATTTTTAA AGAGTTGGAT
ATCTTGAACT CGACAACCAG TATTGAGGAT GAGATCGAAG CGTTAAGATC GGTGACCCTG
ATGCACCGGG TTTTAACGGA ACTTGGCTTG CAGACCAGTT ATTACGTAGC CGATTCGTTC
AAGAAAAGGG AGATTTTTGG CGCCGATCTG CCTATCCGTC TATCCGTTAA AACACTCTCT
CAATCAGCTT ATACAAAGCC GATTGCCATA GTCATTAAAA ACACGCAGGA GTTTCAGCTA
CAGGACGGCC CCCCCCCTGC CAATACTTAC CGGTTCGGTC AGTTGATCCA ACGTCCTTAT
GGCACGTTTA CGGTTCAGGC TAATCCGGAA GCCATGCGGT GGCAGCCCAA AAAAATATTT
ATCTTCTTCA ATAACCTGGA GGATATGGCT GAAAGCTACA GTAAAGCGAC AGCCATTATT
CAGCTTAATA AAAAAGCAAA TGTTTTGAGT GTATACATGC AGAGTGCCGT TCCGGAGAAA
GGAAAGGTTA TTCTGAACAA GCTCATTGAG GTATACAATA AAGAGAACAA GGAAGATCGT
AATATTCTGG CTCTCAATAC GATAAAATTC ATTGAGGAAC GATTAAGAGA CTTAACCGCC
GAATTATCGG ATATAGAGAA GGCGACCGAG GAGTTCAAAC GCCGAAACCA GGTAACCGAT
GTTCGCTCCG AAGCAAATGG GTATCTCGAA GAATCCAGAA TTTATAATAA TCAGCTATCG
GCCAATAAAA TACAACTCGA TATTGCCGAA TCGCTGGAAC GATACCTGGC ACGGCAAAAG
CAAAAATATG AGTTGGTACC CAGTAACCTG ACAATCAACG ACCCAACACT ACAGGACTTT
ATTGGTAAAT TTAACGATCT GCTCCTCCAG CGGGAACGTA TGCTGCGTAC CAGCGAAACA
ACAAACCCAC TGGTCGTACA CATCGATGAG CAGCTGGCCA GCTTCAGACA GTCCATTCTT
GAAAACTTAA AGACCGTAAA ACGGGGCCTG CTCATTACGC AGGGCGACCT GACAGCCAAA
ATCAGTAACT TACAGCAGCA CATCACTCAG GTACCCGATA TTGAGCGCCA GCTCAACGCC
ATAAACCGGC AGGAAGGCGT TAAGCGAAAT CTGTATTCGT TTCTGCTGCA AAAACGGGAA
GAATCGTCGC TGTCGCTGGC AGCTACCCTT TCAAATACCC GCGTCATTGA TCCAGCAACG
GCCTCTAAAA CACCCGTTTC ACCCAAAAAA CCCGTTATTT TTGCCCTGGC ATTTGTACTG
GGGCTGATTT TACCCCTTGC TTTTATTACT GTTCCTGATT TGCTGAGCAA CAAGGTTCGC
CAGCGCAGCG ATGTATCGAC CGCCGTTGCA GTGCCTATTC TGGGCGAGGT AACGCACTAC
AGAAAGAAAG GGATTTTCGT TATATCGCAG GAAACCAGGA AGCCAATAAT CGAGCAGCTT
CGCCTGATAC GAAGCAACCT ACATTTTTCA ACTGCCAACC AGCCGCACCA GGTTATTCTG
GTAACGTCCA GCGTAGCTAA AGAAGGAAAA ACCTTTTTTA GTATCAATCT GGCTCTAAGC
CTCTGCTTTC TGAATAAAAA AGTAGCCCTG CTCGATTTAA ACTTCCGGAA CCCACGCCTT
CTGACCGGTC TGCGGGTGGA GCATGAGGTT GGTCTAACGG ATTACCTGAA CGGTAGCACT
CCCTCCCTGA ACAGCCTGTT GACACCCTTT CCGGGTACAC CCAATCTGTC GGTTATCGGT
ACGGGACCTT TGCCCGCCAA TGCGCCTGAG TTTTTGCTGA ATGCAGGTAT AGGCACATTG
ATCAGCGAGC TGCGGGAACG CTTCGACTAT GTTATTATCG ACTCGGCACC CGTGGGCGAG
GTGGCCGATA CCTTTGCGCT GGCCGATCAT ATCGACACCA CCATTTTTGT TGTGCGTTTC
AACTACACCC CTATAGAACG GCTTGAAAGT ATCCGGGAAG CCCATCTGGA AAACAAATTG
AAACGTCCGC TCATCGTGCT CAACGACGCC CGGAAGGAGA ATAGTTACCG AGTAAAATAG
 
Protein sequence
MTEGEVSYVS SGESSFRRFL YKYRRFWYLF VISIGASLVL AVLYLKSATP QYNVSISLLI 
KDIEKGPDIR PGNPIFKELD ILNSTTSIED EIEALRSVTL MHRVLTELGL QTSYYVADSF
KKREIFGADL PIRLSVKTLS QSAYTKPIAI VIKNTQEFQL QDGPPPANTY RFGQLIQRPY
GTFTVQANPE AMRWQPKKIF IFFNNLEDMA ESYSKATAII QLNKKANVLS VYMQSAVPEK
GKVILNKLIE VYNKENKEDR NILALNTIKF IEERLRDLTA ELSDIEKATE EFKRRNQVTD
VRSEANGYLE ESRIYNNQLS ANKIQLDIAE SLERYLARQK QKYELVPSNL TINDPTLQDF
IGKFNDLLLQ RERMLRTSET TNPLVVHIDE QLASFRQSIL ENLKTVKRGL LITQGDLTAK
ISNLQQHITQ VPDIERQLNA INRQEGVKRN LYSFLLQKRE ESSLSLAATL SNTRVIDPAT
ASKTPVSPKK PVIFALAFVL GLILPLAFIT VPDLLSNKVR QRSDVSTAVA VPILGEVTHY
RKKGIFVISQ ETRKPIIEQL RLIRSNLHFS TANQPHQVIL VTSSVAKEGK TFFSINLALS
LCFLNKKVAL LDLNFRNPRL LTGLRVEHEV GLTDYLNGST PSLNSLLTPF PGTPNLSVIG
TGPLPANAPE FLLNAGIGTL ISELRERFDY VIIDSAPVGE VADTFALADH IDTTIFVVRF
NYTPIERLES IREAHLENKL KRPLIVLNDA RKENSYRVK