Gene Slin_4355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4355 
Symbol 
ID8728115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5282050 
End bp5284353 
Gene Length2304 bp 
Protein Length767 aa 
Translation table11 
GC content53% 
IMG OID 
Productcapsular exopolysaccharide family 
Protein accessionYP_003389135 
Protein GI284039205 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.672208 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.663682 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATGA ATGCAAAACC CAATCAATCG TATGTTTCCT ATCAGGTAGT CGACCCTGGC 
GGCTCCCCGA TCCGAAGCCA TCTGGCGCCT TACCTCCGCT ACTGGCCCTG GTTTCTTTTA
TCGCTGGTGC TCTCTCTGGT TGGTGCTTAT ATCTTCCTGC TCTACAAACA GCCGGTTTAC
CGCATTCAGG CGAGCCTTAT GCTCCAGGAT GAGAAAAAAG GGAGCGGCCA AACGAACCCG
CTGAAAGAGC TGGAAGTGTA CTCGCCCAAG AAAGTAGTTG AAAATGAACT GGAAGTACTG
CGCTCCTCTA CCCTGATGGA ACGGGTTGTC ACCAACCTCC ACCTCGATAC CCGCTACTAC
CGGAAAACGT CTTTTGGCAA GCGGGAAATC TACAACGAAT CGCCGGTTTG GGTACTTGTT
GAAGATGGCG ATGAAGCTTT GTACAAAAAA CCACTTACTC TATCGTTTAT CAATAACACT
TCCGTTCAGA TAAATGACAA GACCTATCCA CTCAACCGAC GGATTGGAAC CCCTTACGGC
CAGCTGCGTA TCCTGACCCG CAAGCCTATC AGCCCCAAAA CCGAACCCCT GATTGTACAG
GCTATGCCCA CCGCAGCCGC CGTTGGCATG TACCTGGACA ATCTGAAGGC CGAACCGACC
AGCAAGACAT CGACCGTTAT TCGACTGACG CTCGAAGATG CCGTTCCTAA AAAAGGCGAA
GCGGTGCTGA ACAGCCTTAT CAACGAATAC AACCAGGCTT CGATCATTGA CAAAAATAAG
GTGGCCGACA ACACGCTCAA ATTTGTGCAG AACCGCCTGG AGATTGTGTC GGGAGAGTTG
GCAGCGGTGG AGAAAAACGT CGAAAATTAT AAATCGACGC TGGGCATTAC CGACCTGAGC
GCGCAGGCGC AGTCGATCAT GCAAACGACC ACCCAGAACG ATGCACAGCT CAACCAGGTA
AACATTCAGT TGGCAGCCTT GCAAGACCTG GATAAGTTTG TTACTACGCA GGCCGACAAA
CGGGGCAGCA CGCCTGCTAC GGTTGGCCTG GGTGACCCCG TCCTGCTCAA CCAGATCGAT
AAACTGTCGC AGCTTGAATT GCAGCGGGAT GAGTTACGGC AGACCAGTGG CGAACAGAAC
CCAAAGCTTC AATCGCTCGA TCAGCAGATA AAAAATACGC AGAACAACAT TGCGCAGAAC
ATCCGCACGA TGAAGTCGAT GCTCAACCGG TCGAAAGAAC AGTATGTAGC AACCAACGCC
CGGCTGGAGG GCGTTATCCG GACGGTTCCC CAAAAGGAAC GGACGCTGCT CGACATTACC
CGTCAGCAGA CCATCAAGAA CAACCTGTAC ACTTACCTGC TTCAGAAACG CGAAGAAATG
GCCGTTACGT TTGCCGCTGC CATTGCCGAC AGCCGCACTA TCGATGCCGC CAAAAGCAGT
TTGGGGCCGG TTAAGCCGGT GGGAGTCGTC ATCTATGCTT TATTCGCGCT GGTGGGGCTG
CTGATTCCAA CGGCTGCCGT AGCGGGTAAA GGGGCGCTGA ACACCAAAGT ACTACGCCGA
AACGATGTAG AGGATGTGAC GCAGGTACCC ATTCTGGGCG AAATCATGAG CAAGCGTAAC
CGCGATGTGC TGATTGTAGC GCCCAACAAC CGCTCGGTCA TTGCCGAACA GATTCGCACC
ATTCGAACCA ATCTCCACAT CGGCAAAAAC GAATCGACCG ATAGTCAGGT TCTGCTCTTT
ACATCCAGCA TCAGTGGCGA AGGCAAGTCG TTCATATCGC TGAATCTGGG TGCCAGCATG
GCGTTGCTGA AACAGCCTAC GGTCATTCTG GAAATGGACA TGCGCCTGCC CCGCCTGCAT
CAGCATTTCG ACATCGACAA CTCGGTTGGC ATCAGCAACT ACCTCAACGG CGAAGCTACC
CTGGCCGACA TTCTGAAACC GGTGCCGGGT TACCCGAATT ACTTTATTGT GCCCAGCGGC
CCCCTGCCGC CCGATCCGTC CGAGCTGCTA AGCGGCCCGA ACCTGAAGCA GCTGATTGGC
GAACTGCGTG AGCGGTTCCG CTACGTCATC ATCGACGCGC CACCCATTGG TATCGTTACA
GATGCGCAGG CAATTGCCCC CTATTCCGAC GCTACCCTGT TTGTTGTCCG CCATGGTGTA
ACGCCCAAGG AAAGCCTCAA GATTCTGGAC ATGCTCCATC GCGAACACCG TTTTCAGAAC
ATGAGTATCA TTCTCAATGC CGTAGGTAGT GGAGATGGTT ACCATTTCAA CAACCGGTAC
AAGAATAGTT ACTCATACCG CTAA
 
Protein sequence
MSMNAKPNQS YVSYQVVDPG GSPIRSHLAP YLRYWPWFLL SLVLSLVGAY IFLLYKQPVY 
RIQASLMLQD EKKGSGQTNP LKELEVYSPK KVVENELEVL RSSTLMERVV TNLHLDTRYY
RKTSFGKREI YNESPVWVLV EDGDEALYKK PLTLSFINNT SVQINDKTYP LNRRIGTPYG
QLRILTRKPI SPKTEPLIVQ AMPTAAAVGM YLDNLKAEPT SKTSTVIRLT LEDAVPKKGE
AVLNSLINEY NQASIIDKNK VADNTLKFVQ NRLEIVSGEL AAVEKNVENY KSTLGITDLS
AQAQSIMQTT TQNDAQLNQV NIQLAALQDL DKFVTTQADK RGSTPATVGL GDPVLLNQID
KLSQLELQRD ELRQTSGEQN PKLQSLDQQI KNTQNNIAQN IRTMKSMLNR SKEQYVATNA
RLEGVIRTVP QKERTLLDIT RQQTIKNNLY TYLLQKREEM AVTFAAAIAD SRTIDAAKSS
LGPVKPVGVV IYALFALVGL LIPTAAVAGK GALNTKVLRR NDVEDVTQVP ILGEIMSKRN
RDVLIVAPNN RSVIAEQIRT IRTNLHIGKN ESTDSQVLLF TSSISGEGKS FISLNLGASM
ALLKQPTVIL EMDMRLPRLH QHFDIDNSVG ISNYLNGEAT LADILKPVPG YPNYFIVPSG
PLPPDPSELL SGPNLKQLIG ELRERFRYVI IDAPPIGIVT DAQAIAPYSD ATLFVVRHGV
TPKESLKILD MLHREHRFQN MSIILNAVGS GDGYHFNNRY KNSYSYR