Gene Slin_3915 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3915 
Symbol 
ID8727673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4694291 
End bp4697593 
Gene Length3303 bp 
Protein Length1100 aa 
Translation table11 
GC content52% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003388704 
Protein GI284038774 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.341556 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.634334 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA TCTACCTATT ATTTTTTTTG GGCAGTCTAA TTGGCAATGT TAACGGCCAA 
AGCGGAGCCC CTGACCCGGC TTTCGGCAAG AATGGGCTTC AAACGACCGC TGTGCCGATC
AATGTTCTAC AGGAGGAGGC TAGAAAAGTA CTGCCACTAC CCAGTGGAAT GTATGTTGTT
ACGCAGATCA ACAGCACGGC CGTTGTGGCC AGGTACCTCA ATAATGGCGC AATTGATACC
TCGTATGGTC AGAATGGCTA CTCGGCTCAG GTAGACGGTG TGGCAGTTGA TGCCGCCCTA
CAGAGCGATG GCAAAGTTGT TATAATAACG GGCTCGTCGG TAGTGCGTTT CACTACCAAC
GGCTTGCTGG ACAATACCTT TAGCGACGAT GGCATCCAAA CCGTTAATTC GTCTTCTATA
TCGATACGGT CCGTTGGTGT GCAGGATGGC AAAATCGTTG TGGCTGGTAG CCGTACGGTA
TTAGCTGAGG CCGATATTGC GGTACTTCGC TATAAGGCTG ATGGATCGCT GGATACCAGT
TTTAGTGATG ATGGCCTGCA AACGACCAAT TTTGGCCAGA ATACCCAGGA TAACGGCCTG
GCACTTGCCA TACAGGGCGA TAAAATCGTG GTTGCGGGCA CCAGCAGTGG GTACCTGGCT
ATTGTCCGCT ACAATTCGGA TGGTTCGCTG GATACCAGTT TCAGTCAGGA TGGTAAGGTG
ACGTACGATA ATGTGGCGTT TTCACCAGCC CTGTCGGTAG CCATTCAGGG AAACAAAATC
CTGGTACCCG GTGGCCTGAA CAACGATTTC GCGGTGGTTC GTTATAATTC GGACGGCACG
CTGGATACAA CGTTTGGTAA ATCTGGTGTG CTGACGACAG ATGTTGGCGA TCAGTTCAAC
GAGTCGGCGC GGACAGTTAC GATTCAGGGC GACAAGTTTG TGCTGACGGG CAATACCGCC
TACGATCTGG CGGCTGTACG GTACAACAGC GATGGCTCGC TGGATACTGG TTTTGGAGCA
AATGGTAAAG TGATCACCCA TTTCGATGGT GTTGATTTCG TCAGGGCGAC ATCGGCCGCA
CTCCAGGGCG ACAAACTTCT GATAGCCGGA TTCACCTACA CCTTTGCCCG GGCGAGCGAC
GAGGATGTGG CGCTGGTGCG GTATAACGCC AATGGCTCGC CCGACACCAG CTTCAGCGAC
GACGGTAAGC TCACGAGCTT TTTCCCATAT GCCCGTACGT TTTTCACTAG TTCCGTTGTT
CAGCCCGATG GGAAAGTCGT AGCCGCCGGA TACACGTTTA TGTACCTGCC CGGCTCCCTG
AGCGATGTCT TCGTTGTGGT TCGGTACAAT ACCGATGGGA CGCTCGATAA AACGTTTAGT
GATGACGGTG TACAAACGAC CGATTTTGGA TCAGGTTCGC AATCCGTAGC GAATGCGGTT
ATGCTTCAGG GGAACAAGAT CGTGCTGGCG GGATATTCCT ATAACTCGGA TAACAACAAT
AACACGGATT TTGCGGTGGC GCGTTACAAT GCCGATGGTA CGCCGGACAA TACCTTCAGT
GGCGATGGTA AACAATTGAC CGATTTAGGG CCTGCATCCA ATGATTTTGG GCAGCTTGTC
ATCGGTCAGG GCGATAAGAT CGTTGTGGGG GGGTATTCCT ACAACTATGA AACGGGCATG
AGCGCGACGC TGGCTCGCTA TAATGCAGAT GGCTCGCTGG ACAACTCTTT TAGTAATGAT
GGTATACAAA CAACCAGCTT TGGCGACGCA GGGATAACCA CCGGCGCTCT GGCACTTCAG
GGCGACAAAC TGCTTTTGGC GGGTACCAGC TATAATTCGC AAACGGGTAC GCTCGGATTT
CTGGTGGCCC GCTACAATGC AGATGGCTCA CTTGATACGA GTTTTGGCGA GACAGGTGTT
GTTACCACGG ACTTTGGCTC CGGCTCCAGT GTATTTGCCA AGTTCATCCG GGTTCAGAAT
GACCGGATTC TCGTTGTCGG GTATCAGAGT AACGAAACGC AGGTAGCAAC GGCCCGCTAT
AAACTGGATG GAACGCTGGA TACCAGCTTC GGTCAGAATG GAAAACGACT AAGTCCGTTT
CCGAACGGTT CGACAACCTA TTTAACGACT GTAGTGGGGC AGGACGACGG GAAATTTATC
GTGGCTGGCA ATACAACGGA CCCGGCTACG TACCGGCAAA CGTTTGCGCT GGCCCGTTAC
AACCCCGATG GTTCGTTGGA CACTGGCTTT GCCACGAGTA CAATCCCTGC CGACAACAGC
ACCACGTATT ATGTACAGGC CCTCAGCACC CGGGGTAATC GGCTTTATGC GCTTGGCTAC
AGGCAATCGT CCTTTTTCAA TGGCTTTGGA CTTGGCCCTA TGAGTGAAGG CGTTGTGGCT
GCTTATAAAC TCGAAACGAG TGTAAACTTG TCCTGTCCGG TCAGTAAAAC GGTTGTGGCT
GATCAGGGTA TTTGTGGCGC CGTCGTAAAA GATATAGATC CGATAGCCGC TAGTTCACCC
ACCAAATACA CCTTATCGGG TGCGACAACG GGTAGCGATA TGGGTAGTGT CAGCGGCAAA
CTATTCGGTA TTGGTACTAC GGTCGTAACC TATACGCAGG CCAGTGAGCC CACCAAAACC
TGTTCGTTTA CGATAACGGT TGTTGACCGG CAGTTACCCA CCATAACGGG TCTTTCGGTT
AGCCCGAACA GATTATGGCC CCCCAATCAC AAGATGGTGG ATGTGACGCT CACATACAAT
GTGCTGGATA ACTGCAAAGC AACCTCCGTC GTGTCGGTAT CCAGTAATGA ACCCCAAACC
GGTCCCGATG ACAATACGCC CGACGACTGG CAGATTATCG ATGCCAATCA TTTGAAACTT
CGGGCCGAGC GAACCGGCAG CGGAAATGGC CGAATCTATA CCATCACCGT CACGGCTACG
GACCCATCTG GTAACAAGGC AACACAAGTG ACGCAGGTGA CCGTACCCAA AAATAACTCG
GGTCGGGCAG GAGCCGACGA GCTAAGCTTA TCGGAAGAAG GCGGTGAAGT CGACGGTCTG
GCGGTAAAGG TGATGCCTAA TCCATCGTCG GGGTATTTTA CTATACTCAC CAAAAGTGCT
ACGGCCAGCG TCCTGACCAT GCGGGTGTCT GACCTTCAGG GACGGCCAAT GCCCGGTTTA
GACAACGTAC CGGCCAACGG AACCCTGCAA TTGGGCCATA CCTATGCACC GGGTGTTTAT
ATCCTCACGG TGATTGACGG TCCGCGAAAA GTAACGGTTA AGCTGCTGAA GCTAGCAGAA
TAG
 
Protein sequence
MKKIYLLFFL GSLIGNVNGQ SGAPDPAFGK NGLQTTAVPI NVLQEEARKV LPLPSGMYVV 
TQINSTAVVA RYLNNGAIDT SYGQNGYSAQ VDGVAVDAAL QSDGKVVIIT GSSVVRFTTN
GLLDNTFSDD GIQTVNSSSI SIRSVGVQDG KIVVAGSRTV LAEADIAVLR YKADGSLDTS
FSDDGLQTTN FGQNTQDNGL ALAIQGDKIV VAGTSSGYLA IVRYNSDGSL DTSFSQDGKV
TYDNVAFSPA LSVAIQGNKI LVPGGLNNDF AVVRYNSDGT LDTTFGKSGV LTTDVGDQFN
ESARTVTIQG DKFVLTGNTA YDLAAVRYNS DGSLDTGFGA NGKVITHFDG VDFVRATSAA
LQGDKLLIAG FTYTFARASD EDVALVRYNA NGSPDTSFSD DGKLTSFFPY ARTFFTSSVV
QPDGKVVAAG YTFMYLPGSL SDVFVVVRYN TDGTLDKTFS DDGVQTTDFG SGSQSVANAV
MLQGNKIVLA GYSYNSDNNN NTDFAVARYN ADGTPDNTFS GDGKQLTDLG PASNDFGQLV
IGQGDKIVVG GYSYNYETGM SATLARYNAD GSLDNSFSND GIQTTSFGDA GITTGALALQ
GDKLLLAGTS YNSQTGTLGF LVARYNADGS LDTSFGETGV VTTDFGSGSS VFAKFIRVQN
DRILVVGYQS NETQVATARY KLDGTLDTSF GQNGKRLSPF PNGSTTYLTT VVGQDDGKFI
VAGNTTDPAT YRQTFALARY NPDGSLDTGF ATSTIPADNS TTYYVQALST RGNRLYALGY
RQSSFFNGFG LGPMSEGVVA AYKLETSVNL SCPVSKTVVA DQGICGAVVK DIDPIAASSP
TKYTLSGATT GSDMGSVSGK LFGIGTTVVT YTQASEPTKT CSFTITVVDR QLPTITGLSV
SPNRLWPPNH KMVDVTLTYN VLDNCKATSV VSVSSNEPQT GPDDNTPDDW QIIDANHLKL
RAERTGSGNG RIYTITVTAT DPSGNKATQV TQVTVPKNNS GRAGADELSL SEEGGEVDGL
AVKVMPNPSS GYFTILTKSA TASVLTMRVS DLQGRPMPGL DNVPANGTLQ LGHTYAPGVY
ILTVIDGPRK VTVKLLKLAE