Gene Slin_2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2066 
Symbol 
ID8725804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2495667 
End bp2496881 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content53% 
IMG OID 
Productprotein of unknown function DUF418 
Protein accessionYP_003386906 
Protein GI284036976 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0837783 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTATGC ACGCTCCACA AGTCCACCCC ACCACGGAGC GCAGCGCACT GGTTGATGCG 
CTTCGTGGGC TGGCCCTGCT GAGTATTGCC CTGGCCAACG TGCCCACGGG CGATGCGCTT
AAAAGCACCC ATTACATCTT CCTAAATAGC CAGGCCATCA ACCCGATTCT GGAAGCGGCT
AAGCACATCC TGATCAGCAC GAAGTTCATT ACCCTCTTTT CCATTCTGTT TGGCTACGGT
TTCTATACCC AGCTAAACCG GGCCACGCAG TGGGGTACTT CTTTCAGGCG CTACTTTACG
ATGCGTATGC TACTGCTGCT GATCATTGGG TGTCTGCATG CTTACCTGCT CTGGTTCGGC
GACATTATCC GGTACTACGC CCTCTGCGGC ATGGCGCTGC TCGTCTTTCA TCAGCTTTCC
ACCCGAAAAC TGCTCATTAC GGCCCTGGTC TTCATGGTTC CGGTCACGGC CATCCTGTTC
ATTCTGAACG GCCTGCTGGA ACTACAACGC TACAGCTACG ATTACACCAT TCCCGACCGG
ATCATCTATG AGACATCTTA CTTAAACTAC CTGCGCGACA ACTTCACCAT TGATCCAATG
GTCAATTTTG TCCAGGATTC GCCCATCACG TTTGCGGCCT GTTTCGGAAA AATCCTGTTT
GGTTACTGGC TGGGTCGAAT TAGCTTTTTT CAGCAACCCC AACGGTTCGG GCGCATGCTG
AAGAAGTGGT TCTGGTGGGG ACTTTCGGTG GGAACTTTTG CCAGCGTGGG CTACTGGGCA
GTTAGTACGG GGCGGCTAAC GTTAGACCTT CCACTACTGT GGTTGCCTTT TGTCATTGCG
GGCGGGCTGG TGCTCCACAG CCTGTTCTAT ATCGCAGCCT TTGTGAGGGT ATTTCAAACC
CAGCGAGGTA AGCGGGTTTT GCTGATTTTC GCTCCCCTCG GAAAAATGGC CCTGACCAAC
TACCTGCTTC AGACGGTCTT TTATCTGCTC TTTTTTTACG CCTGGCCCCA CGCCTGGCCA
ACAAGCCAAC GAATCAGTCT GGCCGAGGTG TATCTGCTCA CGTTGCTCTT TTATGGATTG
CAGGTACTCT TCAGCCACTG GTGGCTACGG TATTTCAGTC AGGGGCCGGT GGAATTCCTT
TGGAAAAAGA TGGCTTATCG GCAGCTTGGG CCGGGTGACC GCCCGGCATC GCAAATCTCT
TCGATTCCGT CCTGA
 
Protein sequence
MTMHAPQVHP TTERSALVDA LRGLALLSIA LANVPTGDAL KSTHYIFLNS QAINPILEAA 
KHILISTKFI TLFSILFGYG FYTQLNRATQ WGTSFRRYFT MRMLLLLIIG CLHAYLLWFG
DIIRYYALCG MALLVFHQLS TRKLLITALV FMVPVTAILF ILNGLLELQR YSYDYTIPDR
IIYETSYLNY LRDNFTIDPM VNFVQDSPIT FAACFGKILF GYWLGRISFF QQPQRFGRML
KKWFWWGLSV GTFASVGYWA VSTGRLTLDL PLLWLPFVIA GGLVLHSLFY IAAFVRVFQT
QRGKRVLLIF APLGKMALTN YLLQTVFYLL FFYAWPHAWP TSQRISLAEV YLLTLLFYGL
QVLFSHWWLR YFSQGPVEFL WKKMAYRQLG PGDRPASQIS SIPS