Gene Slin_5543 details


Gene Information

Locus tag: Slin_5543
Symbol:
ID: 8729317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6753641
End bp: 6755011
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: protein of unknown function DUF224 cysteine-rich region domain protein
Protein accession: YP_003390308
Protein GI: 284040378
COG category:
COG ID:
TIGRFAM ID:
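
The start, end, gene length, and protein length fields of this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 6753641
end_bp = 6755011

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which does not contribute an amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1371 bp
print(protein_length, "aa")   # 456 aa
```

Under these assumptions the computed values match the listed Gene Length (1371 bp) and Protein Length (456 aa).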


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAATTT TTCAACAAAT TTTATTTGTC GCGGCTTTAG CAGCCGTGGC CTGGTACATA 
ACCAAGCGAA TTCAGCTCAT TTCCCGAGCC ATTAAGCTCG GACGCGCCGA AAATCGGACC
GATCATTCCG ATGAGCGATT AAAAACAATG CTTCTGGTTG CCTTCGGTCA GAAGAAGATG
TTCACCAATC CACTGGTTGG TGTCATGCAC TTTATCATTT ATGCCGGGTT TATTATTATC
AACCTCGAAA TTCTGGAAAT CATTCTGGAT GGTATTCTGG GTACGCACCG GCTATTTGCG
CCTTACATTA CGCCCGTTTA TCCCTTTCTG ATCAACATAT TTGAGATACT GGCTTTTGGG
GTGCTGGCCG TTTGCGTGGT GTTCCTGTGC CGTCGGTTTG TGGCGAAAGT AAGCCGGTTT
CAGCCGGAGC GCCACCGCGA GATGGCTCGC TGGCCCCAGG CTGATGCGGC TATCATTCTG
ACCGCCGAAA TTCTGCTCAT GATCGCGTTC CTGACCTGGA ATGCATCTGA TAGCGTTTTA
CGCGATAGGG GAGTTGGCCA TTATGGCGAG TTACAGGGCA TTGTGCCGGA CTTTATCATC
AGTCAGTACC TGAAGCCGCT GTTCGCAAAC TTCAGTGACA CCGCGCTGGT AGCCTATGAG
CGGATTTCCT GGTGGTTTCA TATTCTGGGT ATTCTGGCCT TCGCCGTGTA TGTGACTTAC
TCTAAGCATC TGCACATTGC ACTTGGCTTT CCGAACGTCT ACTTCTCGGA CCTGCAACCT
AAAGGCGAGA TGCAGAACAT GCCCGAAATC ACCAAAGAAG TTCAACTCGC ATTGGGCCTG
CCTGTTACAA CTGAAATGGA CGGTTCACAA ACGAATGACA ACGGAGAGCA GCCAGCCGAA
ATCGGCCGGT TTGGCGCTAA AGATGTGCAG GATTTGAAAT GGATCAACCT GATGAACGCT
TACAGCTGCA CCGAGTGCGG GCGTTGTACG GCAGCTTGTC CGGCTAACAT CACGGGTAAG
AAGCTTTCGC CCCGCAAGAT TATGATGGAC ACCCGCGACC GGCTCGAAGA AATACAGCAG
GGTTGGAAAA CGAATGGCCC GGACTACCGC GACGATAAAT CCCTACTGAA TGATTACATC
ACCGCCGAAG AGCTCAACGC CTGCACTACC TGCCAGGCTT GTGTAATGGC CTGTCCGATC
AATATTAATC CGCTGGACAT TATCCTTCAG CTACGCCGGT ATCGCGTCAT GGAAGAATCG
CAGGCACCTG CCTCCTGGAA TGCGATGTTC AGCAATATCG AAAACAACAT GGCTCCCTGG
AAATTCTCAC CCAGCGACCG CTTTAACTGG GCTGACCAGG TGAATGATTA A
 
Protein sequence
MEIFQQILFV AALAAVAWYI TKRIQLISRA IKLGRAENRT DHSDERLKTM LLVAFGQKKM 
FTNPLVGVMH FIIYAGFIII NLEILEIILD GILGTHRLFA PYITPVYPFL INIFEILAFG
VLAVCVVFLC RRFVAKVSRF QPERHREMAR WPQADAAIIL TAEILLMIAF LTWNASDSVL
RDRGVGHYGE LQGIVPDFII SQYLKPLFAN FSDTALVAYE RISWWFHILG ILAFAVYVTY
SKHLHIALGF PNVYFSDLQP KGEMQNMPEI TKEVQLALGL PVTTEMDGSQ TNDNGEQPAE
IGRFGAKDVQ DLKWINLMNA YSCTECGRCT AACPANITGK KLSPRKIMMD TRDRLEEIQQ
GWKTNGPDYR DDKSLLNDYI TAEELNACTT CQACVMACPI NINPLDIILQ LRRYRVMEES
QAPASWNAMF SNIENNMAPW KFSPSDRFNW ADQVND
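
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the tool that produced this record. It assumes the standard codon assignments, which translation table 11 shares for elongation codons, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for this example. The input is the first line of the gene sequence above.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments (it differs in which start codons are allowed).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, copied from this record.
gene_prefix = "ATGGAAATTT TTCAACAAAT TTTATTTGTC GCGGCTTTAG CAGCCGTGGC CTGGTACATA"
print(translate(gene_prefix))  # MEIFQQILFVAALAAVAWYI
```

The output matches the first 20 residues of the protein sequence listed above.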