Gene Slin_5164 details


Gene Information

Locus tag: Slin_5164
Symbol:
ID: 8728930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6305300
End bp: 6307741
Gene Length: 2442 bp
Protein Length: 813 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: TonB-dependent receptor
Protein accession: YP_003389935
Protein GI: 284040005
COG category:
COG ID:
TIGRFAM ID:
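
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; the variable names are illustrative only and do not come from the database.

```python
# Values copied from the Gene Information table.
start_bp = 6305300
end_bp = 6307741

# Assumption: coordinates are inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with Gene Length
print(protein_length, "aa")   # compare with Protein Length
```

Under these assumptions the computed values match the reported Gene Length (2442 bp) and Protein Length (813 aa).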


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAACC TTTTCCTGAT TGCCTTCTGG TTGATGCCCC TGCTGGCGTT TGCTCAACTA 
AACCCGGGCC GTATCAGCGG TACACTTACC GATTCGACCA CCGCTGGTGC TGTTCCGTTT
GCCACAGTAG CTCTCCAAAT CCCGGATGGT ACCCTTATAA AAGGGGTAAC AACGGATGAA
AAAGGAAGCT TTTCTATCGA ACAAATAGCT GTTGGGACGT ACAAACTCGT TATTTCTTTT
GTCGGCTATC GTACCCGAAC GATTGATAAA ATCGCGATCA CGGCCGAAAA GCCTGCTATT
GCCTTGCCCC CTATTAGTCT GGCTTCGGAT AGCCGCAGTT TGAAGCAGGT CGATGTGGTT
GGGCAAAAGG CGCTGGTGGA AGATAAAGGC GACCGGCTGG TGTACAACGC CGAGAGAGAC
GCGTCCAATA CGGGTGGAAC GGCTGCCGAT GTACTGCGGA AAGTGCCTAT GCTGACGGTG
GATCTGGACG GTAATCTAAA GATGCGCGGG AGTGGTAACA TTAAAGTGCT GGTCAACGGA
AAGCCGTCGA GTATTATGGC CCGCAATCTG GCTGATGCGC TGAAACAGAT GCCCGCCAAT
AGTATTAAGT CGGTAGAAGT GATTACCAGT CCCGGCGCCA AATACGATGC GGAAGGCTCG
GCGGGGGTTA TCAACATTAT TACGAAAAAA GCGGTACAGG GAACGAACGG AACGGTCAAT
GCCACGGCGG GTAATCTGAA CCGTTCGCTG GGTGGTAATC TGAACATCAA AGGCAAAAAG
CTGGGGTTTG CCGTTTCACT CAACGGGTAT CAATACCGCA ACATTGGTGA GAATACCAGT
ACCCGCACGT CGCTTTCGGC CGGTGTACCC ACCAGTGTTC TTCGGCAGAG TACGTATCGT
GATAATACCG GTACTGGCGG TTATGGCGAA ATGAGTCTCG ATTATGACCC CGATACAACG
AACCGGATTA ATTTTTCGGC CAATGCCTGG GGCGGTAATT TCCCCATGAA CAGTACCCTT
AACAGCCGCC TGACCGATGC ACAGGGCGGG CTGTTGCAGG CGTACCACCG CGATATTCAA
TTCCGGAATC CGTATGGCAA TACCGAATTC AATCTTGGCT GGACTAAGTC ATTCCGAAAA
CCGGGGCAGG AGTTCTCCCT GCTGACCCAA TACGCCCGGA TGCCGGACAA TTATTATTAC
ACCATTCGGC AAAACAACAT TGAATCGAAA GTGCCCACGT ACCTTGAACG AAGCACGAAT
CTGAGCCGTA ACAACGAATA CACATTTCAG GCCGATTACA CGCATCCGTT TACGGCACGT
ACCAAACGGG ATACGCTCAG CTTCAAGTTA GAAGCGGGAG CTAAAGCCAT TCAGCGCAGT
ATCGGCAGTG AGTTTGTCGT TGAACAGGCA ACTACCGGTC TGGATGCTGA CTATATGATC
GATCCGAGTC GCTCCAACGA TTTTATGTAC GATCAGGGGG TAGTATCCGG GTACACCTCG
CTGAAAGTCG ATTCGAAGCG TAAATGGAAT CTAACGGCTG GGGCCCGGCT TGAGCATACG
ACCATATCGG GGGATTTTAT CTCGACGAAG ACCGCGTTCA ACAGCCAGTA TCAGAATCTG
ATTCCGAGTT TTACGCTGGC AAAAACCCTT CGGGACAAGC ACACGTTTAA AGTAAGCTAC
ACGCAGCGGA TTTCCCGACC GCTGATCTGG TACCTGAATC CGTTCCGTAA CTCCAGTGAC
CCTAATAACA TTCAGACCGG CAACCCGTTC CTAAACCCTG AACTGACTCA TGCTACCGAG
TTGTCGTACA GCACCTTCGG GAAGGAAGGC TCGTCGTTCA ATGCGGCCCT GTTCTGGCGG
CAAACCAACA ATGCCATCGA GTGGCTTTCG ACGGTCAATG TACAGGGTGT TGCGCTGACG
ACACCTCAAA ATATTGGCCG CAATGCTAGC TACGGAGCCA ATATGAACCT AACGCTTCAG
CCTACTAAAC AGCTTAATGC TACGATTAGC ACCGATCTCA CCTATGTAGA CCTGACCAGC
CTGGCTCTCA ACCAGCGAAC AAACGGCTGG GTCTGGAGCG TAAGTCCTAA TGTGTCGTAC
AAACTACCGA AAGACCTTAC GATTCAGGCA AACGGCTACG TTGGGTCTGG TTGGATATCG
CTGCAAAGTC GGAATTCGGG CTGGTATTAT TACGGCCTGT CGGCCAAGAA GGAGTTGATG
GACAAGAAAA TCACTCTTAC CCTGAATGTC AATAACCCGT TCAACCGGAG CGTTCGGATA
ATCGGCGATC AGTTTGCGCC CAGTTTCACG GCTCAGAACA CGTCTATGTT CGTCAATCGC
TCGGTGCGGC TAACGCTGAG CTACAAGTTT GGGCAAATGA GCTCGGGTGG TAAGCAGAGC
AAAAAGATCC GCAACGACGA CAGTAAAGGC GGGGGGCAAT AA
 
Protein sequence
MKNLFLIAFW LMPLLAFAQL NPGRISGTLT DSTTAGAVPF ATVALQIPDG TLIKGVTTDE 
KGSFSIEQIA VGTYKLVISF VGYRTRTIDK IAITAEKPAI ALPPISLASD SRSLKQVDVV
GQKALVEDKG DRLVYNAERD ASNTGGTAAD VLRKVPMLTV DLDGNLKMRG SGNIKVLVNG
KPSSIMARNL ADALKQMPAN SIKSVEVITS PGAKYDAEGS AGVINIITKK AVQGTNGTVN
ATAGNLNRSL GGNLNIKGKK LGFAVSLNGY QYRNIGENTS TRTSLSAGVP TSVLRQSTYR
DNTGTGGYGE MSLDYDPDTT NRINFSANAW GGNFPMNSTL NSRLTDAQGG LLQAYHRDIQ
FRNPYGNTEF NLGWTKSFRK PGQEFSLLTQ YARMPDNYYY TIRQNNIESK VPTYLERSTN
LSRNNEYTFQ ADYTHPFTAR TKRDTLSFKL EAGAKAIQRS IGSEFVVEQA TTGLDADYMI
DPSRSNDFMY DQGVVSGYTS LKVDSKRKWN LTAGARLEHT TISGDFISTK TAFNSQYQNL
IPSFTLAKTL RDKHTFKVSY TQRISRPLIW YLNPFRNSSD PNNIQTGNPF LNPELTHATE
LSYSTFGKEG SSFNAALFWR QTNNAIEWLS TVNVQGVALT TPQNIGRNAS YGANMNLTLQ
PTKQLNATIS TDLTYVDLTS LALNQRTNGW VWSVSPNVSY KLPKDLTIQA NGYVGSGWIS
LQSRNSGWYY YGLSAKKELM DKKITLTLNV NNPFNRSVRI IGDQFAPSFT AQNTSMFVNR
SVRLTLSYKF GQMSSGGKQS KKIRNDDSKG GGQ
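
As a sanity check, the first 60 bases of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch, not the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares for sense codons, and the helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed above.
first_60 = "ATGAAAAACC TTTTCCTGAT TGCCTTCTGG TTGATGCCCC TGCTGGCGTT TGCTCAACTA"
prefix = translate(first_60)
print(prefix)
```

The printed prefix can be compared with the first two 10-residue groups of the protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein, since translation stops at the terminal TAA.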