Gene Slin_4210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4210 
Symbol 
ID8727969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5072358 
End bp5075441 
Gene Length3084 bp 
Protein Length1027 aa 
Translation table11 
GC content54% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003388994 
Protein GI284039064 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAACT TTTTACTACT TCTGTTGGTA ACCTGGGGTT GTCTGATAGC GACGAACGTC 
AGTGCACAGC AGGAGCGGCA AGTGACCGGA AGGGTCACGT CGGCCGAGGA TGGCAACCCA
TTGCCAGGGG TATCTATTGT TGCCAAAGGG ACAACCCGGG GTACGCAGAC GGATGCCAAC
GGAAATTATT CGCTGAGTGT ACCGACTACG ATTAGTACGC TGGTGTACAG CTTCGTGGGC
GTGGTTACGC AGGAAGTATC TATTGGAAAC CGCTCGGTGA TTGATGTAGC ATTGAGTAAC
GATACCCGTT CGCTCGATGA AGTGATTGTG ACCGGATACG GTGCCCAGTC GAAGCGTAAC
CTGACCGGTA ATATCGCCAA AGTAGGTGCT CGCGATATTG AGAACATCCC CGTTCCCAGC
GTTGAACAAG CCCTACAGGG TAAAGTAGCG GGGGTACAGA TTACCTCGCT TAACGGTAAA
GTCGGGCAGG GGTTGCAGAT TAGGGTGCGG GGCTCCTCGT CCCTGACGGC CAGCAGCCAG
CCGCTTTATG TGATCGACGG TATCCCGCTC ACATCGTCGG ATCAGTCGTC GACAACTGCC
CCAACCAACC CCATTGCCGA TTTGAACCCC AACGACATCG AATCGCTGGA AATCCTGAAA
GATGCGTCCT CGGCCGCTAT TTACGGCTCA CGGGCGTCGA ACGGGGTTGT ACTGATCACG
ACCAAGAAAG GACGTGCTGC CAAAACAACC TTCGAAGCGT CGGCACAAAT GGGAGCCAGT
AACCCAACGC ACCTGCGTCA GTGGTTAAAT ACACAGCAGT ATGTAGAGCT GTTGCAGGAA
GCGCGTGCCA ATACGGGTGC TACTTCGGCT ACCTCGCTGG CCAACCGGTT TACGCGCTAC
GCGGCCGGTG ATCCCGCAGG CTGGCAGGGC GAAAATCCAA AATACAATAC CGACTGGCAG
CAGGAAGCCT TCCAGAAAGC GCCTTCGCAG CAGTATGACC TGAGTGCACG CGGTGGCGAT
GCCAAAACCC GGTTCTTTAT TTCGGGTCAG TACTTCGATC AGAGCGGGAT TATCATCAAA
AACCGGTTTC GTCGCCTGAG TGGCCGGGTT AACCTCGACC ATACCGCCAC GGATAAACTG
ACGCTAGGGG TGAACTTCAA CCTGTCGCAC TCCATTAACG ACCGGGTTTC GAACGACAAT
GCTTTCTCGA CACCGATGCA GATTGTGGCC CTGTCGCCCA TGACGCCGGT TGTCGACCCG
CGTACCGGTC GGGTGAGTGG TGCTGATCCT GACCTTTACC CAAGCTTCCC GCTGTACTAC
AACCCGCTGA TCAACCGCGA TTTTGCCAAC CTACAGGCGC GGGTGTACCG GATGATCGGG
AATGTATATG CCGATTATAA ACTACTGCCC GGCCTGTCGT TCCGTACCGA GTTCGGTACC
GACCTCCTGA GCCAGCAGGA AGAAGAATAT TACGGTCGCG AAACGCGGGG CAATACGGGC
GCACCGAATG GTCTGGGGTC AAATACATTC CGACAGGTGG CTAACTATAC CACCAACAAC
TACTTCTCCT TTGGCCGGAC GTTTGCCGAA AAACATGATG TAGACGCTAC GGTGGGTATG
TCGTACCAGG AGTCGCGCAA CAATTACAGC TCCGTAACGG GTCAGCAATT TCCCAGCAGC
GCGTACAAAC AAATCACATC GGCCGCCCGA ATTACGGCGG GCGACGGGCA GGAAACCAGC
TTTAGCTTCC TTTCCTATTT TGCCCGGGTC AACTATCGAT TCAATAACCG CTATTTACTG
GGGGTATCGG GCCGGATTGA TGGATCGTCG CGTTTCGGAA CGAATAACCG ATACGGTTTC
TTCCCGGCGG CATCGGCAGG CTGGATTCTC ACCGAAGAGT CATTCCTGAA AGATGTGAAA
GTTCTGAGCC TGCTCAAACT GCGGGCCAGT TATGGTCTGA CCGGTAATGC CGAAATCGGT
AATTTCAGTT CATTGGCTCT GTATGCGGCC ACGAACAATG CGGGTACATC GGCAGGGTAC
GCTGGTGTGC CGGGACAGGC CCCATCGCAA ATACCAAACC CCGACCTGAC CTGGGAGAAA
ACCTTACAGA CTGATATTGG TCTGGAGTTC GGTTTCCTCA ACAATCGCTT CACCGGCGAA
ATCGACTATT ACCAGAAGAA CACGAGTGGT TTGCTGCTCA ACGTAAATGT ACCGGGCTCA
TCGGGCTTCC GGACGCAGTT GCGCAACGTA GGTAAGCTGG AGAACAAGGG GGTGGAGTTT
GTCTTTAACT CCAACAACTT CAACGGCCCT TTCAAGTGGA CAACCTCGCT GAATCTTTCT
TATAACAAGA ACGTAATTAC CGATTTGGGT GGCACAACCA TTACGGGTAG CTTCCTCAAC
CGGGCGCAGG AAGGACAGCC CCTGGGCGTG TTCGTCGGGC CTGAATATGC CGGTGTCGAT
GTACAGAATG GCGACGCACT CTATTACCGG AATTCAACCA ATGCCGATGG CTCCATCGAC
CGGTCGACGA CGAACAACTA CAACGATGCG GCTTATGTGC CCCTGGGAAA TCCAGCGCCT
AAATTCATCG GAGGTATAAC CAACACGTTC AGCTATGGAG GCATCGACCT GAGCGTGTTG
TTCAACGGGC AGTTCGGTAA CTACATCTAT AATGGTGGCG GTAAATTCCA GTCTGCCAAT
GGCGATTACT TCGATAACCA GTCCATTGAT CAGCTTAACC GCTGGAAAAA ACCGGGCGAT
ATTACCAACG TGCCGCAGGC TCGTTTGCTG GGTGGTAATG GCACCGGCGA ATCGTCGCGG
TACCTGCAAA AAGGTGATTA CGTTCGGTTG CGGACCATTA CCCTGGGCTA CACACTACCC
AAAGAACTGC TGACGCGTAT TCACCTGAGC CGGGTTCGGA TTTTTGCTAC GGGACAGAAC
CTGCTGACAT TCACGAAATA CACGGGCTGG GACCCGGAAG TTAACTCGGA CGCCTACACT
GGCAATCCGG TTAACCTCGG CATCGATTTC TACTCGGCTC CGCAGCCGCG CACAATCATC
GGTGGTTTAC AAATTGGCTT CTAA
 
Protein sequence
MRNFLLLLLV TWGCLIATNV SAQQERQVTG RVTSAEDGNP LPGVSIVAKG TTRGTQTDAN 
GNYSLSVPTT ISTLVYSFVG VVTQEVSIGN RSVIDVALSN DTRSLDEVIV TGYGAQSKRN
LTGNIAKVGA RDIENIPVPS VEQALQGKVA GVQITSLNGK VGQGLQIRVR GSSSLTASSQ
PLYVIDGIPL TSSDQSSTTA PTNPIADLNP NDIESLEILK DASSAAIYGS RASNGVVLIT
TKKGRAAKTT FEASAQMGAS NPTHLRQWLN TQQYVELLQE ARANTGATSA TSLANRFTRY
AAGDPAGWQG ENPKYNTDWQ QEAFQKAPSQ QYDLSARGGD AKTRFFISGQ YFDQSGIIIK
NRFRRLSGRV NLDHTATDKL TLGVNFNLSH SINDRVSNDN AFSTPMQIVA LSPMTPVVDP
RTGRVSGADP DLYPSFPLYY NPLINRDFAN LQARVYRMIG NVYADYKLLP GLSFRTEFGT
DLLSQQEEEY YGRETRGNTG APNGLGSNTF RQVANYTTNN YFSFGRTFAE KHDVDATVGM
SYQESRNNYS SVTGQQFPSS AYKQITSAAR ITAGDGQETS FSFLSYFARV NYRFNNRYLL
GVSGRIDGSS RFGTNNRYGF FPAASAGWIL TEESFLKDVK VLSLLKLRAS YGLTGNAEIG
NFSSLALYAA TNNAGTSAGY AGVPGQAPSQ IPNPDLTWEK TLQTDIGLEF GFLNNRFTGE
IDYYQKNTSG LLLNVNVPGS SGFRTQLRNV GKLENKGVEF VFNSNNFNGP FKWTTSLNLS
YNKNVITDLG GTTITGSFLN RAQEGQPLGV FVGPEYAGVD VQNGDALYYR NSTNADGSID
RSTTNNYNDA AYVPLGNPAP KFIGGITNTF SYGGIDLSVL FNGQFGNYIY NGGGKFQSAN
GDYFDNQSID QLNRWKKPGD ITNVPQARLL GGNGTGESSR YLQKGDYVRL RTITLGYTLP
KELLTRIHLS RVRIFATGQN LLTFTKYTGW DPEVNSDAYT GNPVNLGIDF YSAPQPRTII
GGLQIGF