Gene Slin_4319 details

Gene Information

Locus tag: Slin_4319
Symbol:
ID: 8728079
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5229225
End bp: 5232440
Gene Length: 3216 bp
Protein Length: 1071 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003389100
Protein GI: 284039170
COG category:
COG ID:
TIGRFAM ID:
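
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5229225, 5232440)
print(length)                           # 3216
print(expected_protein_length(length))  # 1071
```

Both values agree with the Gene Length (3216 bp) and Protein Length (1071 aa) fields listed for this gene.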


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0290506
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.156669
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGAAT ATATACGTAG CGGTCTTCGA CTGACCGTTT GGCTGACGTT CCTGGCGAAT 
GCAGCAGCGC CCATACAGGC ACAGCAATTG GCTGCTGCCC TGACCAGGCA AAAATCGTCG
GCAGCACTGG CCGCCAGCAC GAGCGCGTCG GTTCAGCTAG TAACAGGCCG CGTAACCGAC
GAAACCGGCA CGGGCTTGCC GGGTGCCAAC GTGACCGTAA AAGGCACTAC CACCGGAACC
GCCACTGACG AGAAAGGGCA GTACCGAATC AGTGTTCCAA CGCCCAATGC GGTGCTGGTC
TTTAGTTCGG TTGGCTACCT GAAGCAGGAG GTCAGCGTAG GAAACCGGAC AACGGTAGAT
ATTCAGATGC GCGTTGACAA CCAGAGTTTG AGCGAGGTTG TCGTGATCGG GTACGGCGAG
CAGTCGCGGA AAACGCTCTC CACGGCCATT GCCAAAGTAG AGGGCAAAAA TATTGGTATA
CAGCCCGTCA GTACACCCGG TGAGGCTCTG GCCGGTCTGG CGGCTGGTGT GCAGGTGCAG
TCTGACCGGG GGAGTACGCC GGGCGCACCA CCCACTATCC GTATTCGGGG AGTTGGTTCG
CTGAGTACGG GCAGCACACC GCTGTATGTC GTTGACGGCT ATCCGCTACA GGACCCCGCC
CAGTTTGCGC TCATTAATCC AACGGACATC GAGTCGATGG AAATTCTGAA AGATGCGGCT
TCGGCAGCTA TTTACGGGTC ACGGGCGGCC AACGGGGTTG TTATTGTAAC GACCAAGCGA
GGCAAAGCGG GCAAAACCAG TTTGAATGTA TCCATCTACA CAGGCATTCA GCAGCTGGCC
AAGAAAGTAC AGCTCCTGAA TCGGGATCAG TACATAGAGA ACGCCATTTA TGCCTCCAGA
CTCAAGAATA TACCTTACCC AAAAGTATTT GATACCAAAC CCGACAGTTT GCCCGATACT
GACTGGCAGG ATGCTATTTT CCGGCAGGCG GCCATCAGTA ACTACCAGAT TTCGGCCACG
GGCGGCACCG ATAAGGTTCG TTTTGCTGTC TCGGGCGGGT ATTTCAAGCA GGACGGTATC
CTGAAAGGTT CAGCCTACGA ACGGTATAAC CTGCGCTTTA ACCTGGATGC CGACTTGAGT
CCTAAACTCA AATTAGGGGT GTCGATGGCG CCTTCCTACA GCAGTCAGTT TCAGCAGCAG
GCGGCCGGGC AGTTCAACGG ATCGAACGGT ACCGAAACCA GCGGCACCCG GTCGTTACCC
AGTGCCATTA TTTCGGCCAT CGACATGCCG CCAACCATTC CGGTGTATAC GCCCAATGGC
GATTACGCGC AGACCTTCAA CGGCAACACG AACCCCAATG GTACCAATTT TTACCAGACC
AACCTCTATA ACCCGCTGGC CGTTCTGGAG CTTAGCCGCA ACAACCTGAA AGGCTATCGG
CTGTTCGGCA ATGGCTTTCT GGAATGGCAA CCGATTGCCA ACCTGCGGCT GAAAACAACG
CTGGGGTCAA CGCTGAGTAT TTTCGATCAG TCGGCCTATA TCCCGGCCAA TCTGGCCAAC
GAATCGGCTC CCCGCGCCAA CTCCACGAAC CCGGTGCTGG GTCAGATATT CGCCCGCGAG
TCGCAGACGG TGACGCTGGA CTGGCTCTGG GAAAATACCG CTACCTACAA CAAGACGTTT
GGTAACCACA ACTTCTCGCT GCTGGCCCTG TACTCGCTCC AGAAATTACA GGCTAAAAAC
ACGGCTACGT CGGGCCGGTC GGGTAGCTAC ACGACCAGTC TGCTGGATAA CCCGCTGGCC
TCGCCCGACC GGATTGGTGA GCTGAACTAC GATCAGAACG CCTTTCTGTC ATTGGGCGGA
CGGATCACCT ACGACTTCAA AAGTAAATAC ATCTTTTCGG CCGCCATTCG CCGGGATGCG
TCGTCGCGCT TCGGGCCAAA CAACCGCTTT GCTACGTTCC CATCCATCTC GGGAGCCTGG
CGGATCAGCG AAGAGAAGTT CTGGTCGGGC CTGAAGAACA GCATCAGCGA ATTCAAAATC
AGGGCCAGCT ATGGCGAAAC GGGCAATGCC AATATTGGTA GTTTCAACTG GACAAACAGC
GTACAGGGCC GGAACTACAG CTTCAATCAG GCGCGGACCT TCGGCTATGC CCAGACCGGC
TTTGCCAACT ACGACCTGAC CTGGGAGAAA AACGTGCAGA CCGACCTGGG CCTCGAAATG
GGTTTCCTGA ACGACCGATT CACCCTGGGC CTCGACTATT ACAATCGACT GACAACAGGT
ATGCTGTTCC AGAAAGATTT GCCGGGCATT GTGGGCTACG CTACTAATTT CCGAACCAAC
ATCGGCAGCC TGCGCAACCG GGGGCTTGAA CTCTCAGCCC GGGCGAACCT CACTGTAGGT
GCTGTTCGCT GGACGATAGA CGGCAATATT TCGGGCAACC GCAGCAAGGT GATGGATCTC
GGCGGTCCTT CGTCACTGCC AACGGTAGCG GCTATTTTTG GCTGGAATAA CGTCTATCAG
GTTCGCGTGG GCGACCCGCT GGGCAATATG TATGGCTATC AGGTGGTGGG TATCTTCAAA
AATGCCGATG ACCTCAGCAA GAACGCCCAG TTCACAACGG GCGACAAAGT GGGGAACTGG
ATGATTCGGG ATCAGAATGG CGATAATAAA ATCGACGAGA ACGACCGGGT GTATGTCGGC
AAAGGCGTAC CCAGCTATAT CTGGGGGATG ACCCACAGCT TTCAGTACAA AAACTTTGAC
CTGAGCGTCA TTCTTCAGGG TGTACAGGGC GTCAATGTCA TCAATGGAAA CCTGCGGCAC
ATCTGGGCAA ACCAGGTGTT CAACACCATT CCGCTTTACT TCCGGAACCA GTTCGATCCG
GCCAACCCGA CGCAAAACAC CGACTTCCCG GCGGCTGGTG CGGGGGGTAT TCACCCCGGC
AACAACCTCA CCGACCGGTT GCTTTTCGAC GGTTCGTTTG TCCGCATTCG TAACCTCACC
TTCGGCTACT CCGTACCGAC GGTTTTCCTG AACAAGATCA AGCTACAGTC GGCGCGCATC
TACGTGACGG GGCAAAATCT GTTCACCTTC ACCAGCTATC CCTGGTACAA TCCGGAGACT
AACACCGTGC CCGATTCACC CGTGCAGATT GGCGTCGATC AGGGTACCTA CCCACTGGCA
CGTACCTACA CCATTGGCCT AAATATCGGC TTCTAA
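
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The following sketch shows one way to do that and to compute a GC percentage comparable to the GC content field of this record. The helper names are hypothetical, and only the first line of the sequence is used here as sample input; pasting the full sequence in its place would give the whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as sample input.
sample = "ATGCGTGAAT ATATACGTAG CGGTCTTCGA CTGACCGTTT GGCTGACGTT CCTGGCGAAT"
seq = clean_sequence(sample)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of the sample only
```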
 
Protein sequence
MREYIRSGLR LTVWLTFLAN AAAPIQAQQL AAALTRQKSS AALAASTSAS VQLVTGRVTD 
ETGTGLPGAN VTVKGTTTGT ATDEKGQYRI SVPTPNAVLV FSSVGYLKQE VSVGNRTTVD
IQMRVDNQSL SEVVVIGYGE QSRKTLSTAI AKVEGKNIGI QPVSTPGEAL AGLAAGVQVQ
SDRGSTPGAP PTIRIRGVGS LSTGSTPLYV VDGYPLQDPA QFALINPTDI ESMEILKDAA
SAAIYGSRAA NGVVIVTTKR GKAGKTSLNV SIYTGIQQLA KKVQLLNRDQ YIENAIYASR
LKNIPYPKVF DTKPDSLPDT DWQDAIFRQA AISNYQISAT GGTDKVRFAV SGGYFKQDGI
LKGSAYERYN LRFNLDADLS PKLKLGVSMA PSYSSQFQQQ AAGQFNGSNG TETSGTRSLP
SAIISAIDMP PTIPVYTPNG DYAQTFNGNT NPNGTNFYQT NLYNPLAVLE LSRNNLKGYR
LFGNGFLEWQ PIANLRLKTT LGSTLSIFDQ SAYIPANLAN ESAPRANSTN PVLGQIFARE
SQTVTLDWLW ENTATYNKTF GNHNFSLLAL YSLQKLQAKN TATSGRSGSY TTSLLDNPLA
SPDRIGELNY DQNAFLSLGG RITYDFKSKY IFSAAIRRDA SSRFGPNNRF ATFPSISGAW
RISEEKFWSG LKNSISEFKI RASYGETGNA NIGSFNWTNS VQGRNYSFNQ ARTFGYAQTG
FANYDLTWEK NVQTDLGLEM GFLNDRFTLG LDYYNRLTTG MLFQKDLPGI VGYATNFRTN
IGSLRNRGLE LSARANLTVG AVRWTIDGNI SGNRSKVMDL GGPSSLPTVA AIFGWNNVYQ
VRVGDPLGNM YGYQVVGIFK NADDLSKNAQ FTTGDKVGNW MIRDQNGDNK IDENDRVYVG
KGVPSYIWGM THSFQYKNFD LSVILQGVQG VNVINGNLRH IWANQVFNTI PLYFRNQFDP
ANPTQNTDFP AAGAGGIHPG NNLTDRLLFD GSFVRIRNLT FGYSVPTVFL NKIKLQSARI
YVTGQNLFTF TSYPWYNPET NTVPDSPVQI GVDQGTYPLA RTYTIGLNIG F
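
The protein sequence corresponds to a translation of the gene sequence with translation table 11. As a minimal sketch, the code below builds the codon table from the standard NCBI amino-acid string, which table 11 shares with the standard code for all sense and stop codons, and translates until the first stop codon. It deliberately ignores the alternative start codons that table 11 allows; that is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(raw_dna: str) -> str:
    """Translate a DNA sequence up to, and not including, the first stop."""
    dna = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCGTGAAT ATATACGTAG CGGTCTTCGA"))  # MREYIRSGLR
```

The output matches the first ten residues of the protein sequence shown above.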