Gene Slin_4810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4810 
Symbol 
ID8728574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5865385 
End bp5868606 
Gene Length3222 bp 
Protein Length1073 aa 
Translation table11 
GC content53% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003389587 
Protein GI284039657 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0674411 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.790047 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACACA ATTTACGAAC GAGGATTAGC GCCTGTATTC TAGCAATATG GGTCACTTTC 
CTGATTAGCC ATGCCGCGTT GGCACAGGAT CGACGCGTTA CGGGCCGGGT GGTAGCGGCA
AAGGACCAGC AGCCTATTCC GGGGGTAACA ATCTTGGTTA GAAATACTCA GTTAGGTACT
ACGACCGATG CTAACGGTTC GTTTACACTT AACGTACCAG CCAACTCTAC GCTGGTATTT
AGTGCCATTG GTTTTGCCGG GCAGTCACTG GCCATCGGCA ACCAGACCCA GTTAACAATA
ACGCTTCAGG AAGCCGAGCA AAATCTGGGC GAAGTAGTCG TAACAGCGCT GGGTATCAAA
AAAGAAGCCA AACGGCTGGG CTATGCTACC GCCATTGTCA ATCCTGAGCA GGTAACCACT
AACCGTACGG TTAACTTCAT TAACGCCTTA CAGGGTAAAA TTGCGGGTGT TAACATCAGC
AGCCTGGGTA CGGGTGCCGC CGGAACGAGT AAGATCCGTA TTCGGGGTCA GTCGTCCTTC
TCGGGGCAGA ACAGCCCGCT TATCGTAGTA AACGGTGTGC CAATTGACAA CACCAACTTC
GGTCAGAATA ACGGGAACAC CGGTGGCGAT AACTCCATCG GCAACCGCGA CCGCAACTAC
TCCGACGGCG GTGACGGTCT TTCGTCCATC AACCCGGATG ATATTGAGGG AATGACGGTG
CTGAAAGGCG GTACGGCTGC GGCTCTGTAC GGCTCCCGCG CCAAAGACGG TGCCATCCTG
ATCACGACAA AAACCAAAGG TACCGGTCAG GGTATTGGTG TAACGTTCAA CAGCAACTTC
ACTACAGACC GCCCGCTTGA TTTTACTGAT TACCAGTATG AGTACGGACA GGGTGAATAT
GGAGTGCGGC CAACAGCGGC CAACCCAACA TCGGGCGTAT GGAGCTTCGG GGAGAAGTTT
GCCGGTCAGA CGCAAGTGCT GTTTGGGGGT GTAACGGTGC CCTATGCGCC AGTTCGTAAC
CGGATCAACA CGTTCTACCG GGATGGGTCG ACATGGACCA ATTCGATTTC GGTATCGTCG
GGCAGTGAAA AAGGCGGGTT CAACTTGTCT ATCGCTAACC TGGACAACAA AGGCATCACG
CGCAACAACA CCTTTAACCG GAAGACGATG AACCTTGGTT TCAGCTACAA CCTGTCGCCA
CGGTTAACCG TTACGGGTAC ACTCAACTAC TCCAACGAGT ACAACAAAAA CCCGCCCCAA
ATTGCCCAGC AGGACAACAG TACGCCAACG GTAATTTACA CACTGGCCAA CTCCATGCCG
CTGGACGTGC TGGAAGCCAA CCAGATCAAC CCGGCTACGG GCAACGAGTT CGTGTATTCG
CGCTTCATGA ACCGCACGAA TCCGTACTTT GTCCTCAACA ACAAGTTTGA GAACATTCGC
CGGGATCGCC TGTTCGGTAA CCTTACGGCC CGCTATAACG TGACCGACTG GCTGTACGTG
CAGGGACGGG TTGGGCAGGA TTACTGGTCG CGCGATCAGG ATTATAACTT CCCAACGGGG
CAGGCCTCTC TGGCAGCAGC ACCAGCCGGT TTTGTAAACG GAGCCTATGT ACAGGAAGCC
CGTCGTTTTC GCGAAATCAA CGCCGACTTC CTGATTGGTG CCAACCACAA GTTTGGCGCG
TTCGGTGTCG ATCTTACCGT TGGCGGCAAC CAGTTGTACC GTCGAAGCGA CCTCAACAGC
GTACAGGTAA CCGACTTCAT TGTGCGGGGC TTATACGTAC CGCAGAACGG ACGGGTGAAA
GACCCCATCT ATGGTCTGAG CGAACGGAAA GTTAATTCGC TTTACACAGC AGCCGAATTT
TCGTTCAAAG ACGTCCTCTT TCTGAACGGT ACGCTACGTA ACGACTGGTT CTCTACCCTT
GCGCCAGCTA ACCGCAGCAT TCTGTACCCA TCGTTAACAG GTAGCTTCGT CTTTTCGCAG
GCCTTCGACA ACCTGCCCTC TTTCATAAAC TTCGGTAAGA TCCGGGCTGC ATATGCCGAG
GTCGGCAGCG ACGGCGACGT AGCACCTTAC TCGAACAACC TGTTCTATTC GGTCAATGCC
AACCTGTTCC CGAACCCCGC AGGTCTGGGC CAGCCGGTCG GTAACATTAC ATCCAGTACC
GTGCCAAGCT CCACCCTCAA ACCCAGCCGT ACCGCCGAAA CCGAAGTGGG TCTGGAGTTG
AAGTTGTTCA ACAACCGGGT TGGCCTGGAC ATGGCCGTGT ATCGCAAAAT TACCAGCGAC
CAGATTGTAC AGGCCCAGTC GTCCGATGCG TCGGGGTACA CCTCTACTCT GATTAACAGT
GGACAGAGCC AGAATCAGGG CATTGAAGTA TTACTGAATC TCGCACCTAT TCGCACTAAG
GATTTTTCCT GGGACATTAC GCTGAACGGG GCTTACAACA AGACCAAACT ACTCCGGCTA
CTCACCGACG ACGACGGCTC GCCCGAGAGA GATTATAACA AAGACAAACA GGCCGAGCAG
ATTGTGGTCG GTACGGGTAT TTACGTGGGT GATCTTCGGC AGGTAGTTGG CCAGGAACTG
GGTCAGTTAT ATAGTTTCGG CTACCAGCGC GACGCACAGG GACGTATCAT TCACGGGGGC
GATGGTCTGC CCGTTCGGAC ACCGGCGCCT ATTTCGTTCG GCTCAGCTCT CCCTAAGTAT
ACGGGCGGTA TCACCAACAC GTTCAACTAT AAAGGCGTTA ATCTCTCGTT CCTGATCGAC
TTCAAGCTCG GTGGCAAGAT GATCTCGGGT ACCAACCTGA ACGCTTTCCG TCATGGATTA
CAGAAAGAAA CACTGGTAGG CCGGGGCGAA GCCGACAACA AAATGGTGGG TGTTGGCGTG
AACGATAAAG GTGAGGTAAA CGCCGTTCGG GCGTTCGTGC AGGACTACTA CTCGGTAGGT
CGTTCCAAAA GCCTGGGCGA GCAGGTAGTA TATGACGCCG GTCTGTGGAA ACTCCGCCAG
ATCAGCCTTG GCTACGACTT CACCAAGATG CTGCCTAAAA GCCTGTTCAT TAAAGGTATT
CGGTTGAGTG CGGTAGCTAA CAACGTGGCC ATCATCAAAA AATGGGTACC CAACATCGAC
CCCGAGCAGT TTGGCTTTAG CTCCGACAAC CTGATCGGTC TGGAATCAAC CGGCTTACCG
ACAACGCGCA GCATTGGCTT TAACCTGAAT GTTAAATTCT AA
 
Protein sequence
MGHNLRTRIS ACILAIWVTF LISHAALAQD RRVTGRVVAA KDQQPIPGVT ILVRNTQLGT 
TTDANGSFTL NVPANSTLVF SAIGFAGQSL AIGNQTQLTI TLQEAEQNLG EVVVTALGIK
KEAKRLGYAT AIVNPEQVTT NRTVNFINAL QGKIAGVNIS SLGTGAAGTS KIRIRGQSSF
SGQNSPLIVV NGVPIDNTNF GQNNGNTGGD NSIGNRDRNY SDGGDGLSSI NPDDIEGMTV
LKGGTAAALY GSRAKDGAIL ITTKTKGTGQ GIGVTFNSNF TTDRPLDFTD YQYEYGQGEY
GVRPTAANPT SGVWSFGEKF AGQTQVLFGG VTVPYAPVRN RINTFYRDGS TWTNSISVSS
GSEKGGFNLS IANLDNKGIT RNNTFNRKTM NLGFSYNLSP RLTVTGTLNY SNEYNKNPPQ
IAQQDNSTPT VIYTLANSMP LDVLEANQIN PATGNEFVYS RFMNRTNPYF VLNNKFENIR
RDRLFGNLTA RYNVTDWLYV QGRVGQDYWS RDQDYNFPTG QASLAAAPAG FVNGAYVQEA
RRFREINADF LIGANHKFGA FGVDLTVGGN QLYRRSDLNS VQVTDFIVRG LYVPQNGRVK
DPIYGLSERK VNSLYTAAEF SFKDVLFLNG TLRNDWFSTL APANRSILYP SLTGSFVFSQ
AFDNLPSFIN FGKIRAAYAE VGSDGDVAPY SNNLFYSVNA NLFPNPAGLG QPVGNITSST
VPSSTLKPSR TAETEVGLEL KLFNNRVGLD MAVYRKITSD QIVQAQSSDA SGYTSTLINS
GQSQNQGIEV LLNLAPIRTK DFSWDITLNG AYNKTKLLRL LTDDDGSPER DYNKDKQAEQ
IVVGTGIYVG DLRQVVGQEL GQLYSFGYQR DAQGRIIHGG DGLPVRTPAP ISFGSALPKY
TGGITNTFNY KGVNLSFLID FKLGGKMISG TNLNAFRHGL QKETLVGRGE ADNKMVGVGV
NDKGEVNAVR AFVQDYYSVG RSKSLGEQVV YDAGLWKLRQ ISLGYDFTKM LPKSLFIKGI
RLSAVANNVA IIKKWVPNID PEQFGFSSDN LIGLESTGLP TTRSIGFNLN VKF