Gene Slin_1167 details


Gene Information

Locus tag: Slin_1167
Symbol:
ID: 8724900
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 1423621
End bp: 1426836
Gene Length: 3216 bp
Protein Length: 1071 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003386017
Protein GI: 284036087
COG category:
COG ID:
TIGRFAM ID:
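
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information table.
start_bp = 1423621
end_bp = 1426836

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 3216, matching "Gene Length"
print(protein_length)  # 1071, matching "Protein Length"
```

Under these assumptions, both computed values agree with the table.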


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAAAT TTCTATTAAC ACAGTTTGTC CTGTGTTTAT TCGCACTTCC ATTGATAGCT 
CAGGATATAG CCATCAGTGG AAAGGTCACA TCGTCGGAGG ATGGTTCAGT GCTTCCCGGT
GTTAACATTA CGGTCAAAGG AACGTCTCGC GGAACCAGCA CCAATGCTGA GGGAACGTTT
CAGCTTAACG CTCCAGCCAA CTCAAGGCTG GTATTTAGTT TTATTGGCTT CACAACACAG
GAAATTGCCA TTGGCAACCG AACAAACATC AGTGTTAATC TTGCCCCCGA CGCGTCTCAG
CTTCAGGAGG TTGTCGTTAC GGCGCTGGGT ATCTCCCGCG ACAAGAAGGC ACTGAACTAT
GCCGTTCAGG ACCTGAGAGC CGATAAACTA AACTTTGCAC GTGACCAGAA TGTGGGTAAC
GCGCTGGCGG GTAAAATTGC CGGTGTGCAG GTACTCGGTC AGTCGGGGGC TAAGTTCGGT
AACCCGAACA TCCGCATCCG GGGCGTTAAC TCGCTGTCGG GTGGCGACCC GCTCTACGTT
GTAGACGGTA CGCCAACGGA CATTAGCCAG GTGAACATGG ACGATGTAGA GAACCTGACC
GTGCTGAAAG GACCCTCTGC AACGGCTCTG TATGGTAACC GTGCTTCGGC GGGCGTTATC
GTCATCACAA CTAAGCGTGC CAAAGCCGGC GAAACCCGTC TGGACATCAA CCACAGTACA
ACGCTCGACA TGGTGGCTCT GCTGCCTAAG TACCAGAACG AATACGGTGG TGGTTACTCG
CAGGAGTGGG AAACGTTCCA GTTCGATCCA TCCATTCACC CGGCGGCATG GTCGTCGTTC
AATGGGCAGA AAATTCTGGA CTACTCGGCC GATGAAAGCT GGGGTCCCCG TATGGATGGC
TCACCCCACC GGTCGGCGTT TTCCTGGCAG CCAGGTGCTG AATTTGGTCA GCTGACGCCG
TTCTCGCCAC AGCCCAACAA CGTACGCGAC TTCTTCGAGA AGCCGATCAG CAACAACACG
AATATTGCCT TTTCGCGCGG AACGGAGGCT TTTCAGAGCC GTATCTCCTA CACGCACATC
ATCAACAACG GGATTATCCC CAACTCGTCG CAGTCTCGTG ATTACGTCAG TGCGAAGAAT
GCCATCAAGT TTGCTGAGAA ATTAACAGCG AACCTGAATT TCAACTACAC ATCGACAAAC
ACGAAAAACC AGCCTGCCGA CCGCTATGGC TCATCGGGAG GAACAACGCC ACAGAACAGT
CCGCTGGGTA TCTCCAACTC GACCCTGAAC GGCTACAACC AAACGATTGG TATGTTCAAT
CAGTGGTTTC AGCGTCAGTT ACGGATTGAG GATTTGCGTA ATTACAAAAA TCCGGATGGT
ACCTTCCGTA GCTGGAACAT CGGCGGTCCC TTGGAAGCGG CCCCTAAATA TTGGGATAGC
CCATACACGC AGGCCTACGA AAATACCAAC ACCAACCGGA GCGACAGGTT ATTTGGTGAC
ATTGGCCTGA CTTACCAGTT CACACCGGCT CTGAAAGCAT CGGCTACGGT ACGCCGTGAC
CAGAACGCTT ACTATCAGCA GGGCCGAGTG GCAATTGGTA CACTGAATGA AGGACAGAAA
GGTGGCTTCT CGACCTTAAC CTCCAACAGC CGCGAAAACA ACTATGAGTT GCTGGTGAAC
TACAACGAGA ACTTCAAAAA CTTGTCTGTT GTCGCCAACG CAGGGGGGAA CATCCGCTAC
AACCGTGTTG ATGGTCTTTT CCAGGCCACA GTAGGGGGCT TATCGGCACC AGGTTTCTAT
AACATCGCGG CTTCTATTGA CCGGCCATTA TCCAACAACT ACCTCTATGA GCGCCGGATC
AACAGTGTGT TCGGAAACGT AAGTGTTGGT TTCCGTGACT TTGTTTTCGT TGAAGCTTCG
ATCCGGAATG ACTGGTCGTC TACGCTGCCT AAAGCGAACA ACGCCTATCT GTATCCGTCG
GTGTCGGCCG GTGTTATCCT GACCGAATTG CTGCCCAAGA GCCAGGTACT TTCCTATGCT
AAAGTTCGGG CGGGCTATGC TCAGGTGGGT ACCGATGTTG GCCCTTACCA AACAGCCCTG
GCGTATACCT CTGGTCAGCC TTATGGCAGC AATGCAACCG CCTTTTTGCC CGGCACATTG
CCCAACGCCA GTCTGAAGCC CGGCCTATCG TCTTCCTATG AAGGCGGTAT CGACCTGAAA
TTCCTGAACA ACCGGATCGG TGTGGAATTT ACTGCCTATC AGAATGACAA CAAGAACCAG
ATCATTCCGC TGCCGGTAGC GCCTACGAGC GGGTATACCA ATGCCGTAGT AAACGCCGGG
TTGATCCGTA CGTCGGGTCT TGAATTGCAC ATTTACGCTA ACCCAATCCG GTCGGCGTCT
GGCTTTAACT GGGAGTTCGA CATCAATGCA GACCGCAACC GTTCGCAGGT GATTGAACTG
GCCAATGGCC TGACCAACTA CCAGATCGAC GGCCCACAGT GGCGTACACT GACGCTGAAC
GCCCGTACTG GTACCGATGG CTCACCCCGC GACTGGGGTA CGCTTGTGGG ACAGGGGATT
CAGAAAGACG CGAATGGTCG GAACATGGTA TATGGCAGCG GTGCAAACGC CGGTCTGTAT
ATCAAACAGG ATAACGTTGA GCTAGGCTCG GTACTGCCCA AGTTCAAAGG CGGCTGGTTG
AATACCTTCA GCTACAAGAA CGTGACCCTG CGCGTAAACA CTGACTTCGT TGTAGGTGGT
AAATTCTTCT CGACCACCAA AATGTTTAAT GCTTACTCGG GTTTGGCAGC TGAAACAGCG
GGTCTGAACG AATTGGGTAA GCCACTGCGT GATGATCCGG CTTCGGGCGG AGGTGTTTTG
CTGGATGGTG TAACCGAAGA CGGAAAGCAA AACACGACTC GTGTTGATGC ACAGAACCTG
TACGAAAACT GGCTGTTCGC CCTGAACGAG CGCTGGATTT ATGACAAAAC GTACGTGAAA
CTGCGCGAAG TTTCGTTCGG TTACAACCTG CCGAAGCAAA TGCTGGGCAA GTGGTTGAAG
TCGGCAAATA TTTCGTTGAT TGCCCGTAAC CCGGTTCTGA TCTACAGTGC CATTGGCGGT
GGTATCGACA TCTCTGAGTC GGAGACGATC TGGTACGAAG GTGGTCAGTT ACCTCCGGTT
CGCTCGTTCG GTGTAAATCT TAGATTAGGC CTCTAA
 
Protein sequence
MRKFLLTQFV LCLFALPLIA QDIAISGKVT SSEDGSVLPG VNITVKGTSR GTSTNAEGTF 
QLNAPANSRL VFSFIGFTTQ EIAIGNRTNI SVNLAPDASQ LQEVVVTALG ISRDKKALNY
AVQDLRADKL NFARDQNVGN ALAGKIAGVQ VLGQSGAKFG NPNIRIRGVN SLSGGDPLYV
VDGTPTDISQ VNMDDVENLT VLKGPSATAL YGNRASAGVI VITTKRAKAG ETRLDINHST
TLDMVALLPK YQNEYGGGYS QEWETFQFDP SIHPAAWSSF NGQKILDYSA DESWGPRMDG
SPHRSAFSWQ PGAEFGQLTP FSPQPNNVRD FFEKPISNNT NIAFSRGTEA FQSRISYTHI
INNGIIPNSS QSRDYVSAKN AIKFAEKLTA NLNFNYTSTN TKNQPADRYG SSGGTTPQNS
PLGISNSTLN GYNQTIGMFN QWFQRQLRIE DLRNYKNPDG TFRSWNIGGP LEAAPKYWDS
PYTQAYENTN TNRSDRLFGD IGLTYQFTPA LKASATVRRD QNAYYQQGRV AIGTLNEGQK
GGFSTLTSNS RENNYELLVN YNENFKNLSV VANAGGNIRY NRVDGLFQAT VGGLSAPGFY
NIAASIDRPL SNNYLYERRI NSVFGNVSVG FRDFVFVEAS IRNDWSSTLP KANNAYLYPS
VSAGVILTEL LPKSQVLSYA KVRAGYAQVG TDVGPYQTAL AYTSGQPYGS NATAFLPGTL
PNASLKPGLS SSYEGGIDLK FLNNRIGVEF TAYQNDNKNQ IIPLPVAPTS GYTNAVVNAG
LIRTSGLELH IYANPIRSAS GFNWEFDINA DRNRSQVIEL ANGLTNYQID GPQWRTLTLN
ARTGTDGSPR DWGTLVGQGI QKDANGRNMV YGSGANAGLY IKQDNVELGS VLPKFKGGWL
NTFSYKNVTL RVNTDFVVGG KFFSTTKMFN AYSGLAAETA GLNELGKPLR DDPASGGGVL
LDGVTEDGKQ NTTRVDAQNL YENWLFALNE RWIYDKTYVK LREVSFGYNL PKQMLGKWLK
SANISLIARN PVLIYSAIGG GIDISESETI WYEGGQLPPV RSFGVNLRLG L
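
Both sequences are printed in space-separated blocks of 10 characters, so they must be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction for comparison with the reported GC content. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace so the 10-character blocks form one sequence."""
    return "".join(text.split())


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGCGTAAAT TTCTATTAAC ACAGTTTGTC CTGTGTTTAT TCGCACTTCC ATTGATAGCT"

seq = join_blocks(first_line)
print(len(seq))                   # 60 bases on this line
print(f"{gc_fraction(seq):.2f}")  # GC fraction of this line only
```

Running `gc_fraction` on the full joined gene sequence, rather than on a single line, gives the value to compare with the GC content field.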