Gene Slin_3169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3169 
Symbol 
ID8726922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3838540 
End bp3840981 
Gene Length2442 bp 
Protein Length813 aa 
Translation table11 
GC content53% 
IMG OID 
ProductTonB-dependent receptor 
Protein accessionYP_003387979 
Protein GI284038049 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTACGCT ACTGTATACT CATTTTTCTG TTACTCGCAG GCCGAATAAT CACCGGACAA 
TCGTTGTCTG GCGCAGTACT TGACGCTTCC CGCCAACCCG CTCCTTTCGC TGTGGTTAAA
CTGCTGAGAG CTACTGATTC TACACTTGTG AAGGGAACGA TCACGAGCGA AATGGGGCAG
TATTCATTTA CGGATGTCAT CGATGGGAAC TACTATGTAC AGGCATCGGG GGTGGGAATG
GCAAGCGCTA CGAGTGCGAT GGTAGCTGTT ACAGCCGGGA GGCCAGTTAA TGTGGCCCCA
CTGCTACTTG TAGCCGTTGA GCAAACGCTG AGCAGTGTTG TGGTGCGTGC CAAGCGAGCT
CTTGTTGAGC AACAGGTTGA TAAGACGGTG CTGAATGTTG CGGCAGATGC TACGGCACAA
GGCAAGACGG CCTATGAACT GCTCCAGCAG GCACCCAGCG TCGTTATTGA TCCGAATGAT
AACATACGAA TGGCGGGCAA GCAGGGTGTC AACGTATTTA TCGATGGTAA ACCGACAAAC
CTGTCCGCCA ACGATCTCGC CAATCTGCTC CGGGCAACAC CCGCGTCGAG CATTGATAAG
ATAGAGTTGA TTACCAACCC CTCCGCCCGA TTCGATGCCC AGGGGGGCGC CGGTATCATT
AACATCCGGT TCCGGCGCGA CAAAAGCCTT GGGCTGAATG GAAATGTCTC TGCCGGATAC
GGCCAAAGCG ATCACCATCG GGCCAATGCT GCCGTCGACC TGAACTACCG CACCCGGCAG
CTGAACCTGT TTAGTAATGT AGCAGTAAGC GATAATTATC AGATTACCAA TGTCCGGCTC
GACCGGCAAA CGGGAAGTGG GCAATTTCTC CAGCGTGGCT ACGACACCGA CGGCACAAGA
GCCGTTGTTT ATAAAGCCGG TGTCGACTTC ACCATCGATA GCCACCAGAC CGCGCCAAGG
CAAACGATTG GGCTGATCGT GTCGGGAAAT ACGTCGGATA ATCGGTTCGG CACATTCACC
ACTACCCAAC TGATCAATAG CCGGAATAGG CTGGATTCGA GTATTGTGAA CCGGGCAACG
AACGGAACGT CGGCTCAACC TGCCCATAAC AACCGAACCA ACGCGTCTCT GAATTACCGC
TATGCCGATA CGCTGGGACT GGAACTGAAC CTCGATGCCG ATCTGACCTA CTTCCAGAAC
ACATCGCCCA ACCTGATCAC AAGCGATTAC TATAACGCCG ACGGCCAATC CCTTTTCCGA
CCTCAGCGCC GGTTCGATGC CAGTACAAAC ATCAAAATTG GTACGTTGAA GACTGATTTA
GTAAAGGAGT GGAAAGCGCT TCATCTTAAA CTCGAAACAG GGCTGAAACA TACAGATGTC
TCGACTGATA ATGATTTGCT GGCATTTACG GGATCAGCGC CCGAACAGCC GGATGTGAAC
CGGACCAATC GGTTTACCTA CCGGGAAATT GTGAATGCCG CTTATGCTTC GCTGAATCAT
TCAGCGGGCA AATGGTCGGT GCAGGGTGGC CTGCGAGTCG AGCATTCCAA CGTGAACGGT
CGGTCGACCG ATCTGTTTGA GCGAACGATT CAGCGACCGG ATACGACCTA CCTGAACCTA
TTTCCGACGG CGTTTGTGCA GTACCGCGCT ACCGACAACA GCCAGTTGGG TGTCAATTAC
GGACGGCGGA TCGGGCGACC CAGTTATCAG GACATGAACC CTTTTATTTA CCAGATCGAC
CCCTATACGA GCCAGCGCGG GAATCCCTAT TTACGTCCCA CCTACACCCA CACCCTTGAA
GCCAGTTATA CGTACAAGTG GGCGTCGACG GTAAAGCTGG CCTACAGTCA TACCAATGAT
TTTACGACGG ATGTGATTCG GCAGGAGGGA CTGACCGCTT ACCAGACGGT TGCCAATGTG
GGTCAGGTCG ACGCGCTGAA CCTGTCGGTC AGTACACCGT ATCAGTTTAC GAAATGGTGG
AGTACCTATA CCTATGCCGG GGCCACCTGG AACCGGTTTC GGGGAAGCCT TTCGCCAGCG
GAGCGTTTCG ACCAGCGGAC GTTTGCTTTC GAAGGCTATA TGCAACACTC CTTTACGATC
TCGAAAATCT GGTCGGCACA GGCGTCGGGC TTTTGGAGTG CCCCTACGCA GCAGACGATT
TACCGCATTG GTGGGCTTGG GGCGTTGAAC CTGAGTGTGC AGAAGAAAGT CATGCAGGAG
CGGGGTAAGG TCACATTCGG TGTGGATGAT GTGTTGAACA CCATGCGCTG GAGACAGTCA
GCTGATTTTC AGACCCAGCA GTTTGCCATT GATCGTAAAT GGGAGAGCCG CCGGGTCACT
ATCCGGTTCA GCTATCAGTT TGGCAGCAAA GACATCAAAG CCGCCCGCGA ACGAGAGACT
AACAGCGATG CCGGTCGAAT TAAAGTAAAA GGGAATCCAT AG
 
Protein sequence
MVRYCILIFL LLAGRIITGQ SLSGAVLDAS RQPAPFAVVK LLRATDSTLV KGTITSEMGQ 
YSFTDVIDGN YYVQASGVGM ASATSAMVAV TAGRPVNVAP LLLVAVEQTL SSVVVRAKRA
LVEQQVDKTV LNVAADATAQ GKTAYELLQQ APSVVIDPND NIRMAGKQGV NVFIDGKPTN
LSANDLANLL RATPASSIDK IELITNPSAR FDAQGGAGII NIRFRRDKSL GLNGNVSAGY
GQSDHHRANA AVDLNYRTRQ LNLFSNVAVS DNYQITNVRL DRQTGSGQFL QRGYDTDGTR
AVVYKAGVDF TIDSHQTAPR QTIGLIVSGN TSDNRFGTFT TTQLINSRNR LDSSIVNRAT
NGTSAQPAHN NRTNASLNYR YADTLGLELN LDADLTYFQN TSPNLITSDY YNADGQSLFR
PQRRFDASTN IKIGTLKTDL VKEWKALHLK LETGLKHTDV STDNDLLAFT GSAPEQPDVN
RTNRFTYREI VNAAYASLNH SAGKWSVQGG LRVEHSNVNG RSTDLFERTI QRPDTTYLNL
FPTAFVQYRA TDNSQLGVNY GRRIGRPSYQ DMNPFIYQID PYTSQRGNPY LRPTYTHTLE
ASYTYKWAST VKLAYSHTND FTTDVIRQEG LTAYQTVANV GQVDALNLSV STPYQFTKWW
STYTYAGATW NRFRGSLSPA ERFDQRTFAF EGYMQHSFTI SKIWSAQASG FWSAPTQQTI
YRIGGLGALN LSVQKKVMQE RGKVTFGVDD VLNTMRWRQS ADFQTQQFAI DRKWESRRVT
IRFSYQFGSK DIKAARERET NSDAGRIKVK GNP