Gene Slin_2412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2412 
Symbol 
ID8726156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2909307 
End bp2912396 
Gene Length3090 bp 
Protein Length1029 aa 
Translation table11 
GC content54% 
IMG OID 
ProductTonB-dependent receptor 
Protein accessionYP_003387231 
Protein GI284037301 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.525644 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATTTT CTACCACTAG TTCAGCCCAG TCAAAAATCG GGTGGCTCGT AGGCCTGCTG 
CTCGTACTGG AGCTGTCTGC GCTGGCCCAA TCGAACCGGC CCGTTACGGT CAGCGGTCTG
GTTACCTCAG CCGAAAGTAG CCAGGGAATA CCCGGCGCCA ACGTTATGGT TAAAAACACC
CAGCAAGGCA CGACGACCAA CGCCAACGGG GAGTTTACGC TGGCAGCCCC GGCGGGTTCA
CTTGTGCTGA TCGTATCGTC GATTGGCTTC CAGACACAGG AAGTACCGGT TTCGGGCCAG
AGCAAGCTAA CCATCGCCCT GCAAGCCGAT AATCGTTCAT TGAACGAGGT CATCGTTGTC
GGCTACGGTA CGGTAAAGAA GAGCGACTTA ACCGGCTCGG TTTCATCAGT GAGAGCCGCT
GAACTAAAAC AAACTCCGAT TGCCAACTTC GTTCAGGGTC TTCAGGCCCG GGCTTCGGGG
GTGCAGGTTA CACAGAACTC AGGCGCACCG GGGGGCAGCA TCAGCGTTCG CATTCGGGGC
AATAACTCGA TTAGCGGCAG CAGCGAACCG CTCTATGTAG TCGACGGATT CCCCATTGCC
GGGGGCGACA ACCCGGTGGC GGGGGGCGGC AGCGGACTCG GAAATGACAA TGGCAACCGG
CTCTCGGTGC TTTCTACGTT GAACCCCAAC GATATTGAGT CGATGGAGGT GCTGAAAGAT
GCCTCGGCCA CCGCTATTTA CGGCACGCGT GGGGCCAATG GCGTAGTGTT GATTACGACC
AAACGGGGTA AATCCGGCAA AACCCGCGTG AGCTATGACG GCTATTACGG GCAGCAGCAA
ATCCGAAAAA CGCTGGATGT AATGAACGCC ACGCAGTTTG CCAAGTACGA AAACGAAATT
ACCGGTACGC AACTTTATCC CAATCCCGAT CAATTAGGGC AGGGTACCGA CTGGCAGTCG
CTTATTTTCC GCAAAGCCCC TATGCAGAGC CACCAGCTAT CCGTATCGGG CGGCAACGAA
CGCTCGCAGT TCGCGCTGTC GATGAATTAT TTCGATCAGG ACGGGATCAT CATTAACTCC
AACTTCAAGC GGGGATCGGT GCGGGTCAAT CTGGATAATA CGATCAGTAA AAACCTTAAA
ATAGGCACGA GCCTGACCTA TACCTACTCC GTCAACAATG GAGCCATTAC CGCCACGCTG
GGCGATGGTG GTCCGGCGGG GGGAATTATT TTGTCGGCAC TTACGGCTCC GCCCGTCTTT
TCCCCCTACA ATGCCGACGG TTCACCGACT ATTTTCACAA ACCGCTACCT GGACCTCAAC
AACCCCGTTG CGCTGGCTAC GGAGGTGATG AACCGGAACA CAACCCGCCG TTTTCTGGGT
AACATCTTTG CCGACTGGAC CATTACCAAT GGCTTAACTT ACCGGGCTTC GTTTGGGGGC
GATCTCGTTA CGGACACCCG CGACTCATAC GTTACCCGCA ACATCCGGGC GGGTTCGCAG
GTGAACGGCA TTGGCGGAAA AGGGAATGCC AACACCAATA CGGTACTGCA CGAGAGTCTG
CTGAATTACC ATCGGCTGTT TGGTGTGCAT GATGTAAACG TGACCGGCGT GTTTTCGACC
CAGGGGCAGC TACAAACTGC CGATGCCATG ACCGGTCAGC AATTTCCCAA CGACCTTGTG
CTGAACAATA ACCTCTCGCA GGCATCCATT TTAACCATTG CCAGCAACAA ACAGGCGTGG
CGTTTAGACT CGTACACGGG GCGGATCAAC TACAATTACA AAAGCAAATA CCTGCTCACG
CTGACGGGCC GGGTAGATGG CTCAAGCCGG TTTGGCGATA ATAACAAGTA CGGATTTTTC
CCCTCGGTAG CGGGGGCATG GCGAGTATCG GAAGAAGGAT TCATGCAGGG GCAGCAGGTG
TTGAGCGACC TGAAACTCCG GGCCAGTTAT GGCATAACGG GCAACGCCGA TATTCCACTC
TACAACTCTT TATCGCGACT CAACTCGGTT GGAAATTACA ACTTCAATAA CGTGCGTACC
ATCGGTATTG CCGCAGCCAA CATCAGCAAT CCCGACCTGA AATGGGAGAA AAGTGCACAG
GCCGATATTG GCCTGGATTT CGGCTTGCTC AACAACCGGA TTCAGGTAAC AGCCGATGTG
TATTACAAAA AAACGACCGA TCTGCTGCTG TCGCGTACCA TTCCACTCTC GTCGGGTTTC
GGGTCGGTAT TCGGCAACTT TGGTTCGGTC GAGAACCGTG GCATCGAAGT AACCGTCAAT
GCGGGCGTAC TGAATGGTCC GCTGAAGTGG GATATTAACG GCAACATCTC GGCCAATCGC
AACAAGCTCA CGCTCATCGA CGGGACCCGT ACCGAAATCA TTCCCGGTGG TGGCGATGCC
TCTATTGGTG CTTTTACCAA CAACAGCATC CTGCGGGTAG GTGCGCCCAT CGGCTCGTTC
TATGGCTATG TGTTCGATGG CATTTACCAA ACCGGCGACA ACATCCCGAC GGGGCGTATT
CCGGGCAACA TCCGGTATCG TGATCTGAAT GGCGACGGGG TCATTTCGGG AGCGGATCAG
ACCATTATCG GTAACCCGAA CCCGAGCTAT ATTTTCGGAA TCAACAACAC CCTGAAGTAC
AAAGGCTTCG ACTTAAGCTT ATTCGTGCAG GGTGTACAGG GCAACCAGAT CTTTGCCGTT
TCGAGAGTCC GGCTGGAAGC CGGGGCGGGT GCCATCAATC AGTATGCCAC CTACGTAAAC
CGCTGGACAT CCACCAATCC ATCGAACCAG TACCTGAAAG CCTCGACGGG GCAGCGGGTA
AACCAGTCGG ACATTCACAT CGAAGATGGT TCGTTTGTCC GCTTCAAGAA CATAACCCTC
GGTTACACGA TTCCTGCCGC TGGCAAACTG GCCTGGCTGG CGAATTCGCG GGTGTATGTG
AGTGCCAACA ATTTTGCTAC CCTGACCAAC TATTCGGGCT ATGATCCGGA GGTGAACACC
GCAGGGCAGA ACAACCTCAA CCTGGGTGTA GACAACATCG GATTCCCCGT ATCCAAGTCG
TTCATTGCCG GTCTTCAACT CAACTTCTAA
 
Protein sequence
MPFSTTSSAQ SKIGWLVGLL LVLELSALAQ SNRPVTVSGL VTSAESSQGI PGANVMVKNT 
QQGTTTNANG EFTLAAPAGS LVLIVSSIGF QTQEVPVSGQ SKLTIALQAD NRSLNEVIVV
GYGTVKKSDL TGSVSSVRAA ELKQTPIANF VQGLQARASG VQVTQNSGAP GGSISVRIRG
NNSISGSSEP LYVVDGFPIA GGDNPVAGGG SGLGNDNGNR LSVLSTLNPN DIESMEVLKD
ASATAIYGTR GANGVVLITT KRGKSGKTRV SYDGYYGQQQ IRKTLDVMNA TQFAKYENEI
TGTQLYPNPD QLGQGTDWQS LIFRKAPMQS HQLSVSGGNE RSQFALSMNY FDQDGIIINS
NFKRGSVRVN LDNTISKNLK IGTSLTYTYS VNNGAITATL GDGGPAGGII LSALTAPPVF
SPYNADGSPT IFTNRYLDLN NPVALATEVM NRNTTRRFLG NIFADWTITN GLTYRASFGG
DLVTDTRDSY VTRNIRAGSQ VNGIGGKGNA NTNTVLHESL LNYHRLFGVH DVNVTGVFST
QGQLQTADAM TGQQFPNDLV LNNNLSQASI LTIASNKQAW RLDSYTGRIN YNYKSKYLLT
LTGRVDGSSR FGDNNKYGFF PSVAGAWRVS EEGFMQGQQV LSDLKLRASY GITGNADIPL
YNSLSRLNSV GNYNFNNVRT IGIAAANISN PDLKWEKSAQ ADIGLDFGLL NNRIQVTADV
YYKKTTDLLL SRTIPLSSGF GSVFGNFGSV ENRGIEVTVN AGVLNGPLKW DINGNISANR
NKLTLIDGTR TEIIPGGGDA SIGAFTNNSI LRVGAPIGSF YGYVFDGIYQ TGDNIPTGRI
PGNIRYRDLN GDGVISGADQ TIIGNPNPSY IFGINNTLKY KGFDLSLFVQ GVQGNQIFAV
SRVRLEAGAG AINQYATYVN RWTSTNPSNQ YLKASTGQRV NQSDIHIEDG SFVRFKNITL
GYTIPAAGKL AWLANSRVYV SANNFATLTN YSGYDPEVNT AGQNNLNLGV DNIGFPVSKS
FIAGLQLNF