Gene Slin_4954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4954 
Symbol 
ID8728718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6038101 
End bp6041649 
Gene Length3549 bp 
Protein Length1182 aa 
Translation table11 
GC content55% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003389731 
Protein GI284039801 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0461208 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTGTT CGTTCTTTCT CGGCTGGATA ACGGCCCTTC TGCTTCTGGG ATCGACCCGG 
ATGGCTCATG GACAAACGGC CGCCGTACTG ACCGGCTCCG TAACGGATGC CACGACGGGA
AAACCCATGC CGTTTGCCAA CGTCTACGTA AATGGATCGA CGCGCGGGAC AATCACCGAC
GAAAAGGGGG TTTATACGTT GACGGGCGTG CCACTGGGTA CGGTCGAAAT AGCCGCTTCC
TTTGTTGGCT ATCAACCCGC CAGGCAAACA ATCCGATTCG ATAATACAAC ACCGCAGAGA
GCAAATTTCC AGCTGAAACC GAGTGAACAA ACGCTGGAAG CTGTAACCGT ACGGGGCAAT
CCGAAGCAAT GGGAACGACA GTTGCGGCAG TTTAAAAAGC AACTCTTCGG CGAGCCGTTT
GGCGGCCAGT GTCTGCTGGT CAACAGCGAA GTGATTAGTT TTAACGAAGC CAAAGATCAT
CTGTATGCAA CGGCTAACGA GCCGTTGATT ATTGAGAATC AGGCGCTGGG CTATCGGTTG
ATTTACGCGT TGCAGCATTT CGATGCGACT TCATCCGGAA ATGTTTATTC GGCGGGTACG
GCCCGTTTCG AAGAACTAAA ACCCCAGAAC GAGCGGCAAG CCAACCAGTT TCGGCGTAAC
CGGTTGACCG CCTACAAAGG CTCAATCCGG CACCTGATGG CCAGCCTGGC GGATAATACG
TTCAAAGAGG CAGGTTTTCT GGTCTACCAG GAAGATGTTA CGAAGCCGAT TTTCCTCGAA
AGTCGGTCCA TTACGCTGGC CGCTGCGGTT AGTGACTACA AGCGTCTGAT TCCAGTTAAG
GTGTCTACGC TGATTCAACC CGGTCGATTA GCGACGGAAC GACGGCTGGT TTCGCCCATG
AAACTGATTG TGTTCTATAC GAATGCGGTC TCGGGTTTCT CGCCCTATCC CGATGCCCGT
TATGCCTACA CGGAAATGAC CTTGCCCCGC AGTCAGCTGC AACTGACCGT CGATGGGATT
ATTACGATGC CGGAAGGGAT GGAAGCGAAA GGCTCGATGG GGAACGACCG GCTATCGACC
ATGCTGCCCG CCGACTGGAA ACCCGATGGA ACTCAGAAGC AGACGCTTAC GAACGACCCG
CTGGCTACCC AGGGCAAACT TACCCTACCC GACGCCCGCC TGGAACGAAT AACGACGGCC
TTCAACGAAC GATTCAAATT ACTGGCTCCC AACGTGTTTG TACACATCGA CAAACCGGTT
TACGCCACCG GCGATCGGCT CTGGATGAGT ACGTACTTCC TCGACGCAGC CAATCACCAG
CGAGCCAGTG GCGAAACGGC CCTGCATGTC GACCTGCTTT CGTCAACCGG CAAGCTTGTG
CAGCACCAAT GGGTACGCAT CGTTGACGGA CGGGGCGAGG GTAACTTCCG CCTGTCGGAT
ACCCTCCTGA CGGGAACGTA CCGCCTGCGC GCCTATACCG ACGAAGATGA TGCCCAGCGA
CGCCCGGCTT TCGAGCGGTC GGTGGCAATC TATAATCTGT TTCGGAACGA GGTATCCATA
CGGAGTGATT CGGTTTCTCA GCCAGTAGAT GTACAGATTT TGCCGGAAGG TGGTCGCTGG
ATTACGGGAC TACCCACCCG TCTGGGCGTA AAAATCATAC AGCCCAACGG GCATGGCTTG
CTCATCGCGG GGCGTATTGT CGACGACCAG GGCGCTGAAG TTGCTCGTTT CAGGACCAAT
CCGCAGGGAA TGACCAGCGT ATCGATGGAG CCTAAGCCGC AGCGGACCTA TTACGCCGAC
ATTGTGTATA ACAACCAGCC GCAGCGTGTT CCGTTGCCCA AACCCGAAAC GGAAGGGCTG
TTGCTTTCGG CCGATGCAAT CAGCGATACA ACCCGTCTGG CGTTGACCAT ACTAAGCACT
AACCGGGCGG TTATCGACTC AGTGTATATC CTGATTCAAC AACACGGCCG GGTGGTCGAT
CAGCGAAAAC TTCTCCTGCA AAACGGGGTG GCGAAGTTAA GCCTGCCCAT GATGAGCTGG
TTACCGGGTC TTACGCAAGT CACCCTGTAC GACGCCACCG CTAAGCCGCA AGCCGAACGC
CTGGTTTTCG TGCCCGAATT CGTGGCACCG GTCCGTGTGT TGATGGGCAT AAACAAAACC
CGATATCAGC CCCGCGAGCA AGTTAGCCTG AGCGTCAATC TGACCGATAA CGGATTACCG
GCACTGGCCG CTTTATCAGC GTCAATTACC GATGCTGACC AGGTGCCGGA TGATACCGCC
GAGGCTACGT TACCCGCACA CCTTCTGCTG ACGGGAGAGC TTCGGGGACG TATCGAAAAC
CCCAACCAAT ACGTTGCGAA TCACTCGGCA GAAACCCGTC GAGCGCTGGA CGACCTGCTA
CTGACGCAGG GCTGGCGACG CGTAACAGGC ACGCCCGAAA CGGAACGACT CGGTGGTGTA
TCACTCAGGG GACGTATTGT AAATGCGAAA AATCAACCCA TTTCCGGCGC ACAGCTCATG
GTAGCACCAA CGGCCGGACA ATCGTCCATA AAATCGGCTG GGGTTGATGA ACAGGGGCGC
TTCCGGCTGG CGGGGCTGGC CATTGCCGAT ACGGTACGGT TACTGACGCA AATCACGGAT
CGTCAGTTTA AAAATATCCC AACCAAAGAC GCCATTCTAG TTCTGGAAGG GCCGGGTAAA
TTGTGGGAAT ACGCTAAGCT ACCGGTTGCG CCAAACGGGC CGACGTTGCG GTCCCAGTTG
GAAGCGGCCC GCATTCGACA GGAAGCAAAC GCCGGTTTTT ACCGTGACAA AACAGCAAAA
GTGCTAAAGG AGGTGACCGT TCGAGCACAG AAATTCGATA AAAGACCCGA GGATATTCAA
CAGCGTAGTT TACACAATGA GGCTGATGCG GTTCTAGTGG TCGACGAGAA ATCGCCCGCT
TACCAGAATC TATACGAAAT GATTCAGGGA CGATTGGCAG GCGTGACAGT TACCCGAGAA
GGGGCCGCTG TTGCGCGTAG CTACAAGGTG TTTATACGGG GCGTGAACAG TTTCAAAAGT
GGTATACAAC CCTTGTATCT GATGGATGGT ATACCCATAG AGGACCCCGA TGGAACCGCT
TTGCTATTTT TCAGTACAAG TGATATTGAA CGGATTGAAT TGCTCAAAAG TGCGATTACA
ACGGGTACCT ACGGAGCTCG GGGCGGAAAC GGGGTGATAG CTTTTTACAC CAAAAGCTAT
CGATCCATGC AGGGAAGTGA GAAGCTTAAG GAGGGAATGA CGCCGATTCA ATTTATTGGC
TACCCGTCGG TGCAGCGCGA GTTCTACGTA CCGCACTACG ATGCGGAGGC AACCACCAGC
CCGATTTCCG GCCCCGTCGA TAACCGGGAT GTATTGTATT GGAAACCAAT GATGCAAACC
GACAGTCAGG GACGCAGCCA GCTACGTTTT CCGCTGTCGG ACGTCGTTCG GACGGTACGC
GTGGTGATAC AGGGTGTTAC GGCCGATGGT CGTCCGGTGC TGGGCGTTCA GCTAATCCGG
GTTCAATAA
 
Protein sequence
MRCSFFLGWI TALLLLGSTR MAHGQTAAVL TGSVTDATTG KPMPFANVYV NGSTRGTITD 
EKGVYTLTGV PLGTVEIAAS FVGYQPARQT IRFDNTTPQR ANFQLKPSEQ TLEAVTVRGN
PKQWERQLRQ FKKQLFGEPF GGQCLLVNSE VISFNEAKDH LYATANEPLI IENQALGYRL
IYALQHFDAT SSGNVYSAGT ARFEELKPQN ERQANQFRRN RLTAYKGSIR HLMASLADNT
FKEAGFLVYQ EDVTKPIFLE SRSITLAAAV SDYKRLIPVK VSTLIQPGRL ATERRLVSPM
KLIVFYTNAV SGFSPYPDAR YAYTEMTLPR SQLQLTVDGI ITMPEGMEAK GSMGNDRLST
MLPADWKPDG TQKQTLTNDP LATQGKLTLP DARLERITTA FNERFKLLAP NVFVHIDKPV
YATGDRLWMS TYFLDAANHQ RASGETALHV DLLSSTGKLV QHQWVRIVDG RGEGNFRLSD
TLLTGTYRLR AYTDEDDAQR RPAFERSVAI YNLFRNEVSI RSDSVSQPVD VQILPEGGRW
ITGLPTRLGV KIIQPNGHGL LIAGRIVDDQ GAEVARFRTN PQGMTSVSME PKPQRTYYAD
IVYNNQPQRV PLPKPETEGL LLSADAISDT TRLALTILST NRAVIDSVYI LIQQHGRVVD
QRKLLLQNGV AKLSLPMMSW LPGLTQVTLY DATAKPQAER LVFVPEFVAP VRVLMGINKT
RYQPREQVSL SVNLTDNGLP ALAALSASIT DADQVPDDTA EATLPAHLLL TGELRGRIEN
PNQYVANHSA ETRRALDDLL LTQGWRRVTG TPETERLGGV SLRGRIVNAK NQPISGAQLM
VAPTAGQSSI KSAGVDEQGR FRLAGLAIAD TVRLLTQITD RQFKNIPTKD AILVLEGPGK
LWEYAKLPVA PNGPTLRSQL EAARIRQEAN AGFYRDKTAK VLKEVTVRAQ KFDKRPEDIQ
QRSLHNEADA VLVVDEKSPA YQNLYEMIQG RLAGVTVTRE GAAVARSYKV FIRGVNSFKS
GIQPLYLMDG IPIEDPDGTA LLFFSTSDIE RIELLKSAIT TGTYGARGGN GVIAFYTKSY
RSMQGSEKLK EGMTPIQFIG YPSVQREFYV PHYDAEATTS PISGPVDNRD VLYWKPMMQT
DSQGRSQLRF PLSDVVRTVR VVIQGVTADG RPVLGVQLIR VQ