Gene Slin_6053 details


Gene Information

Locus tag: Slin_6053
Symbol:
ID: 8729834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 7344601
End bp: 7347621
Gene Length: 3021 bp
Protein Length: 1006 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003390814
Protein GI: 284040884
COG category:
COG ID:
TIGRFAM ID:
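
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is the only reading consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 7344601
end_bp = 7347621

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # compare with "Gene Length"
print(protein_length, "aa")   # compare with "Protein Length"
```

Under these assumptions the computed values agree with the listed 3021 bp and 1006 aa.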


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACAC TTGGTTACTT TTTAACGGTA ATCTGGCTTG TTGTTTCTGG TAGTATACAA 
GCTCAGGATC AGCAGGTTAC GATTACGGGG AAGGTTGTCA ACAAAGCGGA TAAGCAGGGC
CTTCCGGGCG TCAACGTTTT GGTACGCGGA ACAACAACCG GCACGGCTAC CGATGTAGAG
GGGAATTATA AGATCAATGT CCCCAGTGGT GCGGTGCTTC AGTTCAGTAT GATCGGGATG
ACTACTCAGG AAGTTCCGGT AGGCAATCAA ACAACCATCA ACGTTGAACT CGCCGACGAC
GCCCGCGCCT TACAGGAGGT TGTCGTAGTC GGTTACGGTA CGCAGCGCAA AATCGACGTG
ACCGGCTCGG TGGCGCAGGT GAAGGGCGAA GAACTGGTAC GGCAGCCGGT GCTGTCGGCA
ACGCAGGCCC TGCAAAGTAA AGTGGCAGGA GTGCAGATCA TCAACTCCGG CGCACCGGGG
CAGGCTCCCA CCGTCCGGAT TCGCGGAACA GGTACGCTAC TGGGCGGGGC CGACCCGCTC
TACGTGGTCG ACGGGATTAT TACCGAAGAC ATTCGGAACA TCAACACCTC CGATATTACC
TCGGTCGACA TCCTGAAAGA TGCCTCGGCA ACGGCTATTT ATGGCGTTCG AGCGGCCAAT
GGCGTCGTAC TGATTACGAC AAAACGCGGC AAATCGGGCG CACCTACTGT CAGCTATGAT
GCGTATGTGG GAGTTCGGAC GCCCGCCTAT CGGGTAAAAA TGGCCGATGC GCAGTTGTTT
ACGCAATACA ACAACGAAGC GGTTCGCTAC GACGATCCGA CAGCCACGCT GCCCTTCGAC
CCGGCGGCTA CCAATGCAGC CAACACCGAC TGGTTCAAGG CCATTACCCG GACGGGTATT
CAGCAGAATC ATTCGCTCTC TGTTAGTGGC GGCACGGACA AAACCACTTA TCTGTTCAGC
GCGGGTTATT TCAGCGAAAA AGGGATTCTG AAAGGAACGG ACTACAACCG CCTGACCCTG
CGCCTGAACA ACGAGTACAC GCTGTCGCCG GTGCTGAAAC TGGGACACAA CCTGAGTCTG
GCCAACGACA ACTCGGATCT GACGGGCACT TCCGACCCTG CCATACCGAG CACAACGGCC
TATTCTGCTT TTACGAATGC CTACAAACAG GCTCCGGTCG TGCCGATTCG TAACGCCAAT
GGAACCTACG GATTTACCGC AAGGAACAAC GTAGCGAATC CCGTTGCGCA ACTGGATTAC
ACCAATTACA ACGCGAAAGG ACTACGGCTA CAGGGGTCGT TCTACGGCAA CCTGACACTG
CTGAAAAAGA TCACGGTGCT GTCGAATTTC GGGATTGAAA GCAATGCCAA CCGAGCTTAC
AACTATGATC CGGTCTATCA GGTATCGGCC AATCAGCGCA ACCTGACGAG TTCATTGAAC
GTTGGCCGGT CGACCAGTTC GCGCTGGCTA TGGACCAACA CGGCCGAGTA TACCAACACC
TTTGCCAAAC AGCATACGGT GAAGATTCTG GGTGGGTATG TGGTCGAACG GTTCCAGAAC
AATATTCTAC AATCGGCCCG GCAGGATGTG CCGCCCCAGT CGAACTATTT TTACCTGAAC
ACGGGCAATG CCTCGTCGGC CACGAACAGC GAAGTGGGCA GCATTCAGAC CCGGCAGTCG
TACATTGGCC GGGTAAACTA TAACTTCGCC GACCGCTACC TGTTTACCGG TACGGCGCGG
TACGATGGGT CGAGCAAGTT TCCGACCAAC AACCGCTGGG GTTTCTTCCC CTCGCTGGGC
GCGGGCTGGG TCATCAGTGA AGAGCCGTTT CTGCGGGGCA AAACCCCTTT CGATCAGTTG
AAAGCGCGGG TAAGCTGGGG TAAAACCGGG AACGACCGCA TCGATCCGAG CGCGTTTCAG
TATACCATCG CCAGCGGATT GGATTATCCA CTCGGGCCTA ATCAAACCCT GCAACAGGGC
CGCACGATCA CCAACCTGAT CGACCCGAAT CTGCATTGGG AAGTAACGAC CGGTACGGAC
ATCGGTCTGG AGTTTGCCCT CGCCAAGAAC CGGCTGACGG GAGAATTTAC GTATTACAAC
AAACTCACCA GCGACGCCCT GTTCCAGCGG CCCATCGACG CCATTTTTGG TGATGTCGAC
GCGGCTTACC TGACCAATGC CGCCAGCGTA CGCAACCGGG GATTTGAATT TGCCCTGAAC
TGGCGCAATA ATTCTACCAG CGGCTTCAAC TACAACATCG GGGCAAACGC GACCATCAAC
CGCAATCGAC TGGAAGATGT GCAGGGCGGC CTGCCGATCA ATGAAGGCGG CATTGGCAAC
GGGCAGTACA CGACACGCAC GGCGGTGGGT CAGCCGGTGG GTAGTTTCTG GGTATGGCAA
ACCGATGGTA TTTTCCAGAC ACAGGAGCAG GTGAACAATA CGGCGGCTAA AATCCAGGGC
ACCAAACCCG GCGACTTCCG TTACGTCGAC CAGAACGGCG ATGGTGTTAT TGACGACAAC
GACCGCGTGT TTGTCGGCTC GTATCAGCCC AAACTGTATT TCGGTATCAA CGGCGGCTTC
ACCTACGACG GCTTCGACTT CTCGACCGAT TGGTACGCCA ACTTTGGCAA TAAAGTCTAT
AATGGTAAAA AGGCGCAACG GTTTGGTAAC GAAAACATCG AAGCCTCCCG CGCCGACCGC
TGGACGTCAA CCAACCCCAG TAACACCGAG CCACGCGCCA GTAATTCCGT GCCCATTTCG
TCGACCTATT ACGTCGAATC GGGGTCGTTC TTCCGAATCA ACAACATAAC GCTGGGCTAT
ACCTTGCCGA AAGAGCTGGT GTCGTCGCTG AAAGTGAACC GGGTGCGGTT TTACGTGACG
GCTCAAAATG CGCTGACGAT CAAAGCGTTC TCCGGATATA CGCCCGAACT GCCTGGCAGT
AATCCGCTGA ACGCCGGTAT CGAGCTGGGC ACCTATCCGG TCACGTCGGC TTATTTGGCA
GGTCTGAATA TTGGCTTTTA A
 
Protein sequence
MKTLGYFLTV IWLVVSGSIQ AQDQQVTITG KVVNKADKQG LPGVNVLVRG TTTGTATDVE 
GNYKINVPSG AVLQFSMIGM TTQEVPVGNQ TTINVELADD ARALQEVVVV GYGTQRKIDV
TGSVAQVKGE ELVRQPVLSA TQALQSKVAG VQIINSGAPG QAPTVRIRGT GTLLGGADPL
YVVDGIITED IRNINTSDIT SVDILKDASA TAIYGVRAAN GVVLITTKRG KSGAPTVSYD
AYVGVRTPAY RVKMADAQLF TQYNNEAVRY DDPTATLPFD PAATNAANTD WFKAITRTGI
QQNHSLSVSG GTDKTTYLFS AGYFSEKGIL KGTDYNRLTL RLNNEYTLSP VLKLGHNLSL
ANDNSDLTGT SDPAIPSTTA YSAFTNAYKQ APVVPIRNAN GTYGFTARNN VANPVAQLDY
TNYNAKGLRL QGSFYGNLTL LKKITVLSNF GIESNANRAY NYDPVYQVSA NQRNLTSSLN
VGRSTSSRWL WTNTAEYTNT FAKQHTVKIL GGYVVERFQN NILQSARQDV PPQSNYFYLN
TGNASSATNS EVGSIQTRQS YIGRVNYNFA DRYLFTGTAR YDGSSKFPTN NRWGFFPSLG
AGWVISEEPF LRGKTPFDQL KARVSWGKTG NDRIDPSAFQ YTIASGLDYP LGPNQTLQQG
RTITNLIDPN LHWEVTTGTD IGLEFALAKN RLTGEFTYYN KLTSDALFQR PIDAIFGDVD
AAYLTNAASV RNRGFEFALN WRNNSTSGFN YNIGANATIN RNRLEDVQGG LPINEGGIGN
GQYTTRTAVG QPVGSFWVWQ TDGIFQTQEQ VNNTAAKIQG TKPGDFRYVD QNGDGVIDDN
DRVFVGSYQP KLYFGINGGF TYDGFDFSTD WYANFGNKVY NGKKAQRFGN ENIEASRADR
WTSTNPSNTE PRASNSVPIS STYYVESGSF FRINNITLGY TLPKELVSSL KVNRVRFYVT
AQNALTIKAF SGYTPELPGS NPLNAGIELG TYPVTSAYLA GLNIGF
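
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard amino acid assignments, which NCBI translation table 11 shares with the standard code (the two differ only in which start codons are permitted), and it uses only the first line of the gene sequence as input. The helper names `clean` and `translate` are hypothetical and not part of any database tooling.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: translation table 11 uses the same amino acid assignments
# as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the 10-character grouping spaces and line breaks."""
    return "".join(seq_text.split()).upper()


def translate(seq_text: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(seq_text)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGAAAACAC TTGGTTACTT TTTAACGGTA ATCTGGCTTG TTGTTTCTGG TAGTATACAA"
print(translate(first_line))  # MKTLGYFLTVIWLVVSGSIQ
```

The 20 residues produced from these 60 bases match the first two groups of the protein sequence listed above.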