Gene Slin_1571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1571 
Symbol 
ID8725305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1894943 
End bp1897357 
Gene Length2415 bp 
Protein Length804 aa 
Translation table11 
GC content53% 
IMG OID 
ProductTonB-dependent receptor 
Protein accessionYP_003386419 
Protein GI284036489 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.579738 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0739696 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGGC TTTTAGGTTT AATTCTCGGT TTATTGCCTT TTTGGACATC TGCTCAGACG 
CTATTGGTGC GTGATAAAAC TACGCTTCAA TCCATTGAGA ATGTGGAGGT CAGAAGACTG
TCGCCGGGGG CATCACAGCC GATTTTTACA GACCGTTCAG GTCAGGCAGA TGCATCAGCG
CTAACCGGTA CCGACAACGT TGTTTTCCGC CGGGTGGGTT ACCAGACCGT TCGGTATTCA
ATGGAGCAGC TTCGGACGCT GAATTTTACC GTGCTGATGG CCGAAAAGCA ACTGGCAATC
AATGAGGTCG TCGTGGCCGC TAGCCGTACT ACCGAGTCGC TTTTGAAAGT GGCTCAGCCT
ATTCGGGTCT TTACCCGGAA TGAGCTGCGC TTTCTGAATC AGCCAACCAT GGCCGAGGTG
TTGCAGCAAA GTGGTCAGGT ACTGGTTCAG AAGAGCCAAT TGGGTGGTGG TAGCCCGATT
CTTCGTGGTT TTGAAGCCAA TAAAGTGCTG ATGGTTGTCG ACGGCGTTCG GATGAACAAT
GCCATCTTTC GGGGAGGGCA CCTTCAGAAT ATCCTGACCA TCGACAATGC CGCTGTCGAA
CGGATGGAAG TCGCGCTGGG GCCAGGATCG GTGGTGTACG GCAGCGATGC GCTGGGCGGT
GTTATCTATG TACAGACTTT ATCGCCCAAA CTGAGCGTGT CCGAAAACAC GGCCGTCAAC
GCCAATGGCT TTGTTCGATA CGGCAGTGCC ATGAACGAAA AAACCGCTCA CGCCGACTGG
AACCTTGGCT TTCGGAAGTG GGCGTTGACA ACCAGCGTAA CCGGTTCGGA CTTTGGTGAT
CTCCGGCAGG GAAAACAGCG GAACGCCGAT ATGGGACAAC TGGGTTTACG GCCCTTCTTC
GCAGGGTTTG AAAACAATAC GGATGTGAAG ATCACCAATC CTGACCCGCT GGTTCAGACA
CCGTCGGGCT ATAAACAGAT CGATCTATTG CAAAAAGTGT TGTTTCAGCC GAATGAACGG
ACGCAGCATT TGCTGAACGT TCAGTTTTCG ACCAGCAGTG ATATTCCGCG CTACGACCGG
CTCACGGAAG TCGACGCAAA AGGGAATCCG AGCCATGCTC AGTGGTATTA TGGGCCGCAG
AAACGCCTGT TAACATCGTA CGGTCTGACC AAACAGTTTA CTTCCGGTAT AGCCGATGAA
CTCAAATTGA TTGCCGCTTA CCAGTCAATA GAAGAAAGTC GGCATAACCG TCGCTTCGGA
AATTACGGAT TGCAGCACCG AACGGAAAAC GTGAATGTCT GGACGCTGAA CGCCGATTTG
AAAAAGAAAC TAGCCGACTC GCATACCCTG CGCTACGGCC TGGAGGGAAC CTACAACACC
GTTCAGTCGA CGGCGTACCG ACAAAATGTA CAGACCGGAA AAATAGACCC GCTGGACACG
CGCTACCCCG ATGGCGGAGC CAATACCCAG TCGTTGGCGG GGTATGTGTC GGGAACGCTG
GACGTGAGCA CTCGTTCCAC ACTGACCTAT GGCGCCCGCT ATGCCTATAA TCGATTGTAC
GCGAAATTCA ATGACAAAAC ATTCTTCCCG TTTCCGTTCA ATGATATCAC CCAGCAGTCG
GGTGCCGTTA CGGGTAGTCT TGGTTGGGTA ACGCGCCTGC AGGGAGAGTG GCAACTGGCC
ACGTCGGTTT CGTCGGGGTA TCGCGTGCCG AATGTGGATG ATCTGGCCAA AGTGTTCGAG
TCGGTGGCCG GAAATCTGAT CGTTCCCAAT CCCAATCTGA AACCGGAGCG CACCTACACC
TTCGATGCCG GTGTTCGCAA GCAGATTGCC GAACGCGTTT CGTTCGAAGC AGAAGGCTTT
TATACGATCT ACAATAATGC TATCAACACC CAGCCGGGCA TGTTAAACGG ACAATCCCAA
ATCGACTACA ACGGTCGCAG CAGTCGGATC GTTACCCAGG TCAATTCGCA GCAGGCGCGG
TTATTCGGGT TCAACGCGCA GCTTTCGGCC GATCTGACTC AGTCGCTTAC CGTGTTCGGC
ACCGTAACCT ATACAAAAGG CCGTATCCGG ACCGACTCCG TGGGCTACCC CCTCGACCAC
ATTCCACCGC TGTATGGCAA AGGCGGCATC CGGCTAACGA TCAGACAATT TCGGGCTGAG
GCCAATGTTC TATTTAATGG ATGGAAACGG TTGAAGGATT ACAATCTGGT AGGGGAGGAT
AACATCGTGT ACGCAACATC ACAGGGTATG CCCGCCTGGC AAACGGTTAA TCTCAGAACC
AGCTATCAGG TGAATCGCAA CTTGCAGATG CAGGCCTCGC TGGAAAATAT TCTGGATCGA
AACTATCGCG TTTTTGCATC GGGAATCAGC GCGCCCGGCC GGAATCTAAT ACTCACCTTG
CGGGGAACGC TATAA
 
Protein sequence
MKRLLGLILG LLPFWTSAQT LLVRDKTTLQ SIENVEVRRL SPGASQPIFT DRSGQADASA 
LTGTDNVVFR RVGYQTVRYS MEQLRTLNFT VLMAEKQLAI NEVVVAASRT TESLLKVAQP
IRVFTRNELR FLNQPTMAEV LQQSGQVLVQ KSQLGGGSPI LRGFEANKVL MVVDGVRMNN
AIFRGGHLQN ILTIDNAAVE RMEVALGPGS VVYGSDALGG VIYVQTLSPK LSVSENTAVN
ANGFVRYGSA MNEKTAHADW NLGFRKWALT TSVTGSDFGD LRQGKQRNAD MGQLGLRPFF
AGFENNTDVK ITNPDPLVQT PSGYKQIDLL QKVLFQPNER TQHLLNVQFS TSSDIPRYDR
LTEVDAKGNP SHAQWYYGPQ KRLLTSYGLT KQFTSGIADE LKLIAAYQSI EESRHNRRFG
NYGLQHRTEN VNVWTLNADL KKKLADSHTL RYGLEGTYNT VQSTAYRQNV QTGKIDPLDT
RYPDGGANTQ SLAGYVSGTL DVSTRSTLTY GARYAYNRLY AKFNDKTFFP FPFNDITQQS
GAVTGSLGWV TRLQGEWQLA TSVSSGYRVP NVDDLAKVFE SVAGNLIVPN PNLKPERTYT
FDAGVRKQIA ERVSFEAEGF YTIYNNAINT QPGMLNGQSQ IDYNGRSSRI VTQVNSQQAR
LFGFNAQLSA DLTQSLTVFG TVTYTKGRIR TDSVGYPLDH IPPLYGKGGI RLTIRQFRAE
ANVLFNGWKR LKDYNLVGED NIVYATSQGM PAWQTVNLRT SYQVNRNLQM QASLENILDR
NYRVFASGIS APGRNLILTL RGTL