Gene Slin_3236 details

Gene Information

Locus tag: Slin_3236
Symbol:
ID: 8726989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 3918463
End bp: 3920889
Gene Length: 2427 bp
Protein Length: 808 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: TonB-dependent receptor
Protein accession: YP_003388046
Protein GI: 284038116
COG category:
COG ID:
TIGRFAM ID:
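
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are inclusive, 1-based positions and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3918463
end_bp = 3920889

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (2427 bp) and Protein Length (808 aa).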


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.833355
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATCGAA CGTCTACACA CCTTATTGTT CTCTTCTTCT TTTTGTTGCC AACTCTGTCT 
GTCGCCCAGG CGATTGAAAA TCAACGTCAG GCGTGGGTAC GGGGTACCAT TATTGACGAA
AAGAAAGCAC CCATTCCCTT TACGACCGTT GCCTTGCATC GAGCAGCAGA CTCCGCACTG
GTGAAGGGAG TAGTGGCCGA TGAAGCCGGT ATGTTTGCGC TGCAAACCAA GCCGGGAAAC
TACTTTTTGA GAGTGAGTGC CGTTTCCTTT AAGGAAAAAA TTAGTCCGGG GTTTACGATT
GGTGAACAGG ATAGGCAACT CGGCACCATG ACTCTGATGG CAACGGCAAA ACGACTCGAT
GAGGTTGTGG TGATGGGTGA GAAAAGCCAG ATGGAGCTAG CCCTCGACAA ACGAATCTTT
AACGTTGGTA AAGATCTGGC AAACGCCGGT GGAACTGCGT CGGATATTCT GAAAAATATA
CCCTCGGTAG CCGTCGACGG GGAGGGAAAT GTGAGTCTAC GCGGGAGTAA CAGCGTCCGC
ATTCTGATTG ATGGTAAACC CTCCGGTCTG GTGAGTTTTA AAGGCGGAAG CGGCCTCCAG
CAATTGCAGG GCAGCATCAT TGAACGGGTT GAGGTCATTA CTAATCCATC GGCCCGTTAC
GAAGCGGAAG GCATGGGGGG CATTATCAAC ATTGTACTCA AGAAAGAGCG GAAAGAAGGC
ATCAACGGTT CATTCGACGT GATTACGGGC TATCCGAGTA ATTTTGGGGC TGCCGCCAAC
GTCAACTACC GGCGTAAGAA TCTGAATTTT TTCGTCAATT ACACGGCTTC GTTCCGAAAT
ACGCCCGGCC GCAGTTCGCT GTATCAGGAA GTGTACGATG CCAATACAAC CTATGTTTAC
CGGCAAAACT CCACCAACAA CCTTAAGGGC CAGAATAACA ATGCCCGCGC CGGTATCGAC
TATTACTTTA GCCCCAAAAG TATCCTTACG GGTTCGTACA CCTGGCGATT AAGCAAGGGC
AAGCGCTTCG CCGATATTCA GTATCTTGAC TACCTGCCGA ATTCAAACCA TACGATTCAG
GCAATTACAA ACCGGACGCA GGATGAGACG GAAACCGAAC CGAACTCGGA GTATGTGGTG
AGCTATAAGA AAACATTTGC CCGGCAGGGA CATGAACTGA CCGCCGACAT TCGCTACCTC
GATAACTGGG AAAAATCAGA TCAGTTCTTC AACCAGAAAA TGCTTCAGCC GGACGGTAAA
CCCTCGGCCG TACCGGATGT TCTCCAGCGG TCGATTAATG ACGAAACCGA GAAACAACTG
CTGATTCAGG TCGACTATGT TCAGCCGTTT GCTAAAGATG GCAAGCTGGA AGGGGGAATG
CGCCTGAGTT CGCGCGACAT GACCAACGAT TACACCGTTA CGCAGCAGAG TTCAGGGGGC
TCCTGGACAC CGTTGCCGGG CCTGACCAAT GACTTTCTGT ACGTCGAGAA GATTAACGCG
CTGTACGGAA TTGTGGGTAA CAAGATGCGG AAGTTTTCGT ACCAAATGGG ACTTCGCGCC
GAGTGGACCG ACGTGACAAC CACTCTCAAG CAAACCAACG ACGTAAATCC GCGCAGCTAC
GCCAATCTGT TCCCGAGTGT ACACGCCACT TACGACCTGC CCCACCAGCA TGCGTTACAG
ATAAGCTACA GCCGCCGGGT GCGTCGGCCG CAGTACAATG ATCTGAGCCC GTTCATGACG
TTCAGCGACA ACCGTAACTT TTTCAGCGGT AATCCCGATC TTAATCCCGA ATTCACCAAT
GCATTTGAAA TTGGGCACAT CAAGTACGTC GAAAAAGGGT CGATGAGTTC GTCGATCTAT
TACCGGCACA CAACGGGAAA GATCATCCGG ATTCGTCGGG TCAATGAACA GGGGAATTCA
ACCACGCGCC CCGAAAATCT GGCAACCGAA GATTCTTACG GTGCCGAGTT TACGGGCTCA
TATGCGCTCT ATAAATGGTG GAAGCTGGAC GGGAGCGTCA ATTTTTTTCG GGCTATCACG
AACGGTGGCA ATCTCGACGC GAGTTACCAG AGTGATACCT ACAGCTGGTT CACCCGCTTA
ACATCGCGTT TTACGGTTTT AAAAAATACG GATATCCAGA TGCGGGGCAA TTACGAAGCT
CCACAAAAAA CGCCCCAGGG AAGTCGGAAA GCTATTGCCA CGCTGGATCT GTCGTTCAGT
AAGGACATTC TCAACAATAA TGGAACACTG GTCTTCAACG TCATTGATGT GTTCAATTCG
CGTCGGTATC GGTCTATTAC CGAAGGACAA AATTTCTACA CGGAAAGTAC GTCGCAGGGG
CGCTTACGCC AGTTTAACCT AACATTAAAT TACCGGCTGC ATCAGGCTAA GAAGAAAATG
AAAGACCCCG GCGAAGGGGA ATTTTAA
 
Protein sequence
MHRTSTHLIV LFFFLLPTLS VAQAIENQRQ AWVRGTIIDE KKAPIPFTTV ALHRAADSAL 
VKGVVADEAG MFALQTKPGN YFLRVSAVSF KEKISPGFTI GEQDRQLGTM TLMATAKRLD
EVVVMGEKSQ MELALDKRIF NVGKDLANAG GTASDILKNI PSVAVDGEGN VSLRGSNSVR
ILIDGKPSGL VSFKGGSGLQ QLQGSIIERV EVITNPSARY EAEGMGGIIN IVLKKERKEG
INGSFDVITG YPSNFGAAAN VNYRRKNLNF FVNYTASFRN TPGRSSLYQE VYDANTTYVY
RQNSTNNLKG QNNNARAGID YYFSPKSILT GSYTWRLSKG KRFADIQYLD YLPNSNHTIQ
AITNRTQDET ETEPNSEYVV SYKKTFARQG HELTADIRYL DNWEKSDQFF NQKMLQPDGK
PSAVPDVLQR SINDETEKQL LIQVDYVQPF AKDGKLEGGM RLSSRDMTND YTVTQQSSGG
SWTPLPGLTN DFLYVEKINA LYGIVGNKMR KFSYQMGLRA EWTDVTTTLK QTNDVNPRSY
ANLFPSVHAT YDLPHQHALQ ISYSRRVRRP QYNDLSPFMT FSDNRNFFSG NPDLNPEFTN
AFEIGHIKYV EKGSMSSSIY YRHTTGKIIR IRRVNEQGNS TTRPENLATE DSYGAEFTGS
YALYKWWKLD GSVNFFRAIT NGGNLDASYQ SDTYSWFTRL TSRFTVLKNT DIQMRGNYEA
PQKTPQGSRK AIATLDLSFS KDILNNNGTL VFNVIDVFNS RRYRSITEGQ NFYTESTSQG
RLRQFNLTLN YRLHQAKKKM KDPGEGEF
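
The sequences above are laid out in blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of one way to do that. The helper names clean_sequence and gc_fraction are hypothetical and are not part of any database tooling. Applied to the full gene sequence, the results could be compared with the Gene Length and GC content fields listed in this record.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used to lay the sequence out in blocks of ten."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty one)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used here only as a short sample.
first_line = "ATGCATCGAA CGTCTACACA CCTTATTGTT CTCTTCTTCT TTTTGTTGCC AACTCTGTCT"
sample = clean_sequence(first_line)
print(len(sample), round(gc_fraction(sample), 3))
```

Pasting the whole gene sequence block into clean_sequence in place of first_line gives the full-length check.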