Gene Slin_0823 details


Gene Information

Locus tag: Slin_0823
Symbol:
ID: 8724554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 1000428
End bp: 1002881
Gene Length: 2454 bp
Protein Length: 817 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: TonB-dependent receptor
Protein accession: YP_003385684
Protein GI: 284035754
COG category:
COG ID:
TIGRFAM ID:
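
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the reported gene length, and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1000428, 1002881)
print(length, protein_length(length))  # 2454 817
```

Both values agree with the Gene Length and Protein Length fields of the record.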


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.173477
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.677151
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTCG CTACAGTTTA CTACATAACC TTCATCTTAC TACTTCCCTG TGGGGCAATC 
GCTCAGCAAT CCGTTTCCTC AACGGCCTCT CAATCGATAG GCCAGATCAG CGGAACGGTG
CTGGATTCGG TAACCCGACA GCCCGTGCCT TTTGCAACAG TAGCCCTTTC AACGTCTACT
GGTAATGTAC TTGCCGGCAA AATAACGTCT GAGACGGGCA CATTTATGTT TTCTGGCTTA
CTGCCTGGCT CGTATACGCT GCGACTGACA TTTGTCGGCT ACCAGACCCG TACCCTGACC
GCGTTCGCGC TGACGGACCA GAAGCCCGGT GTGCAACTGG GTACGCTGCT CCTGCATCCT
GAAAGTCGCC AACTCAATGA GGTCGTGATT ACGGGGCAGA AAGCCCTGAT CGAAGAGAAA
TCGGACCGAC TGGTGTATAA CGCGGCCAAC GATCTGACCA ACAAGGGCGG AACGGCGGTG
GACGTGCTGC GGAAAGCCCC CATGCTTACG GTGGACGTAA CGGGTAATGT GCAGCTACGC
GGCAGCTCAA ATCTGAAAGT ATTGCTAAAC GGTCGTCCGT CGGGTTTGCT GGCTCGTAAT
TTGAGTGAAG CGCTGAAGAT GATACCGGCC AATACGATTC AGTCGGTGGA GGTCATTACC
AGCCCGTCTG CCCGATACGA TGCCGAAGGG TCGGGCGGGG TCATCAACAT CATCACCAAA
AAACAATTGA AAGGTTCTTC CGGCAATCTG GATGTAACGG CGGGTAATTA CACGCAGTCG
ATTGGCGGCA GTTATGGAGT CAAACGGGAG AAATTTGGAT TGACATTTTC GGGCAATGGG
AATGCTGAAC GCGAAAAAAG CGTGTCCGAA ATGACCCGTA TTTCGCTGCT CAATGGGCAA
CCCGCCGGTG AGTTGTTCCA GCGACGTAGC GCCAACAATG TGCATCGGGG CTGGTTTGGC
GATCTGAGTT TGGACTACGC CTTCGATACA CTAAACCGGG TAAATCTCTC GATCAGTACG
TGGGGCGGGG CCTGGCCCAA CTCAAGTTCG CTCTATAATC GCTTTCGAAA TGCAGAAGGG
ATCGTAACCC AGGCGTATAA TCAGGCGGTA AATCAACAGG AGCCCTTTGG TAATATTGAA
TTTAATTTAG GCTATACCCG TGCCTTTAAA AGGCCAAAAC AGGAGTTAAT TCTCCTGGGA
CAGTACAGTT ACACCTTCGA CAACACGAGC TATACCAGTG ATCAATTTAC GCCAATAGGC
GTGCCGATTT ATCGGGAAAC CAGTACGAAT CAAAGTCATA ATCCACAGTA TACGTTTCAG
CTTGATTACA CCCATCCATT TTCCTCATCA GGTCGGCAGG TTTTCGAGGT GGGTGCCAAA
GCTATTCGGC GGGATGTGAG CAGTCGGTAC GCTATTTATA ACAGTAGTGT GGGTGCCGTC
GATGTTCTGC TCTACAACGT CAGCCGTTCC AATAATTTCG TTTACGATCA GCAGGTGCTG
GCCACGTATG CGTCGCTAAA ACTGTCAAAC CAGACGAAGT GGATGCTTCA GTCTGGGGTT
CGATTGGAGA ATACGATCAA TGAAGGACGA TTTGCTGATT CCATTCCACC CTTTCGGATT
CAATTTACGA ACTTCATCCC AAGTCTAACG CTGAGTAAAC AACTGAGCGA ACGGCAGTCG
TTTAAAATCA GTTACACCCA GCGTATTTCC CGGCCCATGA TCTGGGATTT GAATCCCTAT
ATCAATGCAA GTGACCTTAA GAACCTGAGT GCGGGCAATC CGCAGCTTCG CCCCGAACTG
ACGCATCTGG CCGAGCTGTC CTATAGTCTG ACAACCAAAA GCGGAGCCTA TCTCAACCTG
GCCCTGTATC GACGTGAGAC AAACAATTCC ATTGAGGAAG TTCGAACCGT CGATACGTCC
GGGGTGTCCC GATCCATTAA GCAGAACGTG GCGCGTAACC AGCGAACAGG ATTGAATGTA
AACGCAGCGG GGCAGTTTAA CCGAAACTGG AAAATCAACG GGGGCGGTGA ATTCTACCAT
ACCCAGTTCA GCAGCACCGC CTTGCAGGTA CAGAATTCAG GCTGGCTCTG GCAACTTAAT
CTGAACATGG CCTACCAGTT GCCGCAGAAT TATTCATTGC AGGCTTACGG GATGTACAGC
ACTGGCTGGA TTCTATTGCA AGGTAAAAAC TCGGCCTGGT ATCATTACAG CCTGGCAGCC
CGCAAGGAGT TCTGGGACAG GAAAGCCAGT CTGACGCTGG GTGTTAACAA TCCCTTCACG
CAGCCATTCC GGCAAAACAA CGAGTCGCAG TCCAGTTCCT TCCGGGCGCA TACAGCTAAT
CAATACGTCA CGCGATCGGT TAAACTGACC TTTAGCTGGC AGTTTGGGCA GATTCGGGCG
GGTAATGAAC CAGCGGGCAA AAAAATTATT AACGATGATG CCAAGGCGAA ATAA
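
The gene sequence is displayed in space-separated groups of 10 bases, so it has to be joined before any analysis. A minimal sketch of that cleanup and of a GC-percentage calculation follows. The helper names are hypothetical, and only the first displayed line is used as sample input, so the printed values describe that line and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a displayed sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_line = "ATGAAATTCG CTACAGTTTA CTACATAACC TTCATCTTAC TACTTCCCTG TGGGGCAATC"
seq = clean_sequence(first_line)
print(len(seq), seq[:3])  # 60 ATG
```

Passing the full cleaned gene sequence to `gc_percent` would give a figure to compare against the GC content field of the record.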
 
Protein sequence
MKFATVYYIT FILLLPCGAI AQQSVSSTAS QSIGQISGTV LDSVTRQPVP FATVALSTST 
GNVLAGKITS ETGTFMFSGL LPGSYTLRLT FVGYQTRTLT AFALTDQKPG VQLGTLLLHP
ESRQLNEVVI TGQKALIEEK SDRLVYNAAN DLTNKGGTAV DVLRKAPMLT VDVTGNVQLR
GSSNLKVLLN GRPSGLLARN LSEALKMIPA NTIQSVEVIT SPSARYDAEG SGGVINIITK
KQLKGSSGNL DVTAGNYTQS IGGSYGVKRE KFGLTFSGNG NAEREKSVSE MTRISLLNGQ
PAGELFQRRS ANNVHRGWFG DLSLDYAFDT LNRVNLSIST WGGAWPNSSS LYNRFRNAEG
IVTQAYNQAV NQQEPFGNIE FNLGYTRAFK RPKQELILLG QYSYTFDNTS YTSDQFTPIG
VPIYRETSTN QSHNPQYTFQ LDYTHPFSSS GRQVFEVGAK AIRRDVSSRY AIYNSSVGAV
DVLLYNVSRS NNFVYDQQVL ATYASLKLSN QTKWMLQSGV RLENTINEGR FADSIPPFRI
QFTNFIPSLT LSKQLSERQS FKISYTQRIS RPMIWDLNPY INASDLKNLS AGNPQLRPEL
THLAELSYSL TTKSGAYLNL ALYRRETNNS IEEVRTVDTS GVSRSIKQNV ARNQRTGLNV
NAAGQFNRNW KINGGGEFYH TQFSSTALQV QNSGWLWQLN LNMAYQLPQN YSLQAYGMYS
TGWILLQGKN SAWYHYSLAA RKEFWDRKAS LTLGVNNPFT QPFRQNNESQ SSSFRAHTAN
QYVTRSVKLT FSWQFGQIRA GNEPAGKKII NDDAKAK
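
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds a codon table from the standard amino-acid ordering, which translation table 11 shares with the standard code for internal codons. It does not handle the alternative start codons of table 11, which is acceptable here because the gene starts with ATG. `translate` is an illustrative helper, not a database function.

```python
from itertools import product

# Codons in T, C, A, G order paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon, trailing bases are ignored."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 24 bases of the gene sequence listed above.
print(translate("ATGAAATTCG CTACAGTTTA CTACATAACC"))  # MKFATVYYIT
print(translate("AACGATGATG CCAAGGCGAA ATAA"))        # NDDAKAK*
```

The two outputs match the first ten and last seven residues of the protein sequence, with the final `*` corresponding to the TAA stop codon.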