Gene Slin_5636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5636 
Symbol 
ID8729410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6864470 
End bp6866851 
Gene Length2382 bp 
Protein Length793 aa 
Translation table11 
GC content49% 
IMG OID 
ProductTonB-dependent receptor 
Protein accessionYP_003390400 
Protein GI284040470 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.734932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC CGACGACCTC CTCGTTTCTT CTGATAGCTT TTTTATGGCT ATCTATGCGG 
CCCGCTTGGG CTCAACTCAC CGTTTCGGGT ACCGTCAGCG ATGCCGATGG CGGCCCGCTA
CCCGGTGCTG CCGTTGTACT GATAGATACG TATAAGGGCA CCTTTACAGA TGCTTCCGGC
GCTTTTCAGT TGACCAATCT TAAATCTGCC CCCGTAGTGT TGCAGATTTC TATGCTGGGC
TACGAAACTA AAAACCAGAC GTTTAATTCA AATCAAAACG CTTCATTAAC AATAAAATTA
TCAAAAACTG CTGTAGCTGT CGATGAGGTG GTTGTGAGTG CCACCCGGGC GAACCAGAAA
TCAGCCATTG CCTACACGGA TGTGACCCGC CGGGATCTTA ATAAATTGAA TCTGGGGCAG
GATATTCCGC AATTGCTCAA TTTCACGCCA TCCATTGTAA CGACTTCCGA TGCCGGAGCG
GGCGTTGGTT ACACCGGCAT TCGCATTCGA GGGTCCGATG CCACACGGGT AAACGTGACC
CTCAATGGCA TTCCGTATAA TGATGCCGAG TCGCAGGGGA CGTTTTTTGT CAATATGCCG
GACTTTGCAT CGTCAGTGAG TACGATTCAG ATTCAGCGTG GGGTGGGTAC GTCGACCAAT
GGGGCCGGGG CGTTTGGGGC ATCAGTCAAT ATTCAGACGA ATAAGCTGGA AGAAAAGCCG
TATGCCGAGA CAAATCTGTC GGGTGGTTCG TTCGGGACGC GGAAAGTGAA TGTGCTGGCC
GGAACGGGCC TTCTGAATAA TCATTTTGTA CTGGATGCGC GCCTGTCGAA AATTTACTCA
GACGGCTACA TCGACCGCGC TTCCTCCGAT TTGAGGTCTT TTTACCTCTC CGGTGGGTAC
TATACTAAAA AGAGCTTTTT TCGGTTGAAC GTGTTTTCGG GTCAGGAAAA AACCTATCAG
GCCTGGAACG GCGTGCCGGA GCAATATCTA AAAACCAACC GGACGTATAA TTCCTTTACG
TACGATAACC AGACCGACAA CTACCAGCAG GATAATTACC AGTTGATCAC CTCACACGAG
TTGACCAAAA ACTGGCGAGC AAACCTGTCG TTCTTTTATA CGAAAGGGAA AGGGTATTAC
GAAGAGTACA GAACAGCGGA TAATTTTAGC AGTTATGGGC TACCTAATGT AGTCATTGGT
GATTCGACCA TTAAGCAGAC TGATCTGATC CGGCGGCTGT GGCTCGACAA TGACTTCTAC
GGCACCGTTT TTTCATTCGA CTACAACAGT TTCGGGAAGT TAACGGCCAA CATCGGCGGT
GGCTGGAATC AGTACAAAGG CATCCACTAC GGCGAAATTA TCTGGTCGCG GGTGGCCGGG
AGCAGTAATA TCCGGGATCG CTACTATCAG GACGATGCGA CCAAACGTGA TTTCAACCTG
TTTGCTAAAG CGTTTTATCA ATTGACTCCA AAGCTGAACG GCTTTCTCGA TGCGCAGATT
CGGACGATTA ATTATTCATT TTTAGGCTAT AATAGTCAAT TGCAGAACGT TCAGCAGGAC
GCCAAGCTCA CGTTCTTCAA CCCGAAAGCC GGTTTTACCT ACACCATAAA TGACCGTAGC
ACGGTATATG CCTCGGTAGG CGTGGGCCAG CGGGAGCCAA ATCGCAATGA TTATACGCAA
TCGACGCCCG AAAGTCGCCC GAAAGCGGAA AAACTGATTG ATTACGAAGC AGGCTACAAA
GTTCAGGGTG AGAAACTGGC CTTTACAGCC AACGCCTATT ACATGGACTA CAAGAATCAA
CTGGTACTGT CGGGTCAGTT GAACGACGTA GGAGCGTATA ACCGGGTCAA TATTCCAAGT
AGCTATCGGG CGGGTATTGA ACTCGAAGCG GGCGCACGTC TGGCTAAACA ACTGCGCTGG
AATGTTAACG CGACCTTCAG CCGCAATCGG GTGAAAAACT TTACCGAGTA CCTGGATAAC
CTCGACAACG GACAGCAGGA AACACGTCAG TATCGTGAGA CGGATATTTC GTTTTCCCCG
AACGTCATTG CCGGTTCACA ACTGTTGTTT ACGCCCGCTA AAGGGCTGGA ACTTGCCCTT
CTCTCGAAAT ATGTTGGTAA GCAATACCTT GACAACACGT CGAACGAAAG CCGCAAGCTG
AATCCGTACT TCACAAACGA CATTCGGGTG ATTTATTCGA TTAAGCCAAA ATTCGCGCAG
GAAATCGCCT TTACATTATT GTTCAACAAT GTGCTCAACG AACTATATGA GTCGAACGGC
TTCACGGTTC CGTACATCGC AGAAGGAAAA GTAACTGCCG ATAACGGGTA TTATCCGCAA
GCGGGGCGGA ATTTTCTGGC CGGGGTGCGG GTGCGTTTCT AG
 
Protein sequence
MKKPTTSSFL LIAFLWLSMR PAWAQLTVSG TVSDADGGPL PGAAVVLIDT YKGTFTDASG 
AFQLTNLKSA PVVLQISMLG YETKNQTFNS NQNASLTIKL SKTAVAVDEV VVSATRANQK
SAIAYTDVTR RDLNKLNLGQ DIPQLLNFTP SIVTTSDAGA GVGYTGIRIR GSDATRVNVT
LNGIPYNDAE SQGTFFVNMP DFASSVSTIQ IQRGVGTSTN GAGAFGASVN IQTNKLEEKP
YAETNLSGGS FGTRKVNVLA GTGLLNNHFV LDARLSKIYS DGYIDRASSD LRSFYLSGGY
YTKKSFFRLN VFSGQEKTYQ AWNGVPEQYL KTNRTYNSFT YDNQTDNYQQ DNYQLITSHE
LTKNWRANLS FFYTKGKGYY EEYRTADNFS SYGLPNVVIG DSTIKQTDLI RRLWLDNDFY
GTVFSFDYNS FGKLTANIGG GWNQYKGIHY GEIIWSRVAG SSNIRDRYYQ DDATKRDFNL
FAKAFYQLTP KLNGFLDAQI RTINYSFLGY NSQLQNVQQD AKLTFFNPKA GFTYTINDRS
TVYASVGVGQ REPNRNDYTQ STPESRPKAE KLIDYEAGYK VQGEKLAFTA NAYYMDYKNQ
LVLSGQLNDV GAYNRVNIPS SYRAGIELEA GARLAKQLRW NVNATFSRNR VKNFTEYLDN
LDNGQQETRQ YRETDISFSP NVIAGSQLLF TPAKGLELAL LSKYVGKQYL DNTSNESRKL
NPYFTNDIRV IYSIKPKFAQ EIAFTLLFNN VLNELYESNG FTVPYIAEGK VTADNGYYPQ
AGRNFLAGVR VRF