Gene Slin_1136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1136 
Symbol 
ID8724869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1389229 
End bp1392225 
Gene Length2997 bp 
Protein Length998 aa 
Translation table11 
GC content51% 
IMG OID 
ProductTonB-dependent receptor 
Protein accessionYP_003385986 
Protein GI284036056 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.028233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.759049 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAACT TACTGACTGT GAAGCAATTG ATCCGACTTT CGCTAATCGT TGTTCATTGT 
ACATTATTCA TTACCCTTAA ATCCTTCGCC CAGTCACCGG GTACAATCAG CGGGCAGGTA
GTGGATTCGT TAACCCGAAA GCCTTTGCTT GAAGCGTCGG TATCGCTTTT ATCGGCAAAG
GATTCCTCAC TGGTAAATTT TGGTATTACA GATGGAGAAG GACGTTTCTC GTTTCCCAAA
ATAGCCGAGG GACAATACCG CGTGCTGATT ACGTATGTGG GTTACCGTAG CCGTGCCCGT
CGGGTTGTGG TCACCAAAAC CGACCCGTCG CCAAACGTTG GTGCTATTGA TTTGGTCGCT
CAGTCGCAAA CGCTGACGGA AGTGTCTGTA CAGGGCGAAC GGGCACCCAT TGCCGTAAAA
GGCGACACGC TGGAGTTTAA TGCCGGTTCG TTCAAAACCC GTCCTAATGC TCAGGTAGAA
GAGTTACTGA AAAAACTGCC GGGCGTAGAA GTCGACCGGG ATGGTACCGT TAAAGCGCAG
GGCCAGGCTG TTACGAAAGT GCTGGTAGAC GGAAAACCTT TTTTTGGCAA TGACCCCAAA
ATGGCCACCC GAAATCTCCC TGCTGATATT ATCGACAAAG TACAGCTCTT CGATCAGGCT
TCGGAGCAAT CCGCCTTTTC GGGCGTGGAT GACGGCGACC GCGAAAAAAC CATTAACATC
ACCACCAAGA AAGACAAACG GAAAGGGTCT TTCGGGCAGC AAAGCATTGG CGTCGGTCCT
CAAACCGGCG ACCGGAGCGC CGGACCGGAT GCCCGGTATT CGGGGCGGGT GAGTTTAAAT
CGATTTAACA ATGGTCGTCA GATTTCGGTG TTAGGAATGG CGAACAACGT CAACCAGCAG
GGTTTCACGG CGCAGGATTT GGGGCTCGGC GGCAACTTCG GCGGGGCAGG TCAGGGCCAG
GGTGGTGGCG GAGGTGGTGG CGGCAACGTG GTTCGTGGGG GCCAGGGTGG CGGTAATTTT
GGCGGACAGA ATCAGGTTGG TAGCAACGCC ATCACGCAAT CGTGGGCGGC CGGTATCAAC
TACCGCGACG GCTGGGGTAA AAAAATAGAT GTTGTGAGCA GCTACAACGC CAGCAATACG
AACACCCTCA CCCAGCAAAG CAGCCGCCGG GAAAACGTTT TGCCCGGCGG AGCAACTACA
CGGTCGGACT CGTCTTTCGT ACGGAATCAA ACGAACGGTT CAGACAATAC AAATACCAAC
CACCGGGTTA ACTTACGACT CGATTATCGG CTCGATTCCC TGACCACAAT TCGTCTTATA
CCGAGTTTGT CGTGGCTAAA TTCGTCGTAC AGCAACCAAA GTGATGCCCG AACGGTAAAT
GCACAGGGGG CATTGGCTAA CGCAAGCACA ACGAATTACA ACTCCGTAGG GGATGGCTTT
ACCGGTAATA ATTCGTTGCT CTTGTTCCGG AAGTTCAGGA AGCGCGGTCG TACTTTTTCG
GTCAACTGGA ACATTGCCCT GAACGATCAG GATAATCAGG GCACCAATAT GTCCGTCAAT
CAATTTACCC GTTCTAATGC GCCGATCTCA ACGACGGGCA CATCAGGAAC AGCGACAACA
GGGCAGGCGG ATACCACAGG CTTGTTCAGG CAGGTAATCA ACCAGCGCAA CAACCAGCAA
ACGAACTCCA TGACCAACAG CGTAAACGTG AGTTACACGG AACCGCTGTC CATGCGCCAA
ACACTGGAGT TTCACTACCT CTTGTCCAAT AACCACAACA CGTCGAACCG GGCGGTCAAT
GATTTTAACG AGGCCACCAG CCAATACGAC TTGCCCAATA CGGTGCTGAG CAATCGGTTT
GTAAACGACT ACGTGACCAA CCGCGCCGGT CTGACGTGGC AAACCAAGCG ATTGAAATAC
ACTTATGCCT TCGGGCTGGA TGGGCAGCAG GCAAGTCTAC AGTCAACTAA CCTAAGCCGC
GAAACCAACC TGAGCCGGAC GTTTACGAAC TTGCTCCCCA ATGCGTTGCT TACCTATAAT
TTTGCCAAGC AGCGTACATT GCGCTTTAAC TACCGTACCC GCATTAACGC GCCGTCGGTA
AATCAGTTGC AGCCGGTTGC GAATAACACA AACCCGCTGA ACATACAACT CGGTAACCCT
GATCTACAGC CCGAATACAG CCATAATATC TCGCTGAACT TCAACCGGTT TGAGCCGTCG
ACGTTCCGGA ATTTGTTTGC GTCGATAAAC GCCAGCCGGA CAGATAACAA AATTGTGAAC
TCAACGGTAT TTACCCAATC GGGCGCACAG ACCACAACAC CGATCAATAC AAATGGGTAT
TACACGGTCA ACGGGTTTCT GGTGTTAGGG CAGCCGGTTA AGATTGGTAC CCAGAAAACG
AATCTGAACC TGCGAACCAA CCTGACCTAC AACAACGGCA CTAGTTTTAT CAATCGGCAG
GCCAATCAGG CAAAAAACTG GCTGGTGGGA CAAACGGTTG GCTTAAGCTC CAATTTTACC
GAAAAGCTCG ACCTGAATCT ATCGGCGAAT ATCAATCTTC AGTCGGCCAA ATACTCCTTG
CAGCCTCAGC AGAATACGAC CTTCCTGAAC CAGACGGTTA CGCTTGATGT GTACTACCAA
CTGCCGGGCC GTTTTACGCT CTCGACGGAT GTGTATTACA ATCACTACGG CGGTAACTCG
GCTAGTTTCA ATCAGTCGTT TACGCTGTGG AATGCAACAC TGGCAAAGCA GTTATTTAAA
CAGAATCAGG GGGAATTGCG GCTTCAGGTG TTCGATTTGC TGAATCAGAA CCAGAGTATT
GTCCGAAATG TGACCGATAC CTACACGGAA GAAGTCCGGA GCCGGGTGCT GAACCGCTAT
TTTATGGTAA GTTTTGTGTA TAACCTGCGG AGTTTCAGCG CGGGTGTAAC GCCACCAAGA
GACCCATTTA GTCAGCCAAC GCGCGGGCAG GGGGGAGGTT TCCGCCGGAA TGGGTAA
 
Protein sequence
MRNLLTVKQL IRLSLIVVHC TLFITLKSFA QSPGTISGQV VDSLTRKPLL EASVSLLSAK 
DSSLVNFGIT DGEGRFSFPK IAEGQYRVLI TYVGYRSRAR RVVVTKTDPS PNVGAIDLVA
QSQTLTEVSV QGERAPIAVK GDTLEFNAGS FKTRPNAQVE ELLKKLPGVE VDRDGTVKAQ
GQAVTKVLVD GKPFFGNDPK MATRNLPADI IDKVQLFDQA SEQSAFSGVD DGDREKTINI
TTKKDKRKGS FGQQSIGVGP QTGDRSAGPD ARYSGRVSLN RFNNGRQISV LGMANNVNQQ
GFTAQDLGLG GNFGGAGQGQ GGGGGGGGNV VRGGQGGGNF GGQNQVGSNA ITQSWAAGIN
YRDGWGKKID VVSSYNASNT NTLTQQSSRR ENVLPGGATT RSDSSFVRNQ TNGSDNTNTN
HRVNLRLDYR LDSLTTIRLI PSLSWLNSSY SNQSDARTVN AQGALANAST TNYNSVGDGF
TGNNSLLLFR KFRKRGRTFS VNWNIALNDQ DNQGTNMSVN QFTRSNAPIS TTGTSGTATT
GQADTTGLFR QVINQRNNQQ TNSMTNSVNV SYTEPLSMRQ TLEFHYLLSN NHNTSNRAVN
DFNEATSQYD LPNTVLSNRF VNDYVTNRAG LTWQTKRLKY TYAFGLDGQQ ASLQSTNLSR
ETNLSRTFTN LLPNALLTYN FAKQRTLRFN YRTRINAPSV NQLQPVANNT NPLNIQLGNP
DLQPEYSHNI SLNFNRFEPS TFRNLFASIN ASRTDNKIVN STVFTQSGAQ TTTPINTNGY
YTVNGFLVLG QPVKIGTQKT NLNLRTNLTY NNGTSFINRQ ANQAKNWLVG QTVGLSSNFT
EKLDLNLSAN INLQSAKYSL QPQQNTTFLN QTVTLDVYYQ LPGRFTLSTD VYYNHYGGNS
ASFNQSFTLW NATLAKQLFK QNQGELRLQV FDLLNQNQSI VRNVTDTYTE EVRSRVLNRY
FMVSFVYNLR SFSAGVTPPR DPFSQPTRGQ GGGFRRNG