Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1136 |
Symbol | |
ID | 8724869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 1389229 |
End bp | 1392225 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | TonB-dependent receptor |
Protein accession | YP_003385986 |
Protein GI | 284036056 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.028233 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.759049 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAACT TACTGACTGT GAAGCAATTG ATCCGACTTT CGCTAATCGT TGTTCATTGT ACATTATTCA TTACCCTTAA ATCCTTCGCC CAGTCACCGG GTACAATCAG CGGGCAGGTA GTGGATTCGT TAACCCGAAA GCCTTTGCTT GAAGCGTCGG TATCGCTTTT ATCGGCAAAG GATTCCTCAC TGGTAAATTT TGGTATTACA GATGGAGAAG GACGTTTCTC GTTTCCCAAA ATAGCCGAGG GACAATACCG CGTGCTGATT ACGTATGTGG GTTACCGTAG CCGTGCCCGT CGGGTTGTGG TCACCAAAAC CGACCCGTCG CCAAACGTTG GTGCTATTGA TTTGGTCGCT CAGTCGCAAA CGCTGACGGA AGTGTCTGTA CAGGGCGAAC GGGCACCCAT TGCCGTAAAA GGCGACACGC TGGAGTTTAA TGCCGGTTCG TTCAAAACCC GTCCTAATGC TCAGGTAGAA GAGTTACTGA AAAAACTGCC GGGCGTAGAA GTCGACCGGG ATGGTACCGT TAAAGCGCAG GGCCAGGCTG TTACGAAAGT GCTGGTAGAC GGAAAACCTT TTTTTGGCAA TGACCCCAAA ATGGCCACCC GAAATCTCCC TGCTGATATT ATCGACAAAG TACAGCTCTT CGATCAGGCT TCGGAGCAAT CCGCCTTTTC GGGCGTGGAT GACGGCGACC GCGAAAAAAC CATTAACATC ACCACCAAGA AAGACAAACG GAAAGGGTCT TTCGGGCAGC AAAGCATTGG CGTCGGTCCT CAAACCGGCG ACCGGAGCGC CGGACCGGAT GCCCGGTATT CGGGGCGGGT GAGTTTAAAT CGATTTAACA ATGGTCGTCA GATTTCGGTG TTAGGAATGG CGAACAACGT CAACCAGCAG GGTTTCACGG CGCAGGATTT GGGGCTCGGC GGCAACTTCG GCGGGGCAGG TCAGGGCCAG GGTGGTGGCG GAGGTGGTGG CGGCAACGTG GTTCGTGGGG GCCAGGGTGG CGGTAATTTT GGCGGACAGA ATCAGGTTGG TAGCAACGCC ATCACGCAAT CGTGGGCGGC CGGTATCAAC TACCGCGACG GCTGGGGTAA AAAAATAGAT GTTGTGAGCA GCTACAACGC CAGCAATACG AACACCCTCA CCCAGCAAAG CAGCCGCCGG GAAAACGTTT TGCCCGGCGG AGCAACTACA CGGTCGGACT CGTCTTTCGT ACGGAATCAA ACGAACGGTT CAGACAATAC AAATACCAAC CACCGGGTTA ACTTACGACT CGATTATCGG CTCGATTCCC TGACCACAAT TCGTCTTATA CCGAGTTTGT CGTGGCTAAA TTCGTCGTAC AGCAACCAAA GTGATGCCCG AACGGTAAAT GCACAGGGGG CATTGGCTAA CGCAAGCACA ACGAATTACA ACTCCGTAGG GGATGGCTTT ACCGGTAATA ATTCGTTGCT CTTGTTCCGG AAGTTCAGGA AGCGCGGTCG TACTTTTTCG GTCAACTGGA ACATTGCCCT GAACGATCAG GATAATCAGG GCACCAATAT GTCCGTCAAT CAATTTACCC GTTCTAATGC GCCGATCTCA ACGACGGGCA CATCAGGAAC AGCGACAACA GGGCAGGCGG ATACCACAGG CTTGTTCAGG CAGGTAATCA ACCAGCGCAA CAACCAGCAA ACGAACTCCA TGACCAACAG CGTAAACGTG AGTTACACGG AACCGCTGTC CATGCGCCAA ACACTGGAGT TTCACTACCT CTTGTCCAAT AACCACAACA CGTCGAACCG GGCGGTCAAT GATTTTAACG AGGCCACCAG CCAATACGAC TTGCCCAATA CGGTGCTGAG CAATCGGTTT GTAAACGACT ACGTGACCAA CCGCGCCGGT CTGACGTGGC AAACCAAGCG ATTGAAATAC ACTTATGCCT TCGGGCTGGA TGGGCAGCAG GCAAGTCTAC AGTCAACTAA CCTAAGCCGC GAAACCAACC TGAGCCGGAC GTTTACGAAC TTGCTCCCCA ATGCGTTGCT TACCTATAAT TTTGCCAAGC AGCGTACATT GCGCTTTAAC TACCGTACCC GCATTAACGC GCCGTCGGTA AATCAGTTGC AGCCGGTTGC GAATAACACA AACCCGCTGA ACATACAACT CGGTAACCCT GATCTACAGC CCGAATACAG CCATAATATC TCGCTGAACT TCAACCGGTT TGAGCCGTCG ACGTTCCGGA ATTTGTTTGC GTCGATAAAC GCCAGCCGGA CAGATAACAA AATTGTGAAC TCAACGGTAT TTACCCAATC GGGCGCACAG ACCACAACAC CGATCAATAC AAATGGGTAT TACACGGTCA ACGGGTTTCT GGTGTTAGGG CAGCCGGTTA AGATTGGTAC CCAGAAAACG AATCTGAACC TGCGAACCAA CCTGACCTAC AACAACGGCA CTAGTTTTAT CAATCGGCAG GCCAATCAGG CAAAAAACTG GCTGGTGGGA CAAACGGTTG GCTTAAGCTC CAATTTTACC GAAAAGCTCG ACCTGAATCT ATCGGCGAAT ATCAATCTTC AGTCGGCCAA ATACTCCTTG CAGCCTCAGC AGAATACGAC CTTCCTGAAC CAGACGGTTA CGCTTGATGT GTACTACCAA CTGCCGGGCC GTTTTACGCT CTCGACGGAT GTGTATTACA ATCACTACGG CGGTAACTCG GCTAGTTTCA ATCAGTCGTT TACGCTGTGG AATGCAACAC TGGCAAAGCA GTTATTTAAA CAGAATCAGG GGGAATTGCG GCTTCAGGTG TTCGATTTGC TGAATCAGAA CCAGAGTATT GTCCGAAATG TGACCGATAC CTACACGGAA GAAGTCCGGA GCCGGGTGCT GAACCGCTAT TTTATGGTAA GTTTTGTGTA TAACCTGCGG AGTTTCAGCG CGGGTGTAAC GCCACCAAGA GACCCATTTA GTCAGCCAAC GCGCGGGCAG GGGGGAGGTT TCCGCCGGAA TGGGTAA
|
Protein sequence | MRNLLTVKQL IRLSLIVVHC TLFITLKSFA QSPGTISGQV VDSLTRKPLL EASVSLLSAK DSSLVNFGIT DGEGRFSFPK IAEGQYRVLI TYVGYRSRAR RVVVTKTDPS PNVGAIDLVA QSQTLTEVSV QGERAPIAVK GDTLEFNAGS FKTRPNAQVE ELLKKLPGVE VDRDGTVKAQ GQAVTKVLVD GKPFFGNDPK MATRNLPADI IDKVQLFDQA SEQSAFSGVD DGDREKTINI TTKKDKRKGS FGQQSIGVGP QTGDRSAGPD ARYSGRVSLN RFNNGRQISV LGMANNVNQQ GFTAQDLGLG GNFGGAGQGQ GGGGGGGGNV VRGGQGGGNF GGQNQVGSNA ITQSWAAGIN YRDGWGKKID VVSSYNASNT NTLTQQSSRR ENVLPGGATT RSDSSFVRNQ TNGSDNTNTN HRVNLRLDYR LDSLTTIRLI PSLSWLNSSY SNQSDARTVN AQGALANAST TNYNSVGDGF TGNNSLLLFR KFRKRGRTFS VNWNIALNDQ DNQGTNMSVN QFTRSNAPIS TTGTSGTATT GQADTTGLFR QVINQRNNQQ TNSMTNSVNV SYTEPLSMRQ TLEFHYLLSN NHNTSNRAVN DFNEATSQYD LPNTVLSNRF VNDYVTNRAG LTWQTKRLKY TYAFGLDGQQ ASLQSTNLSR ETNLSRTFTN LLPNALLTYN FAKQRTLRFN YRTRINAPSV NQLQPVANNT NPLNIQLGNP DLQPEYSHNI SLNFNRFEPS TFRNLFASIN ASRTDNKIVN STVFTQSGAQ TTTPINTNGY YTVNGFLVLG QPVKIGTQKT NLNLRTNLTY NNGTSFINRQ ANQAKNWLVG QTVGLSSNFT EKLDLNLSAN INLQSAKYSL QPQQNTTFLN QTVTLDVYYQ LPGRFTLSTD VYYNHYGGNS ASFNQSFTLW NATLAKQLFK QNQGELRLQV FDLLNQNQSI VRNVTDTYTE EVRSRVLNRY FMVSFVYNLR SFSAGVTPPR DPFSQPTRGQ GGGFRRNG
|
| |