Gene Information |
Locus tag | Slin_4238 |
Symbol | |
ID | 8727997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 5112502 |
End bp | 5115807 |
Gene Length | 3306 bp |
Protein Length | 1101 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003389021 |
Protein GI | 284039091 |
COG category | |
COG ID | |
TIGRFAM ID | |
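
The length fields above follow from the coordinates. Here is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported 3306 bp) and that the final codon of this unspliced CDS is a stop codon. The helper names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 5112502
END_BP = 5115807


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 3306 1101
```

Both results match the Gene Length and Protein Length rows of the record.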
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.324015 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAAAA TTTTACTGAT GAGCTTACTT CTGGTATGCT CATTCTGGCT TCCAGCCTGG GCTCAGGAAC GAACAATTAC AGGTAAGGTT ACGGCCGCCG AAGATGGTAC GCCTTTACCG GGTGTATCTG TTGTGTTGAA GGGAGTGGCC CGGGGAACGA ATAGCGATGC TAACGGTGCC TACTCACTCA ATGTCCCGAC AAAAGGGGGA ACGCTGGTAT TCAGCTTTGT TGGAGCGGCT TCGCAGGAAA TCGAAATCGG CAACCGTTCC GTTATTGACG TTAAACTGGC GAACGACGCC AAGCAATTGG GTGAAGTGGT TGTAACGGCT CTGGGCCAGC AACGGGATAA GAAAGCACTG GCCTATGCCG TCTCCAACGT AAAAGGAGAT GTGCTCCAGC AACGGTCGGA GCCGGACCCG CTGCGGGCCT TATCGGGTAA AGTACCGGGG GTAAATATCA CGGCCGGTAA CGGAGCACCC GGTGCGGGTA CGCGGATTAC CATCCGGGGT AACAACTCCT TCACAGGTAA CAACCAGCCG CTGTTCGTTG TCGATGGTAT TCCTTTCGAT AACTCTGTAA ATACTCCACA AAATGGTAGC CAGGGCTACA ACACAAACAC TGTAACAACA AACCGGGCAT ACGACATTGA CCCGAACAAC ATTGAAGCGA TGACCGTACT GAAGGGTGCG GCTGCATCGG CGTTATATGG CTCACGGGCT GCCAACGGCG TTATTGTTAT CACAACCAAA TCGGGTAGCA AGTCGGCGCG GAAAGGTCTT GAGATTAATT TCAACACCTC GTACTCAGTC GAAAACGTAT CAACGGTTCC TGATTACCAG AACACCTATA CGCAGGGCTC CAACCAGACC TACAACGGTG GGTTTATCGG AAACTGGGGA ACTGTTTTCC CATCGGAGGT TGACCGCATC AACGCCGGCC TGGGTTTTGA GCGGTATTCA AAAGTAGTTG ATCCGGATTA TCCGGCGGGT ACCATTCCTC ATCCATTGGT CGATGCAACG GTACCCTATG GTGCTGCCCG CTACCAATCG GCTTTCCCTG AATTGCTCCA GTCAAACGGC CGGGGTATTG CGGTGCCACT CAAGCCTTAC GATATTATTG GCGGCTTTTT CCGGACGGGT AAAGTGATGG AAAATGGTAT TCAGATCACC TCTACCGGTG ATAAAACATC GCTGAACGCG TCGGTCTCAC GAACCAAAAA TGAGGGTATT ATTCCAAACT CATTCACGGA CCGTACCACC CTGAGCTTTG GTGGTAATGC TACCCTCACC AACAAAGTAA ACGTAGCCGG TAGCGTAGCA TATACGAATA CCAATCAGCA AAGTCCACAG TCGGGTGCTG GTTACTACGC TGACTACGGC GGGCTGGCTT CTGCCGGTTC TATCTACAGC CGTTTGTTCT ATCTTCCCCG TAACTTCGAC CTGAACGGCT ATCCGTTTGA AAACCCCGTA GATGGTTCAA ACGTATTCTA CCGGGCGCTC GATAACCCGC TCTGGACCGC TAAGTATAAC CTCTATAACT CCAGCGTGAA CCGGGTTTAT GGAAATATGA CGTTGAGCTA TGACGTTACG CCCTGGCTGA ACTTCACCGC CCGTGGTGGA ATAAATACGT ATTCAGAAAC CCGCAAAAAT GTCCTCCGTC CCGGTGGTTC GTTTTCTCCG CTCGGTTCGG TGTCCCGGAC AGATTTGACG AATACAGAAA TCGATTTTAC CTTTCTTGCT ACAGCGCAGC ACGATTTTTC GGAAAAGATC AATGCCAAGT TACTGGTTGG TTTCAACCCG 
AACCAGCGGA CCTATACGGA ATCTTCAGTT AGTGGAGCGC CCGTTATTGA TCCAAACATC CTGACAATTG GCGGTACACT GAACCAGAAC GCGGCCGATT ACCGGAGCCA GCGCCGTTTA TACGGAATCT TCAGTGAGTT AACGCTGGGT TACGGCAACT TCCTGTTCCT GACGGCTTCG GTTCGTAATG ACCAGTCATC GACACTGCCA GCTAAAAATA ACAGCTACTA CTACCCGGCC GTATCCGGTT CGTTTGTCTT CTCCGACGTG CTGAACCTGC CCAAGAACAT TATCAATCTG GGTAAACTGC GGGCCAACTA CGCCAAAGTG GGTAAGGATG CTTCGCCCTA TCAGGTATTT ACAGCCTATA ACTTGGGTCG TACGTTCTAC AACGGTACCG CCATTTCAAC GGCCAACCTG CCAAGCCAGT TGAACAACGT CAATTTGAAG CCTGAGTTTA CATCGGAGGT GGAGTTAGGT ACTGAACTCC AGTTCTTTAA TAGCCGTATT GGTATTGACG CAGCTTACTT TGATCGGGTA TCGACTGACC TGATTGTTAC CCGGGAGCTA CCCCGTACAT CAGGCTTTGC TACTGAAATA ACGAACGCCG GTAAAATCTC GAACAAAGGT TGGGAGATTG GCCTGACGCT CGTTCCGCTA CGGATGGCTA ATGGCCTGAC CTGGACATCG TACTTTGCAT ACACAAGTAT TAAGTCGAAA GTGGAGGATG CTGGTCCCGG TGGTGAAATT TTCATTGGTG GCACGGGCCT TTCTTCGCTG GGAACCATTT TCCGGAACGG CTTGCCGTAT GGGCAGATCA TCGGTTCGAA GAATGCTCGG GATGATGCGG GTAACCTGTT GATTAACCCA AGCACAGGTC TTCCGATCCG GGCTGCTAAG TCAGACATTA TTGGCGACCC TAATACGAAA TACCAGGTAG GCTGGACCAA TACGGTAAAC TTCAAAAATT TCTCGTTGAG TGTTTTGATG GACTACAAAG CCGGTGGTAG CCTGTTCTCC AGCACGGCTG CTTCGCTGCT GCTGCGTGGC CAGCTCAAGA ACTCCGAAGA TCGCGAAGGT ATGCGGGTAA TTCCGGGTGT ACTGGGTGAT CCGGCTACCT ACAAGCCACT TGTAGGTGAC AATGGTCAGC CGATAAAAAA CACCATCGCC ATGTCGGCCT TCCAGTATCA CTTTACGGAT GGATACGGTG CTTACGGTGC TGACGAGGTA AACATTTACG ACGCTACGGT CGTGCGCTTA CGCGAAGTAT CGCTGGGCTA TAGTGTACCT AAAGCCTTCC TGAAGCGGTA TGCTAAGGTA TTTGGCAGCA TGCGTCTGTC AGTATCGGGT CGTAACCTGT GGTTCTACGC GCCTAACATG CTGAAAGGGC TGAACTTCGA CCCTGAAGTA CTGTCAAACT TCGCCGATTC GAACATCCAG GGCTTTGACC TGGGCGCTTC GCCATCCACG CGCCGTTTCG GTATTAACCT CAATGCTTCA TTCTAA
|
Protein sequence | MRKILLMSLL LVCSFWLPAW AQERTITGKV TAAEDGTPLP GVSVVLKGVA RGTNSDANGA YSLNVPTKGG TLVFSFVGAA SQEIEIGNRS VIDVKLANDA KQLGEVVVTA LGQQRDKKAL AYAVSNVKGD VLQQRSEPDP LRALSGKVPG VNITAGNGAP GAGTRITIRG NNSFTGNNQP LFVVDGIPFD NSVNTPQNGS QGYNTNTVTT NRAYDIDPNN IEAMTVLKGA AASALYGSRA ANGVIVITTK SGSKSARKGL EINFNTSYSV ENVSTVPDYQ NTYTQGSNQT YNGGFIGNWG TVFPSEVDRI NAGLGFERYS KVVDPDYPAG TIPHPLVDAT VPYGAARYQS AFPELLQSNG RGIAVPLKPY DIIGGFFRTG KVMENGIQIT STGDKTSLNA SVSRTKNEGI IPNSFTDRTT LSFGGNATLT NKVNVAGSVA YTNTNQQSPQ SGAGYYADYG GLASAGSIYS RLFYLPRNFD LNGYPFENPV DGSNVFYRAL DNPLWTAKYN LYNSSVNRVY GNMTLSYDVT PWLNFTARGG INTYSETRKN VLRPGGSFSP LGSVSRTDLT NTEIDFTFLA TAQHDFSEKI NAKLLVGFNP NQRTYTESSV SGAPVIDPNI LTIGGTLNQN AADYRSQRRL YGIFSELTLG YGNFLFLTAS VRNDQSSTLP AKNNSYYYPA VSGSFVFSDV LNLPKNIINL GKLRANYAKV GKDASPYQVF TAYNLGRTFY NGTAISTANL PSQLNNVNLK PEFTSEVELG TELQFFNSRI GIDAAYFDRV STDLIVTREL PRTSGFATEI TNAGKISNKG WEIGLTLVPL RMANGLTWTS YFAYTSIKSK VEDAGPGGEI FIGGTGLSSL GTIFRNGLPY GQIIGSKNAR DDAGNLLINP STGLPIRAAK SDIIGDPNTK YQVGWTNTVN FKNFSLSVLM DYKAGGSLFS STAASLLLRG QLKNSEDREG MRVIPGVLGD PATYKPLVGD NGQPIKNTIA MSAFQYHFTD GYGAYGADEV NIYDATVVRL REVSLGYSVP KAFLKRYAKV FGSMRLSVSG RNLWFYAPNM LKGLNFDPEV LSNFADSNIQ GFDLGASPST RRFGINLNAS F
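
The gene and protein sequences above are related through translation table 11, and GC content is a simple base count. The sketch below shows one way to check both in plain Python. It rests on some assumptions: the sequence is pasted in with its 10-base spacing intact, and it is already in coding orientation (as displayed it begins with ATG and ends with TAA, so no reverse complement is applied even though the record lists the minus strand). The function names are illustrative only.

```python
# Amino acids for translation table 11, with codons ordered T, C, A, G
# at each position. Table 11 differs from the standard code mainly in
# its permitted start codons; this gene starts with ATG, so no special
# start-codon handling is needed here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in the record.
first_blocks = "ATGAGGAAAA TTTTACTGAT GAGCTTACTT"
print(translate(first_blocks))  # MRKILLMSLL
```

The printed residues match the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content row.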
|
| |