Gene Information |
Locus tag | Slin_4505 |
Symbol | |
ID | 8728269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5460393 |
End bp | 5463533 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003389284 |
Protein GI | 284039354 |
COG category | |
COG ID | |
TIGRFAM ID | |
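The coordinate and length fields above are mutually consistent, and the check is simple arithmetic. The following is a minimal sketch in Python. It assumes 1-based inclusive coordinates (which the listed Start bp, End bp, and Gene Length imply) and that the final codon is a stop codon not counted in the protein length. The function and variable names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields above.
length_bp = gene_length(5460393, 5463533)
length_aa = protein_length(length_bp)

print(length_bp, "bp")  # 3141 bp, matching the Gene Length field
print(length_aa, "aa")  # 1046 aa, matching the Protein Length field
```

Because the gene lies on the + strand, Start bp is the position of the first base of the start codon and no reverse-complement step is needed.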
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGGGC TAATCCTGCT GCTTTGCACG AATGTGTTTG CGCAAAGCAC TCGCTATACA TTGAAAGGAC GGGTTACCGA TCCTGAAAAA ATGGGGCTTC CCGGAACCAC CGTTGTGCTT GTTGGTACCA CCGTTGGTAC GACCACCGAT GCTGAAGGGA ACTACACACT CCCCGTTACG CTGAAACCTG GTCCCGTTAC CGTTGCCTTT ACGTCGATCG GTTATGAAAC CCTACGCCAG GACGTAACGC TGGGCAATGC CGATGAAGTA ACCGTGAACG CGCAACTCGT GGCGGCTGCG ACAAATCTCG ATGAAGTGGT GGTCACAGGC TCTACGCTAT CGGCTCCCAA ACGGGAGTTA GGCAATGCCA TCAGCACCAT CAAAGCGGCC GATTTAACCC AAAGCGGCTC GGGCAACCTG ATCAACTCCT TACAAGGCAA AGTACCGGGC GCGCAAATCA CGCAGAACTC CGGCGACCCG GCCGGTGGTA TCAGCATTCG TCTGCGGGGT ATCAAGTCGC TGGTTGGCTC GTCGGACCCG CTCTATGTTG TGGACGGGGT TATTGTAAGC AACGCCAGTA CCAACGTCTC GCAACTGGCG CTGGCTAACG ATGTTGGTAA CGCCAACGTC GGCCAGAACC GTCTGTCGGA TATTAACCCA GACGATATTG CCACCATCAA TGTCGTGAAC GGGGCGGCTG CGGCTGCTCA ATATGGTTCA CGAGCGGCCA ATGGAGTAGT GATTATTACC ACCAAGCGCG GCCAGAGTGG CAAGGCGCAG GTCAACTTTA CCACGTCGTT CAACATCAAC GAACTGCGAA AAGGCGTACC CGTCAATACC TATGGCAAGC AGTTCGGCTT TGCCTCGCTT CGGCTATATC CCATCGGGGT TATTTCGGCG GCTCAGGTAG CGGCTAATCC GGGTACAACC ACGACCAGCA TTTACCGCGA CGGGACGAAC TCCCTGCTGG CAACCAACCT CGTGGATGTA CAGCGGTATA ATTACTTCGA TCAAATCTTC CGGACGGGCT ACGGCACCGA CAATAATCTG TCTATTTCGG GTGGTCGCGA CAACACCCAG TATTATGTGT CGTTCGGCTA CCTCAAAAAT GAGGGCATTA TCAAAGGCAC AGACTTTACC CGCTACAACC TCCGCGCCCG CGTAGACCAG CGACTGGCCA ACTGGGCCAA GATTTCGGTG GGTATCAGCT ACTCGAATAG CCTTTCCAAT GAGAAGGCGA ATGGCAACGT GTTCTACAGC CCGATCAACT CGGTCAACAT CACGAACAAC ATCTACGACA TCACCAAACG CGATGCCGCC GGAAACTTGC AGGCTGTGGA GCCATCCCGC GTAAACCCAC TCTCGACCAT CGAAGACATG AAGTTTTCGC AGTCGGTGAG CCGGACGATC AACAGCCTTC AACTGAACCT GACGCCACTG AAAGGTCTGA CCGTTGATTA TATCGTAGGT GTCGATGCCT ATTCACAGTT CGGTAAAAAC TATATTCGGC CTTACCCTTA CCAGTCTGTT GCCCAGTTAC CAGCCGCTCG TTATCCGTTT GGTTTTGCCG CCACAGCCAC TAATCAGGTG CTGCAATTCA ACAATGACCT GAACGCGCAG TATGAAAACC AGTTTAGTGA GAAGTTTAAG CTGAACGCGG CCATCGGCTA CAGCTATCAA TATTATCAGG CTGATTACTC GATCAATAGC GGCCAGAACC TGTCGCCGTT TATCGAGACG GTTAGCGGAG CCAGCAACAC GACCTACTCC GTTCGCTATG ACCTGGACCG TTTCAGCCTG 
AGCGGTCTGT TTGCGCAGGC CACGCTCGGC TATAAAAATC TGGCATTCAT TACCGGGGCC GTCCGTCGCG ATCAGTCGTC GAAATTCTCA CCAACGCAGA CCAACCAGTA TTACCCCAAA GTAAGCGGTT CGTTCATTGT GTCGGACCTT GATTTCTGGA AAAACCTGTC GTTCAGCAAC GCTTTCAACA GCCTGAAACT CCGGACCAGC TATGGCGAAG CCGGTAACCT GTCGGGTATT GGTTCGTATG CCCGTTTTTA CCAGTTTAGC CCGGTGGGTT TCCTGGGTAA AAACACGGTG GTACCCGGTA CGCAACTGGC CAACCCCGAC GTGCAACCCG AACGTATGGC TGAACTGGAG GGCGGTGTTG ATTTGGCCTT CCTGAACGGC CGGATTGGCC TGGGTGTAAC GGCCTACAAC CAGAAGATCA CAAACCTGGT CGTGAACCGG ACGCTGGCCC CCACATCGGG CGGTACGAGC ATTGTGAATA ACGTTGGTTC GATGGAAAAC AAAGGGTTCG AGATTGTGCT GGACGCTACG CCCATCAAAA CCAAAGACCT GAACTGGGAT GTAACGTTCA TTTACAACCA CAACCGAAAC AAAGTCCTTG ATCTGGGCGG TCTGCCGATC ATTAACCCCG ATGCCTCGTC AGCGTCTGGA ACGCCGGTCA ACTTAATCGT TGGCCAACCA GTGGGTGTGT TCTACGGCAC AGGCTATGCC CGTAATCCGG ATGGTTCATT GCTGTTGTCG CCGTCGGGTT TCCCGCAGTC GGAGCGGGCG ACTGGGCAAG CCAATGGCGC TGTCGATTAT GTACCGGCTC GTAATTCAGA CGGTACGCTG GACGTTACCA AACCCCTGGC CAACGTGATC ATTGGCAATC CGAACCCAAA ATGGACGGGT TCCTTCAGCA CCAATCTGTC GTACAAAAAA CTGAACCTGC ATGTCTTGCT GGACATGGTT CAGGGCTCGG ATGTGTTCAA CGCGGACAAG CGGACGCGTC AGGGTGTAGG TCTTGGCGAC TATGCCGAAC AGGAGTTACG GGGAACGCTG AAGCGGGGTT ATATCTTCGG CATTTACAAC ACGCAGGAGT TCCGGGTCGA TCCGGGTTCG TACACCAAAC TGCGCGAGGT GTCGCTCAGC TATACGCTGC CAACTTTTAT CAAGTCGATC AGCCGGTTAA CTATTTCGGC GGTGGGCCGC AACCTGTATT CGTGGGATAA GTACACCGGC TTCGATCCGG AAACCAACGC GGGTGGCAAC AATGACCTGC TGCGCGGCAT CGACTTCGGA AACGTCCCCA TTCCCCGTAC GTACCAATTC AAACTGTCAG CGACATTCTA A
|
Protein sequence | MLGLILLLCT NVFAQSTRYT LKGRVTDPEK MGLPGTTVVL VGTTVGTTTD AEGNYTLPVT LKPGPVTVAF TSIGYETLRQ DVTLGNADEV TVNAQLVAAA TNLDEVVVTG STLSAPKREL GNAISTIKAA DLTQSGSGNL INSLQGKVPG AQITQNSGDP AGGISIRLRG IKSLVGSSDP LYVVDGVIVS NASTNVSQLA LANDVGNANV GQNRLSDINP DDIATINVVN GAAAAAQYGS RAANGVVIIT TKRGQSGKAQ VNFTTSFNIN ELRKGVPVNT YGKQFGFASL RLYPIGVISA AQVAANPGTT TTSIYRDGTN SLLATNLVDV QRYNYFDQIF RTGYGTDNNL SISGGRDNTQ YYVSFGYLKN EGIIKGTDFT RYNLRARVDQ RLANWAKISV GISYSNSLSN EKANGNVFYS PINSVNITNN IYDITKRDAA GNLQAVEPSR VNPLSTIEDM KFSQSVSRTI NSLQLNLTPL KGLTVDYIVG VDAYSQFGKN YIRPYPYQSV AQLPAARYPF GFAATATNQV LQFNNDLNAQ YENQFSEKFK LNAAIGYSYQ YYQADYSINS GQNLSPFIET VSGASNTTYS VRYDLDRFSL SGLFAQATLG YKNLAFITGA VRRDQSSKFS PTQTNQYYPK VSGSFIVSDL DFWKNLSFSN AFNSLKLRTS YGEAGNLSGI GSYARFYQFS PVGFLGKNTV VPGTQLANPD VQPERMAELE GGVDLAFLNG RIGLGVTAYN QKITNLVVNR TLAPTSGGTS IVNNVGSMEN KGFEIVLDAT PIKTKDLNWD VTFIYNHNRN KVLDLGGLPI INPDASSASG TPVNLIVGQP VGVFYGTGYA RNPDGSLLLS PSGFPQSERA TGQANGAVDY VPARNSDGTL DVTKPLANVI IGNPNPKWTG SFSTNLSYKK LNLHVLLDMV QGSDVFNADK RTRQGVGLGD YAEQELRGTL KRGYIFGIYN TQEFRVDPGS YTKLREVSLS YTLPTFIKSI SRLTISAVGR NLYSWDKYTG FDPETNAGGN NDLLRGIDFG NVPIPRTYQF KLSATF
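The protein sequence follows from the gene sequence by codon-wise translation. The sketch below, in Python, translates the first 30 and last 21 bases of the gene as printed above and reproduces the first and last residues of the protein. It uses the standard codon assignments, which translation table 11 shares (table 11 differs from the standard code in the start codons it permits, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative helpers, not part of the record.

```python
# Standard codon table, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a + strand coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record)
    is removed before translation.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence, copied from the record.
gene_start = "ATGCTTGGGC TAATCCTGCT GCTTTGCACG"
gene_end = "AAACTGTCAG CGACATTCTA A"

print(translate(gene_start))  # MLGLILLLCT
print(translate(gene_end))    # KLSATF
```

The gene is 3141 bp long, so its last 21 bases are in frame with the start codon, and the final codon TAA is the stop codon that is left out of the 1046-residue protein.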
|
| |