Gene Information |
Locus tag | Slin_4224 |
Symbol | |
ID | 8727983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 5089716 |
End bp | 5092907 |
Gene Length | 3192 bp |
Protein Length | 1063 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003389008 |
Protein GI | 284039078 |
COG category | |
COG ID | |
TIGRFAM ID | |
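The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the source record, and it rests on two assumptions: that Start bp and End bp are 1-based inclusive coordinates, and that the CDS is complete and unspliced with a single terminal stop codon. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the record above.
START_BP = 5089716
END_BP = 5092907


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS that ends in a single stop codon.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 3192, matches "Gene Length"
print(expected_protein_length(length_bp))  # 1063, matches "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields listed in the record.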
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.25512 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAGAA TTCTAATATT GAGTTTGCTG TTCATAGGCT CAATCTGGTC TACAGCGTGG GCTCAGGAAC GGAGAGTAGT CGGCAAGGTT ACGTCGGCAG AGGATGGGTC CGCTTTACCC GGTGTTTCGG TGGTAGTAAA AGGATCGACG AAAGGGACAA CGACGGATGC CAGTGGTATT TATAGTCTTA CGGTACCCAG TGGAAAAGGG ACAATTCTGG TATATAGCTT CGTCGGTGTT ACGACGCAGG AAGTTAAACT CGGCAGTGAG TCGGAAGTGA ATGTAAGCCT GGTGTCGGAC TCGCGTCAAC TGTCGGAAGT TGTGGTAACA GGGGTTGGGG TGGCTACCTC AAAAACCAAG TTAGGTATTG CCGTGGAATC AGTATCGGCG AAAGACCTTC CCGCTGCACC AACGGCTTCT ATCGACCAGG CACTGGTTGG TAAAATTGCC GGTGCTCAGA TCGTGAGTGC CAACGGTACA CCTGGCTCAA AAGCCAACAT TCTGCTGCGC GGTATCAACA CAATTAACCG GGGTACATCG CCAATTGTGT TGATGGATGG TGTGCAGGTG GGTTCTACCG ACATTAACAG CCTCGATCTT AGTACCATTG AGCGGGTTGA AGTTATCCAG GGTGCGGCTG CCGGTACACT GTACGGAGCA CAGGGTGCCA ATGGCGTTAT TCAATTGTTC AGCAAACGGG GTAAAGATGG CCCGGCGCAG ATCAGCTTCT CGAACAGCTA TGCCAGCAAC ACCTACATCA ACAAAGGTGG TGTGGCCCAG GCCGATAAAC ACGCCTTTGT AACGGATGCC AGCAACAATG TAATTGGTGT GTCCGGTAAG CCCCTTGCTT TCGATCCGGC AACCAGCACC TGGAGCGAAA ACGTACAGTA CAATGCCCTG GATGTAAACA GCCAGGCCAA CAAGCCCTAC GACCAGAACC TGAAGTTCTA CGATTATTAC AAGATGTTTT TCCGGCCTTC GGAAACGATC AACAACTCGC TGAACATCTC GGGCGGAAGC GGTAAAGCCG ATTACAGCAT CACGGCTTCC AATAGCTACC AGTCGACCAT TCTGAAAAAC AATGGTGCTT TCAACCGGAG CAACCTGGTT AGTAACATTG GGATGACCCT TGCCAAGAAT TTGACCTTGC GCAGCATATC TCAGCTGGTA TATACCAAGA ATACGATCAA CACCTACGAC CGGGCTGTCT GGTACGATAT CAACAACACC CGTCCGTTCG ACAATTACGA TTATGTTGAT CCCGACGGAA ACTATGCCGC TTATTTCGGC AGCGCGTCGG GCGTAAACGG GTACAACCCG AACTACCGTC TGCAATACCG AAATCACGTT GATAACAAAG TAGACGTTAT TCAGAGCGTT GAACTGGACT ACAAGCCCAT TAAATACCTG GACCTGAATG CCCGCTATGG TCTGAACTAC ACCCAGGAAG GCGAACGCTA TACGTATGGC AACCAGACCC TGAACCGCAA CATCATTGCC AACGGAGCCG GTTATGCTAC TTCGCTGAAC GCCAGTGACG CCAAAGGGGA GATTTCGACC TACGATTACA AAACCGTATT CCAGAACTTC CTGGCCAGTG CCTTCATCAA AACGGATTTC CAGGAAGATT TTAAGCTGAA TATTCCCATC CGTACATCAA CGCAAATCTC GTTCGACTAT CGGAAAAACA ATTACAAGGA ATTCGATACC TATGCGCTGG GCGTACCTAC CTACAACCCG TATACAGCCG CTCAGGCCAG TACCTACCGG GTTTCGCTTG ATAACAGCAC GCCATTCGTA 
ACGTACGGAT ACCTGATCAA TCAGCACATC GAATACGGTG AACTGCTGGG TGTTACCGCC GGTTTCCGTT CCGATTATTC ATCGGCATTC GGACGTGGGT CGACGCCGTT TACCTTCCCG CACGCCGATG GATATATCCG TCCTTCGTCG CTTACGTTCT GGCAGAACAG CGCCCTGGGA ACGTACGTGC CTGAGTTCAA ACTGCGGGCG GCTTACGGAC AGGCGGGTAT TCAGCCTAAG CCGTTTGACC GGTACGTAAC GCTGGGTACG CGTACGTTTG GAGCTAACAA CGTCTTTTAT AACACTGTTA CACAAAGTAA CCCGGATTTG GGCGTAGAGG TGTCGAAAGA ACTTGAACTG GGAACGGACT TCACGATTAA AGGCGGCAAC GGCGATTGGC TCCGTAAGCT GAACTTCTCG TTCAGCTACT GGGATCGTTC TACCGACAAT GCCATCTATA ACGTAAACTC GGCTCCGTCT ACGGGTATCG GTACGGTGAA AGACAATGCC TTCTCGCTAT CTTCGAAAGG TACCCAGTTC TCCCTGAACG CGACCGTTTA CCGGGGGCGT AGCTTTACCT GGAATTTCAC GACGAACTTT GGCCATCAGA GTTCACAGAT CGATGCCGTA AAAGGAAATC AGCAGATCGT TGTGACATCC AGCGCGGGTA GTACGAACTA TGTACTGAAA GCCGGTCAGA AAATTGGTCA GCTGTTCGGA TTCCTGGCTA TTCATAGCCT TGATCAGGTG TTGCCCGATG GCAAGCCCGC CATTGCCGAA AGCGCTAAAG CAAATTACGA AGTGGCCAGC AACGGCTACG TGGTCAACAA AACGACCAAG CAGCCTCTGT TTAGCTCGGC TCAGTACAGC TTCGGCGATC CGAACCCTAC GTTTGTGTCG TCGTTCATCA ACGATATTTC GTTCCGCGAC ATCGTAACAC TGAACTTCCA GTTCGACTGG ACGCAGGGCA GCCACATTTA TAACCAGACG AAAGAGTGGA TGTACCGCGA TGGTATCCAC AAGGATTACA CCAACCCAAT TACGATAAAT GGTCAAACGG GTGCCTGGAC GGCCTTCTAC CGGGGTGTTT ATCAGGCGGG TGCCAACAAC GGAACGAAAG ATTACTTCTA CGAAGATGCT TCGTTTGTAC GGCTTCGGAA CGTTGCGCTG GGTGTAGAGT TGACCAAGCT CATCAAGTTG CCGATGCGCC GGTTACAGGT TGTCTTCAGC GGTCGTAACG TGCTCACCTT CACGAAGTAC ACAGGATTCG ATCCTGAGGT AAGCTCCGGC CAGACAACGG GTAACGAAAG TTCGGCATGG GATCGGGGTA CGGACCATAA CACGACGCCA AACAACCGTT CGTATCAGGT TTCTCTCAAT TTTGGCTTCT AA
|
Protein sequence | MSRILILSLL FIGSIWSTAW AQERRVVGKV TSAEDGSALP GVSVVVKGST KGTTTDASGI YSLTVPSGKG TILVYSFVGV TTQEVKLGSE SEVNVSLVSD SRQLSEVVVT GVGVATSKTK LGIAVESVSA KDLPAAPTAS IDQALVGKIA GAQIVSANGT PGSKANILLR GINTINRGTS PIVLMDGVQV GSTDINSLDL STIERVEVIQ GAAAGTLYGA QGANGVIQLF SKRGKDGPAQ ISFSNSYASN TYINKGGVAQ ADKHAFVTDA SNNVIGVSGK PLAFDPATST WSENVQYNAL DVNSQANKPY DQNLKFYDYY KMFFRPSETI NNSLNISGGS GKADYSITAS NSYQSTILKN NGAFNRSNLV SNIGMTLAKN LTLRSISQLV YTKNTINTYD RAVWYDINNT RPFDNYDYVD PDGNYAAYFG SASGVNGYNP NYRLQYRNHV DNKVDVIQSV ELDYKPIKYL DLNARYGLNY TQEGERYTYG NQTLNRNIIA NGAGYATSLN ASDAKGEIST YDYKTVFQNF LASAFIKTDF QEDFKLNIPI RTSTQISFDY RKNNYKEFDT YALGVPTYNP YTAAQASTYR VSLDNSTPFV TYGYLINQHI EYGELLGVTA GFRSDYSSAF GRGSTPFTFP HADGYIRPSS LTFWQNSALG TYVPEFKLRA AYGQAGIQPK PFDRYVTLGT RTFGANNVFY NTVTQSNPDL GVEVSKELEL GTDFTIKGGN GDWLRKLNFS FSYWDRSTDN AIYNVNSAPS TGIGTVKDNA FSLSSKGTQF SLNATVYRGR SFTWNFTTNF GHQSSQIDAV KGNQQIVVTS SAGSTNYVLK AGQKIGQLFG FLAIHSLDQV LPDGKPAIAE SAKANYEVAS NGYVVNKTTK QPLFSSAQYS FGDPNPTFVS SFINDISFRD IVTLNFQFDW TQGSHIYNQT KEWMYRDGIH KDYTNPITIN GQTGAWTAFY RGVYQAGANN GTKDYFYEDA SFVRLRNVAL GVELTKLIKL PMRRLQVVFS GRNVLTFTKY TGFDPEVSSG QTTGNESSAW DRGTDHNTTP NNRSYQVSLN FGF