Gene Information |
Locus tag | Slin_4031 |
Symbol | |
ID | 8727789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4843540 |
End bp | 4846902 |
Gene Length | 3363 bp |
Protein Length | 1120 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003388820 |
Protein GI | 284038890 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
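The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length (the listed gene sequence ends in TAG). The function names are illustrative only and are not part of any database API.

```python
# Minimal consistency check for the record's coordinate and length fields.
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.
# Assumption: the last codon is a stop codon and is not translated.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values copied from the record for Slin_4031.
start_bp = 4843540
end_bp = 4846902

gene_len = gene_length_bp(start_bp, end_bp)
prot_len = protein_length_aa(gene_len)

print(f"Gene length: {gene_len} bp")
print(f"Protein length: {prot_len} aa")
```

With the values from this record, the computed lengths should match the Gene Length (3363 bp) and Protein Length (1120 aa) fields.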
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.280222 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGACC TGTTCAGCCA CGCCCATGAA TCCAGCGCAC CTTCACTTCA CCTGAGTGAT GTGCTTCAGG AAGAGTACAA CTTCATCCAC GACATTATCC CTGCTCCTCC CGAAATACCG TTGACCGATC ATGATCAAGA CCAGCCCGAT TCGGAGGATG TTCACCGGTT CAGGGTCAGT GAACCGGTCG AATTGTTCAG GAAGCTGGAG GCCGAGCGAG AACACATTGA GGCTACTGCG AAAACAGCCT TACCGAAACA GACACATACT GAAACATCCT ATACCCGTCT GGCCGAGTGG CTTGCCGAAA AACCCCTGCG TGAAGACCTC GTCAGAGATA CCCTTATAGC CAAAGCCGCG AACCCCGATT TATTGACTGA AGAACCCTGG TTACTGAACG CAGCTTTACG GCCTTACACT CGGGGGTTGC TTAAGCGTTA TCTACAATGG AAAGATGGAC GACACAAGGA CACCACCCAT TTCCCAACCG ACGAATTGAA CCAGCTTTTG CTGGAAGATG CCTTCGCGGA GCACATCGTG CCCAGAGAAA AATCCGCATT AGACACGATT TTCGGTAAAA TGCTCGGTAC GTACCAGAGT GCCTTATGCC TGTCGGGCGG GGGTATCCGA AGCGCCACCT TTGCGCTGGG TGTCATGCAG GGCCTCGCTC AGCACAACCT GCTTGGGCGA TTCTCCTACC TCTCTACCGT ATCGGGCGGT GGGTATGTAG GCGGCTGGCT TAGTGCCTGG CGGCACCACG AAGGGCTGGA CAAAGTCATT ACCAAACTTA AGACTACCAG CAGTACGCCC ATTGCCAGTG AAGCTGACCC GATCCGGCAT CTGCGCCAAT TCAGCAACTA CCTGAGCCCG CAGTTGGGCC TATTCTCCGC CGACACCTGG ACACTCGTTG CCACCTATAC ACGTAACCTG CTGTTAATCT GGCTGGTGAT TCTGCCCTTT CTGGCGGCTC TCGCGGCCGT TCCCTGGGTA GGCGTTACCC TGGCCTCTGC CAAACTGAAT CCTGGCGATA CCATTTGGTT CTGGATTATG GGGAGTTTAC TGGCGGTAGC GGGTGCGTTG TCCGTGATGG CTGTCTATTT TGTCCATTCG TACATACCGA CACCCGAAAC AAAAAAACCG TCCGAAAAAA AACTCACCGA CATTCCCCTG AAATCAGACC GGGATCAAAC CGCGTTCATT AACAAGTGCC TGCTGCCGTT CTCCGTCGCT GTTCTTTTGT TGATACTGGT GTGGATCTGG TTCACTAAAC TGGACTCAAC CAGTACACAT TGGGCGGGCA ATATATATAG ATCGCTGGGA CTCAACCAGC GAACCGGACT GAACATTTTC TCGGAGGGAG GCTACTGGAT TATGGGCGGT ACCACACTGG CCCACGTCAT AGGCTGGCTA CTGGCCCGGC CAACACCGAA GAAGTTTTAC TTACAGTTCC TGATGTTTCT GGTGATCGCC GTTGTAGGAG CTATGGCAGG TTTTTTGTTA TTGCTGACCG CCAAACTGTT GCGTTCATCA TCCATAGAAT TGTACACCTG TCTGGCTTTT CCCTGTTTTA TGCTGTCCAT TTTGCTGGTT GGTTATTTCT TTGAAGGCGT TGTCAGCCGA TACCTCGACG ATGCCCGACG CGAATGGACG GCCCGGTACA GCGCCTGGCT GCTGATTGCG GCCCTGGGCT GGCTCGTTTT ATCGAGTGTC ATCCTGTTTG GACCGGGGCT TATTGACGCC ATAAAGCTGC AAGTCGCCAG CATTGGCCTT GGTTCGGGTA TTTTAACGGC TCTGCTTGGA 
GGAAGTGCGC AGAGTGCCGG GCGGGGCGAG GGTGCCGGAT CGCGTCAGGG AAAAGGCAAC GCGTCGGGCA TCATCGGGTT ACTATCCCAA TTCAGCCTGC CCATTGTCGC TACGCTGACC ATCTTTATCC TGATGGTCAT GCTTTCGCTG CTGAACCAAA CGCTGGCCGG GCTGCTGACG GATCAGCTTT ATGACTGGTT CGGCGACAAT ACATCTGGCC GTATCCTAAC CGTTTTTACC CCTCTGATTC TGCTCATCCT CTTTCTGGTG GCAGGCTGGC TGCTGGCTCT GATGATCGAT ACCAATCGGT TTTCGCTCCA TGCGATGTAT AGAGCCCGGC TGATCAGGGC GTATCTGGGG GCCTCGCGTC CGCAGGAAAC ACGCACCCCC GACCCGTTCA CTGGCTTTGA TGAAGACGAT AATATTCCCA TGGGCCAATT GAAGGTGGAT TCCTATACGA CACCGACTAC AAATACGCCG AACGAGGTTG GCCCCGAAAC TAAACCAGAA GCAACGCCAA AGAAGCCCCT GTTTCATATT ATAAATCTGG CTCTCAACCT GGTAAACGGG CAGAATCTGG CCTGGCAGGA GCGAAAAGCG GAAGCGTTTT CCATTTCACC CCTGCACGCC GGAGCCATGA ATCTGGCGTA TCGGCGAACC CGCGTCAAAA TCAACCCTAC TGATTACCGC TCCGGGCAAG AAAACCCGGC TTTGTCGACA CCGGAGTATA ACTGTTATGG CGGTAAAAAA GGTATCAGTT TAGGTACAGC CATAACCATA TCGGGAGCGG CTGCCAGCCC GAATATGGGT TATCACTCAT CTACTCTGGT TGCTTTTCTG ATGACCTTAT TCAACGTCCG GCTTGGCTGG TGGCTGGGAA ACCCCGGTCC GGCGGGCGAC AAGACGTTTG ATAAGTCGAC GCCCGACCTG GCCGTTAAAC CCATCTGGGA TGAACTTCAG GCCAATACCG ACGATACTAA CGAATATGTG TACCTGTCGG ATGGCGGGCA CTTCGAGAAT CTGGGCCTTT ATGAAATGGT GTTACGGCGC AACCGGTTTA TTGTCGTGAG CGACGCCAGC TGCGACGAAT CCTGTACGCT GGAAGACCTG GGCAATGCAA TCCGTAAAAT CCGTATTGAC CTGGGTATAC CCATCGAATT TCAGGGCAAC TTCCCCATTC AGGCCCGGTC AACCAATGGG GTCAATGCAG AAGGAAAATA CTGGGCACTG GCCCGCATTG GCTATTCCGC CGTTGATAAG CCGACTGCTG CAACAGACCC CGACGAGGTG GATGGTCTGC TGCTTTACAT TAAACCTGCT TTCTACGGCA ACGAACCCCG CGACATATTC AATTATGGCT CTACCAAAAG TGCTTTCCCC CACGAATCGA CGTCGGATCA GTTCTTTTCA GAAAGTCAGT TTGAAAGCTA CCGGGCGCTG GGCAGACATG CTTTCGAGAC CATGCATACC AGCTTCAAAA AAGAAGCCGG TGTAGAATTA AATGAACTGT TTACGAAAAA TGGACTTGCC CTCCACTGGA AGTTCATGAA GACCAAAAGC TAG
|
Protein sequence | MPDLFSHAHE SSAPSLHLSD VLQEEYNFIH DIIPAPPEIP LTDHDQDQPD SEDVHRFRVS EPVELFRKLE AEREHIEATA KTALPKQTHT ETSYTRLAEW LAEKPLREDL VRDTLIAKAA NPDLLTEEPW LLNAALRPYT RGLLKRYLQW KDGRHKDTTH FPTDELNQLL LEDAFAEHIV PREKSALDTI FGKMLGTYQS ALCLSGGGIR SATFALGVMQ GLAQHNLLGR FSYLSTVSGG GYVGGWLSAW RHHEGLDKVI TKLKTTSSTP IASEADPIRH LRQFSNYLSP QLGLFSADTW TLVATYTRNL LLIWLVILPF LAALAAVPWV GVTLASAKLN PGDTIWFWIM GSLLAVAGAL SVMAVYFVHS YIPTPETKKP SEKKLTDIPL KSDRDQTAFI NKCLLPFSVA VLLLILVWIW FTKLDSTSTH WAGNIYRSLG LNQRTGLNIF SEGGYWIMGG TTLAHVIGWL LARPTPKKFY LQFLMFLVIA VVGAMAGFLL LLTAKLLRSS SIELYTCLAF PCFMLSILLV GYFFEGVVSR YLDDARREWT ARYSAWLLIA ALGWLVLSSV ILFGPGLIDA IKLQVASIGL GSGILTALLG GSAQSAGRGE GAGSRQGKGN ASGIIGLLSQ FSLPIVATLT IFILMVMLSL LNQTLAGLLT DQLYDWFGDN TSGRILTVFT PLILLILFLV AGWLLALMID TNRFSLHAMY RARLIRAYLG ASRPQETRTP DPFTGFDEDD NIPMGQLKVD SYTTPTTNTP NEVGPETKPE ATPKKPLFHI INLALNLVNG QNLAWQERKA EAFSISPLHA GAMNLAYRRT RVKINPTDYR SGQENPALST PEYNCYGGKK GISLGTAITI SGAAASPNMG YHSSTLVAFL MTLFNVRLGW WLGNPGPAGD KTFDKSTPDL AVKPIWDELQ ANTDDTNEYV YLSDGGHFEN LGLYEMVLRR NRFIVVSDAS CDESCTLEDL GNAIRKIRID LGIPIEFQGN FPIQARSTNG VNAEGKYWAL ARIGYSAVDK PTAATDPDEV DGLLLYIKPA FYGNEPRDIF NYGSTKSAFP HESTSDQFFS ESQFESYRAL GRHAFETMHT SFKKEAGVEL NELFTKNGLA LHWKFMKTKS
|
| |