Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_2107 |
Symbol | |
ID | 8725845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 2545026 |
End bp | 2549075 |
Gene Length | 4050 bp |
Protein Length | 1349 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | two component transcriptional regulator, AraC family |
Protein accession | YP_003386941 |
Protein GI | 284037011 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000288147 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.419692 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCTTTA AACGTTTACT AGTTCTCCTT TGGCTGTGGT GGCCATGCAT CGTTTCAGGC CAGTCGGGTA AGCTGTTCAC GGTCGAAACG GGCCTGTCGG GCAGTCTGGT CACGGACATC CATCAAGACC AGAGTGGCTT TGTCTGGATA GCCACCGAAG ACGGATTGAA CCGATTCGAC GGGATAAAAT TCACGGTCTA CCAGCACAGC AAAAAAGATA CGACCAGCCT GCTGAACAAC CAGATTCATG CGCTCTTTGC CGACAAAAGC GGGCGCATGT ATGTTGGCTC CACCGAAGGG CTCCAATATT ACGACCCGGC CACCGATGCC TTCCACACCA TACCGCTAAA ACTGCCCAAC GGGCAATCGA TCAACGCCAG TGTGTCGACC CTGTGCCAGC GAAAAAATGG CGATGTGCTG GTGGGCACCT CCGGTCACGG CACCTTCAAA CTAATCCGAA AGGGCCCCCT CTTGTTTGGC CGGAAAGTAC TGGCAAATGC GCCCAGCGAA ATAATTATCA GGATTCTGGA GGATACCGAG CAAAATCTCT GGGTATCGAC GGAAGATAAG GGCCTCTATC GATTCAGCGG CCAGTCGGTG AACGCGTATT TCGTATCCAA GAAATCGCAG AACAACATCA TCAGCAGCAT CTGCCAGGAT CGTTATGGAC GGCTGTTTGT GGGCAACATG AGCAGTGGTT TGTACCGCTA CGACCCGGCC AGTAACACCT TCCTGTCGAT TCCTTACAAC GGGCGCACCG ACCTGCCGGT GGCCGATCTG CTCGTCAATC GAAACAATCA GATCGTTGTG GCCACCAGCG GGAAAGGCAT GAAGTATTTC GATCCGGTGT CCGACAAAAT TCTGGACCTG GAATCCAGCG TGACCAGCTT CGATTTTTCG CGAACCAAAG TGAACACCAT GCTGGAAGAT CAGGCCGGTA ACATCTGGCT TGGTATTTAT CAGAAAGGCC TGCTGCTGCT GGCGGGTAAC ACCAACCGCT TCGGGTATAT TGGCTACAAA TCGGTGAGTC GCAAGTCGAT TGGATCGAGT GCCGTTATGG CTCTGACGCA GGATACGCAG GGAACCGTTT GGGTAGGCAC CGACAGCGAT GGCCTGTTTG CCTTACCGGC CACCAGCAAC GCCAGTGTTC ATTACCTGAC GGATAAGGAA GGCGGTACGG CCCCATCGAA CATTCTGACG ATTCACGAGG ACTCCCAAAA AAATCTGTGG CTCGGCTCGT ACCTGACGGG CTTGTCCCGC TTCGATCGCA ACACCGGGAA AAGCACCTAC ATCGACAAGC TGGTCGATAA ACGGGGCGAC AACGTACAGC GTGTGTTCAG TATTAAGGAA GATGCCCGCC AACGGCTTTG GATCGGCACG ATGGGTTCCG GTGTTTTCCA GCTGGACCTA CGGTCGGGGG CGGTCAAAAA CTTTGAAGCC TTACCCGGCA AGCTCTACCG GCCCGAACTC AACTACATTC CGAACAGCTG GATCAACTGC CTGCTCGTTA CGAAAGACAA TAAACTATTC ATCGGCACCT TCGACGGGCT GGGCTGTCTG GACCTCGACA CCGAAAATTT CGTATCTACG CTGGGTACGA ATCGGTTGAT GGCCGGTACC GTCGTGTACT CACTCTACGA AGACAATAAG GGCAATTTGT GGATAGGCAC GTCGCAGGGA CTGAAAAAAA TGAACCGTTC GACGAAGGAA ATCACCTCGT TCGACGCGGA CCAGGGCTTA CCCGGCAACT TCATCTGGGC CATACAAGGT GATAAGATGG GCGATCTCTG GCTCAGCACC AACCGGGGAC TTTCGAAGAT GGATGTCGAC TCCAGCACGT TCACTAATTT CTATTCCTCG GATGGGCTGC AGGGGAATGA ATTTAGTCGG GGAGCCTCGC TCAACACCAG CAGCGGGGAG TTGTTTTTCG GTGGCGTGAA CGGCATTTCC TATTTCAAGC CCGACGAAGT CCGCGTTGCC AACAAGCGGC TGACGGTGCG AATCGTTGAC CTGTTTATCC GAGACAAACC CGTTAAGAAG GGAATGCTGT CGGGCAGTTT CCAAGTGGTC GATACCACCG TTACGAACGC TACCGAATTT AATCTGGCTC ACCATGATAA CTCGTTTACG CTGGAATTCT CGACCATGGA TTTCATCGAT GCTGAGCGGG TAGCCTACCA ATATTCAATA AACAACAACG ATTGGGAGGA ATTAAGACCC GGTTCTAACC GGCTAACCTT CGATAATCTG TCGCCGGGCC GGTACGAATT TCAGCTACGG GCAAAATTCA ACAGCGCCGT TTCAGAGATT CGGCAGGTAA CGGTCATCGT CCATCCGGTC TGGTATTTAT CGACCTGGGC TAAACTTGTT TACGCGCTAC TCTTCCTGTT GCTGGGTATA CTGATTGCCC GAATTATCAA AAACCGGCGA CAGGCAAAAG CGGAATTTCT GGCGCACCAG CGTCAGGAAG AAATCAACGA AGCCAAGCTT CAGTTTTTCA TAAACATCGC CCATGAGATA CGGACTCCCC TAACGCTGGT GGTCAGTCCG CTGCAAAAAC TCATTACTAC CGATCAGAAC GAGGAACGCG GCCATCTGTA CGACATTATG GGGCGTAACA CAAAACGAAT ACTGGATCTG GTGAACCAGC TCATGGACAT CCGGAAAATT GAGAAGGGGC AGATGACGCT GCAACTCGCC CAGATTGAGA TGACCCGGTT TACGAAAGAA GTTTGTCAGC TTTTTGAGGA ACAGATTCTT TCGAAACAAA TTCAGTTTAT CCTGGACGTT CCGCCAGAGC GGATTTTTGC CCGCATCGAC CCGCAGAATT TCGACAAAGT ACTGATCAAT GTATTGTCCA ATGCGTTCAA ATTTACCCCT TCTGGCGGCA CGATCCGGGT GGCGCTGACC GTGGGCCATT CTGAAACGGC GGATAGTCAA CCCGATTTAG CGATTGCCAT CGAAGATTCG GGACATTGCA TCAGTGAGCC GGAGACCGAA CGAATATTCG AATGTTTTTA TCAATCCGAA GAACACCGGG GGTATAATCA GCAGGGTACG GGCATTGGTC TCCATCTGGT AAAGCAACTG GTTGAGTTGC ACGGGGGCAG CATCAAAGCC GAAAATATGG ACCCGGTCGG TTGTCGTTTT CTGATCACCT TACCGCTCGA ATCGGCTGTT GACGAGACAG CGGCACCGGT GGCAGGATTG CCTGTCGGCC ATCACCAATT GCCGGAGTTG GCAGAGAATG GCCAGACCGA AGTGAAATCC CGGAAGAAAG CCCGTCGTAT CGTTATTGCC GACGACGACA CGGAGATCTG CCATTACCTC ACGGATGAAT TATCGGGCGA GTACACGATT TTTGCCTATG CCAATGGCGA GGATGCGTAT AAAGGAATCG TGAGAGAAAA ACCCGATCTG GTGGTCAGTG ACGTCATGAT GCCGGTAATG GACGGAATGG CCCTCTGCCG AAAGTTGCGG GCAAATCCAC TGGTTAACCA CATACCCGTT ATTCTGCTGA CGGCCAAAAC GGAGGAGTCG AGTAATGCAC TGGGATTTGA ATTAGGAGCC GACGCTTACA TCACCAAACC GTTCAACATC GACATACTGT CGAAAACAAT TAAAAGCCTG ATCCGAAATC GGCAGATCAT CCTGAACAAT GAAAGCGAGC AGCAGTATCA GGAGGAGTTC ATTTCGAAGG TCAGTATCAA ATCCGCCGAT ACAAAGCTCC TGGAAAAAGT ACACGCCCTG ATCAATAAAA ACCTGTCTAA CCCCGACTTA AGTGTCGAGA TGATTGCCGG TGAAATTGGT ATAAGCCGGG TACACCTTCA TCGGAAATTA AAAGAGTTGA CAAACTCGAC AACGCGGGAT TTAATCAGGC ATATCCGACT AAAACAGGCG GCTGATCTGC TGACTACGAA GGGATTGACG GTTTCGGAAG TCGCCTTTGC CACGGGCTTT GCGAATGTGA ATAACTTTTC GGTGTCCTTC AAAGAATTGT ACGGTGTATC GCCCGCTCAC TATGCCGAAC AACAGTTGGT AAAACATTGA
|
Protein sequence | MLFKRLLVLL WLWWPCIVSG QSGKLFTVET GLSGSLVTDI HQDQSGFVWI ATEDGLNRFD GIKFTVYQHS KKDTTSLLNN QIHALFADKS GRMYVGSTEG LQYYDPATDA FHTIPLKLPN GQSINASVST LCQRKNGDVL VGTSGHGTFK LIRKGPLLFG RKVLANAPSE IIIRILEDTE QNLWVSTEDK GLYRFSGQSV NAYFVSKKSQ NNIISSICQD RYGRLFVGNM SSGLYRYDPA SNTFLSIPYN GRTDLPVADL LVNRNNQIVV ATSGKGMKYF DPVSDKILDL ESSVTSFDFS RTKVNTMLED QAGNIWLGIY QKGLLLLAGN TNRFGYIGYK SVSRKSIGSS AVMALTQDTQ GTVWVGTDSD GLFALPATSN ASVHYLTDKE GGTAPSNILT IHEDSQKNLW LGSYLTGLSR FDRNTGKSTY IDKLVDKRGD NVQRVFSIKE DARQRLWIGT MGSGVFQLDL RSGAVKNFEA LPGKLYRPEL NYIPNSWINC LLVTKDNKLF IGTFDGLGCL DLDTENFVST LGTNRLMAGT VVYSLYEDNK GNLWIGTSQG LKKMNRSTKE ITSFDADQGL PGNFIWAIQG DKMGDLWLST NRGLSKMDVD SSTFTNFYSS DGLQGNEFSR GASLNTSSGE LFFGGVNGIS YFKPDEVRVA NKRLTVRIVD LFIRDKPVKK GMLSGSFQVV DTTVTNATEF NLAHHDNSFT LEFSTMDFID AERVAYQYSI NNNDWEELRP GSNRLTFDNL SPGRYEFQLR AKFNSAVSEI RQVTVIVHPV WYLSTWAKLV YALLFLLLGI LIARIIKNRR QAKAEFLAHQ RQEEINEAKL QFFINIAHEI RTPLTLVVSP LQKLITTDQN EERGHLYDIM GRNTKRILDL VNQLMDIRKI EKGQMTLQLA QIEMTRFTKE VCQLFEEQIL SKQIQFILDV PPERIFARID PQNFDKVLIN VLSNAFKFTP SGGTIRVALT VGHSETADSQ PDLAIAIEDS GHCISEPETE RIFECFYQSE EHRGYNQQGT GIGLHLVKQL VELHGGSIKA ENMDPVGCRF LITLPLESAV DETAAPVAGL PVGHHQLPEL AENGQTEVKS RKKARRIVIA DDDTEICHYL TDELSGEYTI FAYANGEDAY KGIVREKPDL VVSDVMMPVM DGMALCRKLR ANPLVNHIPV ILLTAKTEES SNALGFELGA DAYITKPFNI DILSKTIKSL IRNRQIILNN ESEQQYQEEF ISKVSIKSAD TKLLEKVHAL INKNLSNPDL SVEMIAGEIG ISRVHLHRKL KELTNSTTRD LIRHIRLKQA ADLLTTKGLT VSEVAFATGF ANVNNFSVSF KELYGVSPAH YAEQQLVKH
|
| |