Gene Information |
Locus tag | Slin_5025 |
Symbol | |
ID | 8728790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 6122862 |
End bp | 6126032 |
Gene Length | 3171 bp |
Protein Length | 1056 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | putative transcriptional regulator, Crp/Fnr family |
Protein accession | YP_003389801 |
Protein GI | 284039871 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
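The coordinate and length fields above are tied together by simple arithmetic, which the following Python sketch checks. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Slin_5025
length_bp = gene_length(6122862, 6126032)
print(length_bp)                  # gene length in bp
print(protein_length(length_bp))  # protein length in aa
```

With the values from this record, the sketch gives 3171 bp and 1056 aa, matching the Gene Length and Protein Length fields.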
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00524189 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGCTG GTGCAGATTC TACTTCTCAA CCTATTTCTA CTCCTCAGCG ATGGCATCGT GCGTTAGGCA TCCGGCCGGA GGAAAGCCGA ACGGTGGGGT TATTTTTCAT TCATAACTTT TTGCTTGGCA TCGGCACTAT TCTCATTTAT GTAGCAGCCA ACGCAATTCT GCTTGAAAAT CAACCTGATA CAAGTCTTCC TATTGCCTAT ATAGCCTCGG CAGCAGCCAT GATCGGCATT GGCCGAATCT ATGGCTACTT TGAACATCAC CTGGCCTTAC GCAGTCTGGC CGTTCGGGTG TTGATCGCGG CTGTTGTGAT GACGATTGTG GTGGGTATTC TGGTCGTTGT TGGGCATTCC ATTATGGCGG CTGTAGCCAT CATGACGGGC TATCGAATTA TTTACCTGCT TACCAACCTC GAATTCTGGG GCGTTTCGGC CGTTGTATTT GATACGCGTC AGAGTAAGCG TTTATTTGGC GTGATCAGCT CCGGCGATAT GCCCGCCAAA GCCATAGGCG CTATCCTGGC TGCTCTGGTA CATGCCCATG CCGATGTATT GCGGTTGTTG CTGGTTGCTT TCGGGGCCTT TCTGGCAGCA CTTTATATTC TTCAGCTAAC TATTCAGTCG CACGATGTAC ACGCGCCACA GGGGGCCGAC CGGGTCGCCC GCCGGGAGCC GTCCCGATTA ATTGGCAAGC GGTTCGGCGG GAGTGAACTG ATTTTCTACA TGTGCCTGAG TATGGCCTTT CTGGCCGCCG TGGCAACCGA GATCGAATAC AATTTCTTCA TTAACGTGAA GCACCGCTTT CACGACCAAA CGGATGTTAT CCGCTACGTG AGTTATGTAC TGGCGTTGAC GTACGGAGTG GCCATGCTGG CTAAATTGCT GATGTCGAGG CAGGTACTCG ACCGATTTGG GATACAGCGC TCGTTGCTGG TATTGCCCTG GCTGGCCTTG ACGGGGCTAA TTGGCTTAAT CGGACTACGC TATTTTACCA CCAGTGAAAC CGTCTTGCTG GTTTACTTCT GTGGCTTATA TCTGCTCTTT GAAGTAGCCC GCCGGGCGCT GTTCGATCCC GTATTTCTCG TTTTATTTCA ACCCCTGTTG CCCCCTCAGC GACTCAAGGG ACACACACTG GCCAAAGGTT TATATGAACC CATTGGCTTA GGTGTGGCCG GTCTGCTCAT TTATGTTTTG CATACAACGC TCGAATCGGG TGGCGTACTG GTATGGATTT TACTCCTGGC GGGCGCTATC TGGCTGCTGC GCCACACGTA TAAACGGTAT CTGCATGAGT TGACGGATGC CATTGGCCGA CGCTTTCTGG AAAGCGATCA ACTAGCGATG CCAACGGTAG CGCAGGCCAG TCTGGTCAGG CAGTTGCAAA GTGACCGACC CGGCGACGTG CTAACGGCAA TTAGTTGGCT GGATACGCAT AATCCGGACG AATTAAGTCG GCAAATCCCT TCGCTGCTTA CGCATACTAA TGCCAGTATC CGCCGAAGAA CGCTGGCCGC AGCGGCCCGG ATGCAGCAAC CCCTGCCCGT CCGTCAGGTT ACGCACATTG CCCTGACGGA CGATGACCCG GCCTTACGTC AGCAAGCGGC CTACCTGTTG GGACGACGCA TAAATGCAGA AGCCGCTGAA CTGGCCCCGC TTATTAATCA CACCGATCTC GCTATCCGGC AGGGGGCTAT CAAGGGGGTG CTGGAATTAA ACCCACACGA TCCGGCGGCT CTTGCGGCCC TGACTAAACT GGCCACCGCT GCGGACGTTG TTCTTCAGCA ACTAGCTTTA 
TCCCTCATTG CCACTCTCAG ACTGACCAGC TTTGCCGCCC TGGTTGGCGA ACGGCTGGAG AGCCCGAGTG CCGATGTACG AAAGGCCGCC ATCAATGCGG CAGGCCATTT GCCCAGTACC AATCTGACCC GGTATTTACT GGATAACCTT ACTCACAAGA CCATCGGGCG AGCTGTAACC CAGGCACTGA AAGCCCGCGG CAGCGAAACC ATTGCAGCCC TACGTCAGGT GCGGCAGTCG ACTACAGATC GACTCGTATT GGAACGTATT GCGGCCGTTT GCGGAGCTAT ACCTACGCCG GAGAGCCGCC ACGTGCTCAA TGGGCTGGCG CAGCAACCCG ATGTATTGGT GCGGAGTGCC GCCCTCCGGG CCTTGCGCCG GTTCCCGAAC GAACCCGAAG ACGACGCACT TTTTCGCGCG GTCATGCAGG AAGAGGTGTT ACTGGCCCAG CGGCTCCTGC ACGGGAGTTT GACGGATACA GCCCTGACGA AGACGCTGGA TTATGAACTG TCGGTACTGC TACAGCGGCT GTTCGATGTG CTGATGCAGC TTTACGACTC CGAAACCATT GCCGGGGCTC GCATGGGTAT TGGCCATCCT GCCCGCGAAC GGCGGGCCAA CTCGCTCGAA ATGCTGGATA ACCTGATTCC ACGAGCTACC TACCAAACGC TTCAAGTACT CGTAGATGAT CTGCCCCGCA CCGAAAAAAA TCGTATTCTT GATGCGGAAT TAGGGCCTTT CGACGACCGG GAACCCATCC GGGCGTTTGT GTTGCGAATG GGTGAAACCG TTTTTTCGTC CTGGACGGTC GATGTAGTGC TGCGTACATT ACCACCGGCT GACGCTTTGC TGGCCCGGCA GTCTTTAGAA TCCCGTTTTG GTACTTTATC CCCTTTGCTT ATGTCGCATG CCCCTCATTC TACCGAGCAG ATTTCAGCCT ACGACCGGGT ATTGCTATTA TCCCAGGCCA GTCTTTTTTC CCAAACGCCC GAGAATGTAC TGGCGAGTAT CACGCCAATT ATGAAAGAAG AGGAACATCA AGCCGGTGTA TCCATTTTCC GTAAAGGCGA TCTGGGCACG GGCATGTACG TTATCTACAC CGGCGAAGTA GCTATTCTGG ATGGCGATAC CGAACTGGCT CGTTTTGGGC GGGGCGACTT CTTTGGCGAG CTGGCGCTGC TGGATACCGA AGCCCGCTCG GCTACGGCAG AGGCCATTAC CGATGTTCGG CTGCTGCGTA TTGACCAGGA TGATTTTTAC GACCTGATGG AAGAACGGAG CGAGGTTCTT CGTAGTATTG TTAAAAGCTT GTCAGGGCGT ATTCGGCGGC AGAACGAATT GCTGTCAAAC CGGGCTACAA CACCGCAGTA G
|
Protein sequence | MIAGADSTSQ PISTPQRWHR ALGIRPEESR TVGLFFIHNF LLGIGTILIY VAANAILLEN QPDTSLPIAY IASAAAMIGI GRIYGYFEHH LALRSLAVRV LIAAVVMTIV VGILVVVGHS IMAAVAIMTG YRIIYLLTNL EFWGVSAVVF DTRQSKRLFG VISSGDMPAK AIGAILAALV HAHADVLRLL LVAFGAFLAA LYILQLTIQS HDVHAPQGAD RVARREPSRL IGKRFGGSEL IFYMCLSMAF LAAVATEIEY NFFINVKHRF HDQTDVIRYV SYVLALTYGV AMLAKLLMSR QVLDRFGIQR SLLVLPWLAL TGLIGLIGLR YFTTSETVLL VYFCGLYLLF EVARRALFDP VFLVLFQPLL PPQRLKGHTL AKGLYEPIGL GVAGLLIYVL HTTLESGGVL VWILLLAGAI WLLRHTYKRY LHELTDAIGR RFLESDQLAM PTVAQASLVR QLQSDRPGDV LTAISWLDTH NPDELSRQIP SLLTHTNASI RRRTLAAAAR MQQPLPVRQV THIALTDDDP ALRQQAAYLL GRRINAEAAE LAPLINHTDL AIRQGAIKGV LELNPHDPAA LAALTKLATA ADVVLQQLAL SLIATLRLTS FAALVGERLE SPSADVRKAA INAAGHLPST NLTRYLLDNL THKTIGRAVT QALKARGSET IAALRQVRQS TTDRLVLERI AAVCGAIPTP ESRHVLNGLA QQPDVLVRSA ALRALRRFPN EPEDDALFRA VMQEEVLLAQ RLLHGSLTDT ALTKTLDYEL SVLLQRLFDV LMQLYDSETI AGARMGIGHP ARERRANSLE MLDNLIPRAT YQTLQVLVDD LPRTEKNRIL DAELGPFDDR EPIRAFVLRM GETVFSSWTV DVVLRTLPPA DALLARQSLE SRFGTLSPLL MSHAPHSTEQ ISAYDRVLLL SQASLFSQTP ENVLASITPI MKEEEHQAGV SIFRKGDLGT GMYVIYTGEV AILDGDTELA RFGRGDFFGE LALLDTEARS ATAEAITDVR LLRIDQDDFY DLMEERSEVL RSIVKSLSGR IRRQNELLSN RATTPQ
|
| |
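The relationship between the gene sequence and the protein sequence can be sketched in plain Python as follows. The sketch assumes that the gene sequence is shown 5' to 3' in the coding orientation (it begins with ATG even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns codons to amino acids the same way the standard code does, differing only in its permitted start codons. The helper names are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG ordering.
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence from this record
print(translate("ATGATTGCTG GTGCAGATTC TACTTCTCAA"))
```

The 30 bases shown translate to MIAGADSTSQ, the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` gives the value that the GC content field reports as a rounded percentage.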