Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3199 |
Symbol | |
ID | 8726952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 3873044 |
End bp | 3875866 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | two component transcriptional regulator, AraC family |
Protein accession | YP_003388009 |
Protein GI | 284038079 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.157032 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.236848 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTACA CCCTTCGTGG CTTTGTAGCT GTCTATAGGC TTCGATCAAT TAAATCGCGA TTGAGTAAGG GAACCTTTGG CTTTCTGCTC TCGGTAACGC TGCTGCTGCT GATGGCAGGC TGTCAGTCGG CAACAAAGAC CAAAACATAC CGGATTGGCT TCTCGCAATG CACAGGGGGC GACGAATGGC GAAAGACGAT GCTGCATGAT ATGAAACGGG AGTTGACGTT TCATCCCAAT TACACCCTCC TCTACGAAGA TGCCGAAAAC AGCACGACCC GCCAGATTAG CCAGATCCAG GCACTGATCG ATCAGGGCAT CGATCTGCTG ATCGTTTCAC CCAATGAAGT GGCTCCATTT GCCAAAGTTA TCGAAGACGT TTTTAAGCGA GGTATCCCGG TCATTCTGCT CGATCGTAAA ATCGAAACAG AGTCCTACAA CGCTTATATC GGTGGCGATA ACGTCGAAAT TGGCCGGTTA GCGGGTGTTT TTATTGGCAA TCACCTAAAA GGGAAAGGAC GGATCGTTGA AATTTGGGGG TTGCCCAGTT CATCACCGGC TCAGGAACGG CACCGGGGAC TTCTGGAGGA GTTAAGGAAA TACCCCGGCA TTCAGGTGGT TAAAGAACTG AATGGACAGT GGGAGCGCGA TACTGTAGGT CGGGTAGTGG CTGCGGAATT GAACACCCTG AAGGATGTCG ATCTGGTATT TGCCCACAAT GATGTAATGG CCCTTGGCGC GTATGCCGTC TGTAAGCAGA AGGGTATCGA AAAAAAACTT GATTTCGTTG GAATCGACGC TCTACCGGGT CCCAACGCGG GTATGCAGGC AATTACAGAC GGAATTCTGA AAGCCAGCTT TTTGTACCCA ACGGGTGGAG AAGAAGCCAT TGAGACAGCT ACGCGGATTC TGGCCGGAAA ATCGGTAAAA CGGGAACAGG TGCTCAACTC CATTCAGATA GATGGGTCGA ATGTCAGGGC ACTAAAAGCA CAAAGCGATA AGCTGTTGGC CCAGCAAGTC GACATTGAAA AGCAAAGCCA GCGCATTGAC GCCCTAACAC AAACCTACGC GTCCCAGAAA AACACCTTAT ACATCACGCT GGCCAGCCTG ATCGTCGTTA TGCTGCTGGG GGGCTGGGCG TTGTACCTCT TCCGGTCGAA GCAAACAGCT TACCGGACGC TTGAACGCCA AAACGAAGAA ATACGGGAAC AGAAAGATAA AATAGAAGCG GTGTCGCAGC AGGCACGGCT GGCAACGGAA GAAAAACTTC GGTTTTATTC TTATATCTCT CACGAATTCA ATACGCCCTT AAGCCTGATT CTAACGCCGA CGGAGGATCT GCTGACCAAG AAGACCGTCA GTTCCCACGA CCTGAAAAGT AACCTGGCCC TCATCCAGAA AAACGCCTAT CGGCTGCTGC GGCTGATCGA TCAGATGCTG GACCTCCGCA AAACAGATGC CGGTAAACAA CGCCTTCGTA CGGTTGAACA GGACCTGGTT GCCTTCATTC AGGATATCGT TCAGGATTTC AAACGCAAAG CGGAAAAGCA GCGGATCGAC CTTCGGTTTA TGCCGTCGAA AGCCAGCCTG CCGGTTTGGT TCGACGAAGA GAAGCTCGAT AAAGTCATCG TCAATCTGTT GTCTAACGCG TTCAAGTATA CGCCGAAGGG TGGGCTGATT CATGTTCGGC TGGATGTGCT GGATAATCGG GTATGCATTC AGGTAGAAGA TAATGGCGAG GGTATGACGC CCGATGAACA GGCACATGCC TTCGACTTGT TCTTCAGCGG TACCCGACCG TTCAACCTCT CGAAAGGATT GGGGCTGGCG CTGTCGATGG AGTTTATTCA ATTGCATCAG GGTGATATCA GTGTGAAGTC CGAAAAGGAT AAAGGAACAA CGTTTACCAT TCTGCTGCCT CTGGGCAAGG ACCATCTGGC CGAGGAGGAA ATGGATCAGT CGGGGCCTCG AAATACATTA AGAGAAGGGG CGTTCGTTTC TCCGCGCCTG CTGGTTGATG TGGAAGAGGA GGAGAGTACG CTCCCTGCGG ATCTCACACA AAAACGGTCG GGGACCCTGC TGGTCGTTGA AGATAACGAC GACATCCGGA CGTTTCTAAC CACCCGATTG GGTCATGAGT TTGAGATTAT TGCCGAGGGT ACCGGCGAAA AAGGCTGGGA ACGGGCCATT GACGTGATAC CCGATCTGAT CATCAGTGAT ATCATGCTGC CGGGTATGGA TGGTATGCAG CTGACCCAGC GCGTAAAGGC TGATCTGCGT ACGTCACACA TTCCGGTGGT GCTGCTAACG GCCAAAGGGC AAATGGAGCA ACGCATTGAA GGCACCCGCG CCGGAGCCGA CGCTTACATC ACCAAGCCGT TCAACACCAC GTACCTACTC GAAGTGCTGC GTACCACGCT CGCCAATCGG GAGAAGTGGC AACGACGGTA CGCATCCGAC TTTCTGTCGC AGGCAGGTGC GGGTAATCGG CAGGACAAGA AGTTCCTGAA TGAGTTAACC GGACTAATCG AGCAAAATCT CACCGACCCC GACTTTGGCG TCGAGAAGCT GAGCCGCGAT ATGGGCCTGT CGCGGGTGCA ACTTTACCGA AAGGTGCAGG CCCTGCTGGA TATGAACGTG ATTGATTACG TAGCCGAAAT ACGCCTGAAA AAGGCCCGGC GTCTGCTGAC TGAAAGTACC AAAACAATGG CCGAAATCGC TTACGAAACC GGCTTCAGCT CACCAGCCTA CTTCACGACT TTTTTTAAGC AGCACACGCA AAAGACCCCT TCGGAATACC GAAAATCACC GGCAAGCGCG TGA
|
Protein sequence | MSYTLRGFVA VYRLRSIKSR LSKGTFGFLL SVTLLLLMAG CQSATKTKTY RIGFSQCTGG DEWRKTMLHD MKRELTFHPN YTLLYEDAEN STTRQISQIQ ALIDQGIDLL IVSPNEVAPF AKVIEDVFKR GIPVILLDRK IETESYNAYI GGDNVEIGRL AGVFIGNHLK GKGRIVEIWG LPSSSPAQER HRGLLEELRK YPGIQVVKEL NGQWERDTVG RVVAAELNTL KDVDLVFAHN DVMALGAYAV CKQKGIEKKL DFVGIDALPG PNAGMQAITD GILKASFLYP TGGEEAIETA TRILAGKSVK REQVLNSIQI DGSNVRALKA QSDKLLAQQV DIEKQSQRID ALTQTYASQK NTLYITLASL IVVMLLGGWA LYLFRSKQTA YRTLERQNEE IREQKDKIEA VSQQARLATE EKLRFYSYIS HEFNTPLSLI LTPTEDLLTK KTVSSHDLKS NLALIQKNAY RLLRLIDQML DLRKTDAGKQ RLRTVEQDLV AFIQDIVQDF KRKAEKQRID LRFMPSKASL PVWFDEEKLD KVIVNLLSNA FKYTPKGGLI HVRLDVLDNR VCIQVEDNGE GMTPDEQAHA FDLFFSGTRP FNLSKGLGLA LSMEFIQLHQ GDISVKSEKD KGTTFTILLP LGKDHLAEEE MDQSGPRNTL REGAFVSPRL LVDVEEEEST LPADLTQKRS GTLLVVEDND DIRTFLTTRL GHEFEIIAEG TGEKGWERAI DVIPDLIISD IMLPGMDGMQ LTQRVKADLR TSHIPVVLLT AKGQMEQRIE GTRAGADAYI TKPFNTTYLL EVLRTTLANR EKWQRRYASD FLSQAGAGNR QDKKFLNELT GLIEQNLTDP DFGVEKLSRD MGLSRVQLYR KVQALLDMNV IDYVAEIRLK KARRLLTEST KTMAEIAYET GFSSPAYFTT FFKQHTQKTP SEYRKSPASA
|
| |