Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1046 |
Symbol | |
ID | 8724776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 1277599 |
End bp | 1280799 |
Gene Length | 3201 bp |
Protein Length | 1066 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | protein of unknown function DUF1549 |
Protein accession | YP_003385896 |
Protein GI | 284035966 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAATTA CGACTGGATT CATTACTGTA GGCTATCAAA GGTTTCGATT CTTATTAGTC GTCTGTTGTA CCCTTCTGTT TCTTCAAAGT TGCGGGGTCG ACAAACCGGC GGAGATCGCT CAACTGGACG ACAAGCTTCC AGAAACCATC GACTATAATC TGCACGTCAA GCCTATTCTG TCTGATAAAT GTTTCTTCTG CCACGGGCCG GACAAAAACA GTCAGAAAGC CGGATTGGAA CTTGCCACCC CAGAAGGGGC CATGGCCGCC CTCAAAAAAG CAAAAGGGAA ACACGCCATT GTTCCCGGTG ATCTGGCAAA CAGCGAAGTT TACCACCGGA TTTTGAGCAC CGATGAGCAC GAGATGATGC CGCCGAAAGC CTCGAACCGG TCGTTATCGG TTTACGAAAA GGCCGTTCTG ATCAAATGGA TCGAGCAGGG AGCCGAATAT AAACCGCACT GGGCGCTCAT CAAACCGCAA AAACCAACCT TCCCAACGGT AAAAAACACA AGCTGGCCCA AGAATCCAAT CGACTATTTT ACGCTGAGCA AGCTGGAGGA AAAAGGACTC ACCCCTTCGC CCGAAGCCGA TAAGGAAACC CTCCTTCGCC GGGTAACCCT CGACCTCACT GGGCTACCGC CTACCCTTGC CGAAATCGAC GCTTTTCTGG CCGACACCTC GCCCAACGCC TACGAAAACG TAGTCAACCG GTTGTTGAAG TCGCCCCATT ACGGCGAGAA AATGGCGGTC GACTGGCTGG ATTTATCGCG CTATGCCGAT ACACATGGCT ACACCGTTGA CCGATACCGA CCCATGTGGC CCTGGCGCGA TTGGGTCATT CAGGCATTTA ACCGCAACCG GCCATTCAGC CAGTTTATCA CCTGGCAACT GGCGGGCGAT CTGCTTCCCA ACCCAACGCC CGAACAGCGG CTGGCAACGG CCTTCAACCG CAACCATTCG CAGAATATGG AAGGCGGCAT TATCAACGAG GAGTTCCGGG TGGAGTACGT CGCCGACCGG ACCAACACGC TGGGTACGGC CTTGATGGGT ATGACCGTCG AGTGCGCCCG CTGCCACGAC CATAAGTTCG ACCCCATCTC GCAGAAAGAT TATTACAGCC TGTTCGCTTT TTTCAACAAC GTGGACGAAG CCGGGCAGAT TTCCTGGGAC GATGCCATGC CCGTGCCCAC CATGCTGCTC ACCCAGCCGA AAGAAGACAG CTTGCTGGCC TTCATCGACA CGAAAATCAG GAAGGCAGAA ACCGATCTAA AACAAACCGT ACAAACCGAA CAGAAAGCCT TTGACGCCTG GCAGGCGAAG ACCAATGCGG TGCCGTTCGA TGCCCGCAAA GGACTACAGG CACGTTTTAC CTTCGATAAA CTGGTGAAGG GACGGTTCGT GAATGAAGTG AGCATCAAAG ACAGCGGTAA AGTAGCAGAC CCCGTGCTTG TGCCCGGCAA GCTGGGTAAT GCCTTTCAAT CCAACGGCGA CGACATTCTG ACACTCGGCC AGGCCGGTGT TTTCAATCGC TCGCAGCCGT TCTCCATTGG CGTGTGGGTC AACATTCCGA AAGGCGTGTC GAAAGCGGCC CTTGTTCACA AAGGGAACGG TGATATTCTG TATAATTTCC GGGGGTACTT CCTCAACCTC CGCGACGATA AAGCCGAACT GCTGATGGCC CATACCTGGC CTTACAACAG TATTGTCCGG GTTAGTCGGC AGGTGCTGCC CAAGGAAAAG TGGATTCAAC TGACCATGAC CTACGACGGC TCCAGCAAAG CGAACGACTT CCGGCTGTAC GTCGACGGGC AGGAAATGGC CTTACAGACC GAACGCGACA ATCTCTACAA AGACATTTTA TTCAACGGAA ATCAAACGGG TCTGATGGTG GGGGCAGATA TGCGCGGACC GGGTTTCAAG AAGGGGCTGG TCGACGAACT GGTTGTCTAT AACCGTACGC TGAGTCCTCC TGAAGTGCAG GCGCTGGCCC AGTCTGTGCA GCAGGTAACG CCGATGAAAG GCGACCCGTT GAAACAGTAT TACTTCAGTC ATCTATCACC CGCTTACCAG CAGCAGCAGC AGGCTTTGCA GCAACTCCGG GCTGAGCGGA ACAAACTGGT GGAGGCCATA CCGGAAATTA TGGTGATGGA GGAGATGAAA ACGCCCCGTC CAACCTACCT GCTCAAACGG GGTGTTTACG ACGCCCACGG CCCGCAGGTA AAACCCGATG TGCCGGCCAG CGTGCTGCCC TTTTCGCCGG ATCTTCCTCG CAATCGGCTG GGGCTGGCCA AATGGCTGCT GCACCCCGAT AACCCGCTGA CGGCGCGGGT GGTGGTAAAC CGGTACTGGC AAACGTATTT CGGCAACGCT ATTCAGAAGT CAGCCAACAA CTTCGGAAAT CAGGGCGGGC TGCCTACTCA CCCGGAACTG CTCGACTGGC TGGCGGTTAC CTTCCGAGAG AGCGGCTGGA ACATGAAAGC CATGCAGAAG CTGATTGTGA TGTCGGCCAC TTACCGGCAG TCATCGTACG GCACCCGCGA AAGTTTGCAG AAAGACCCGG AAAACACCTG GTTGTCGCGC GGACCGATTT CGCGCCTAAC CGCCGAGATG ATTCGCGACA ATGCGTTGGC CGCCAGCGGC CTGTTAGTCA AACAACTGGG TGGCCCCAGT GTAAAGCCTT ATCAACCCGA AGGACTCTGG GCGGTCAATA ACACGACGTA TGAACAGGAC AAAGGAGACA GCCTATACCG CCGGAGCCTG TACACGTTCT GGCGTCGGAC CAACCCGCCC CCGTCCATGA ATACCTTCGA TGCGCCGTCG CGCAGTTACT GCGTGGTTCA ACGACAGAAA ACTAGTACCC CATTACAAGC ACTGGTGTTG CTCAACGATC CGCAATTTGT CGAAGCCGCC AAAGTCGTTG CCGAACGGTC GTTTGCCCAA CACACCACCG TACCGGATCG GTTAACGCAC CTGTTTCGGC TTTTGGCCAG TCGGAAACCA GCACCAAAGG AGCTGGCTAT CTTAACCCGC TTGTATCAGC AGGAATACCA GTTATTTACG AAAAATCCGG CCAAAATGCA GGGCTGGCTG CAGGCCGGTG ACTATAAACT GAAGGATGTA GCCGACAAGC CTGCTTTGGC CGCTGGTGCC GTTGTGGCCA GTACCGTCAT GAATTCCGAA GCGTTCGTTA CCAAGCATTA G
|
Protein sequence | MAITTGFITV GYQRFRFLLV VCCTLLFLQS CGVDKPAEIA QLDDKLPETI DYNLHVKPIL SDKCFFCHGP DKNSQKAGLE LATPEGAMAA LKKAKGKHAI VPGDLANSEV YHRILSTDEH EMMPPKASNR SLSVYEKAVL IKWIEQGAEY KPHWALIKPQ KPTFPTVKNT SWPKNPIDYF TLSKLEEKGL TPSPEADKET LLRRVTLDLT GLPPTLAEID AFLADTSPNA YENVVNRLLK SPHYGEKMAV DWLDLSRYAD THGYTVDRYR PMWPWRDWVI QAFNRNRPFS QFITWQLAGD LLPNPTPEQR LATAFNRNHS QNMEGGIINE EFRVEYVADR TNTLGTALMG MTVECARCHD HKFDPISQKD YYSLFAFFNN VDEAGQISWD DAMPVPTMLL TQPKEDSLLA FIDTKIRKAE TDLKQTVQTE QKAFDAWQAK TNAVPFDARK GLQARFTFDK LVKGRFVNEV SIKDSGKVAD PVLVPGKLGN AFQSNGDDIL TLGQAGVFNR SQPFSIGVWV NIPKGVSKAA LVHKGNGDIL YNFRGYFLNL RDDKAELLMA HTWPYNSIVR VSRQVLPKEK WIQLTMTYDG SSKANDFRLY VDGQEMALQT ERDNLYKDIL FNGNQTGLMV GADMRGPGFK KGLVDELVVY NRTLSPPEVQ ALAQSVQQVT PMKGDPLKQY YFSHLSPAYQ QQQQALQQLR AERNKLVEAI PEIMVMEEMK TPRPTYLLKR GVYDAHGPQV KPDVPASVLP FSPDLPRNRL GLAKWLLHPD NPLTARVVVN RYWQTYFGNA IQKSANNFGN QGGLPTHPEL LDWLAVTFRE SGWNMKAMQK LIVMSATYRQ SSYGTRESLQ KDPENTWLSR GPISRLTAEM IRDNALAASG LLVKQLGGPS VKPYQPEGLW AVNNTTYEQD KGDSLYRRSL YTFWRRTNPP PSMNTFDAPS RSYCVVQRQK TSTPLQALVL LNDPQFVEAA KVVAERSFAQ HTTVPDRLTH LFRLLASRKP APKELAILTR LYQQEYQLFT KNPAKMQGWL QAGDYKLKDV ADKPALAAGA VVASTVMNSE AFVTKH
|
| |