Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1528 |
Symbol | |
ID | 8725262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 1842770 |
End bp | 1845724 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003386376 |
Protein GI | 284036446 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.0809349 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATCAT TTAAACGCCT GAACACCCTA ACGGGCTGGC TCGTTTTCGC CGTCGCGCTC TTTACCTATG CGATGACCGT CGAACGCACC GCCAGTTTCT GGGACTGTGG CGAGTTCATT GCGTGTTCGT TTAAACTCCA GGTACCCCAC CCGCCCGGTG CTCCATTTTT CCTTCTACTG GGACGACTTT TCTCCATGAT GTCGTTCGGC GATTTGACCA GCGTAGCCTA TTGGGTCAAC ATGGCGTCGG TACTGGCGAG TGCCTTTACC ATTGCGTTCC TGTTCTGGAC CATCACCATG CTGGCACAGA AATTGCTCGG CAAAGCGGAA CGTGATTATA CCACCGCCGA CACGCTGCTG GTTATTGGTA CGGGTGCCGT TGGTGCCCTG GCCTACACCT TTTCCGACAC ATTCTGGTTT TCGGCGGTTG AGGCCGAAGT GTACGGCATG TCGTCATTTT TCACCGCCAT CGTGGTTTGG GCAGCCTTCA AATGGGAGCG CATCGAAGAT GAAGCCGCTG CCAATCGCTG GCTTATCTTT ATCGCTTACC TGACGGGGCT ATCCATCGGG GTCCACTTAC TGAACCTCGT AACCCTGCCC GCACTGGCAC TTATTTACTA TTTCAAAAAA TATCCCAAAC CAACGTTCTG GGGCGGTGCA GCGGCTTTCG GTATCGGTCT GGTTATTCTG GGTATCATCA ATTCGGGTAT TATTCCCGGC TTGCCTGGCA TGGCCTTCGC CTTTGAGCGG TTCTTTGTGA ATACCCTTGG CTTACCGTTT ACGTCGGGCG CTATCTTCTT TACCGTCGTC TTCCTGGGAG CTATCGTATA TGGCATCATC TGGTCGGCCC GGCAAAAGCG GGTTATCCTG AATACCTCCC TGTTGGCGCT GGCATTCGTA CTGATCGGCT ATGCTTCGTA CATGCAGGTA CTGGTACGGG CCGACTTCAA CCCGCCAATC AATGAGAATG ACCCCAGCGA TGAACTGAAC TTCCTGTCGT ACCTGCGTCG GGAACAGTAC GGAAGCCGCT CGCTGTTGTA CGGTCCTGTT TTCACGGCGC GCCCCATCGA CCAGAAGCAG GGTGCCGCCA TGTGGAAAAA ACAAGGCAAC AAATACGTGG TATTCGACCA TCAGCCGGAG TACGTTTACG CGCCCGGCGA TGAGATGCTG TTTCCCCGCG TATATAGCAG TCAGCAAAAC CACCCGGCCC TGTACCGACA GATGCTTGGT CTGGCAGAAG GTCAGAAACC AACGATGGGT GATAACCTGA AATTTTTGTT CAACTATCAG TTGGGACACA TGTGGTGGCG CTACCTGATG TGGAACTTTG CCGGACGTGA GAGCGACGAA GAAGGTGCAG GTTACCTCCT CCCCTGGTCG ACGGATCAAA ATGCCCCGGA TTTGTTGAAG ACCAATAAAG CCCGCGACAA TTTCTACATG CTGCCGTTCA TCCTGGGCCT GTTTGGTATT ACGTTCCAGT ACTTCCGCCG TCGGCGCGAC TTCCTGATTG TCGGGCTCCT GTTTTTATTC ACAGGTATTG CCCTGCAAGT CTTCCTGAAC TCGCCCCCAT CGGAACCCCG CGAGCGTGAT TACATCTACG TGGGTTCGTT CTACTTCTTC GCCATCTGGC TTGGGCTGGG CGTTATGTCG ATTGCCGAAG GATTACGCAA CGTTCTGAAG TCGGACGTAG CCCGCAATGG TCTGGTTGCC GGTATTGCTC TGCTGGTGCC GGTTATGATG GGTGCCAAAA GCTGGGATAA CCACAACCGC GATAAGCGTT ACCAGTCGGT CGATTTCGCG AAAAACCTGC TGAACTCCTG TGCCCCCAAC GCGGTGCTCT TTACGGGCGG TGATAATGAT ACCTTCCCCC TTTGGTACGT GCAGGAAGTA GAAGGGTTCC GGCGCGATGT GCGGGTGTGT AACCTGAGTT TGCTGGGTAC GGAATGGTAC ATCCAGCAGA TGAAGCGGAA GACTTACGAG TCGGAAGCAC TGCCCATTTC GCTCGAATTC GACAATTTCA ACAAAGGCAA AAACGACATT GTGCCGTTCT ACGAAGTGCC TGGTGTGAAA AACGGGATCG ACCTCAAGCA ATACATTAAC CTGATCAAGA CGAGCAGTCC GGCGGTTCAG GTACCGCTCA CCAACGGCGA TATGACGAGC ATTCTGCCTT CGTCGGTGCT GTTCCTGCCC ATCGACAAAG CTGCGGTCGA CAAAGCCAAT TTCGTACCGG CTGCACTTCG CCCGTTGATG AAGGATACCC TGCAATGGAC CATCGGGAAG AAGGATTTGT ACAAGCCTGA CCTGATCATG CTCGACATGA TTGCCACCAA CAACTGGAAG CGGCCCATTT ACTTCTCCAG CACCCTGGCA AGTGACAACT ACCTGAGCCT GAAGAACTAC ATGCAGTTAG AAGGTTACGC GTATCGGCTC ATGCCGGTGG CCGTACCGGG CGCAACGGAT GGCTATGTAA ACTCCGACAT CATGTACACC AACATGACGA AGAAGACCTT CTGGCGTGAG TTCAACAACC CGGATGTGTA TTATGACGAA ACCTACAAAG GTCCGCCGGT GATTTCGGCC CGTATTGCGT TCTTCCGTCT GGCCGATCAG TTCATCCGCG AAGGCCGGAA AGATAAAGCC CTTGAGGTGC TTAACTACTC TCTCAAGGTT ATTCCGGACA AGGCCATTCC GTACGACCAG ATTTCGTCGA ACTACGTGCG CTTCCTGTTT GAAGTAGGTG ATAACAAAAA GGCCCTCGAA ATTGCCGAAG TAATGGCCAC CCGCGCCGAT CAGGATCTGA CTTATGCCAA GAGTGGTAAT GGACGATTTG GCAGCCCCGA TTCAGACCTG TACATTCTGC AAACGATTGT CGAAGCCTGT AAGGAAGCCA AGCAAACGGC AGCCGCCAAT AAATACGAAG CTATTTTCCA GAAGCATATC AATGCATTTG GTTGA
|
Protein sequence | MQSFKRLNTL TGWLVFAVAL FTYAMTVERT ASFWDCGEFI ACSFKLQVPH PPGAPFFLLL GRLFSMMSFG DLTSVAYWVN MASVLASAFT IAFLFWTITM LAQKLLGKAE RDYTTADTLL VIGTGAVGAL AYTFSDTFWF SAVEAEVYGM SSFFTAIVVW AAFKWERIED EAAANRWLIF IAYLTGLSIG VHLLNLVTLP ALALIYYFKK YPKPTFWGGA AAFGIGLVIL GIINSGIIPG LPGMAFAFER FFVNTLGLPF TSGAIFFTVV FLGAIVYGII WSARQKRVIL NTSLLALAFV LIGYASYMQV LVRADFNPPI NENDPSDELN FLSYLRREQY GSRSLLYGPV FTARPIDQKQ GAAMWKKQGN KYVVFDHQPE YVYAPGDEML FPRVYSSQQN HPALYRQMLG LAEGQKPTMG DNLKFLFNYQ LGHMWWRYLM WNFAGRESDE EGAGYLLPWS TDQNAPDLLK TNKARDNFYM LPFILGLFGI TFQYFRRRRD FLIVGLLFLF TGIALQVFLN SPPSEPRERD YIYVGSFYFF AIWLGLGVMS IAEGLRNVLK SDVARNGLVA GIALLVPVMM GAKSWDNHNR DKRYQSVDFA KNLLNSCAPN AVLFTGGDND TFPLWYVQEV EGFRRDVRVC NLSLLGTEWY IQQMKRKTYE SEALPISLEF DNFNKGKNDI VPFYEVPGVK NGIDLKQYIN LIKTSSPAVQ VPLTNGDMTS ILPSSVLFLP IDKAAVDKAN FVPAALRPLM KDTLQWTIGK KDLYKPDLIM LDMIATNNWK RPIYFSSTLA SDNYLSLKNY MQLEGYAYRL MPVAVPGATD GYVNSDIMYT NMTKKTFWRE FNNPDVYYDE TYKGPPVISA RIAFFRLADQ FIREGRKDKA LEVLNYSLKV IPDKAIPYDQ ISSNYVRFLF EVGDNKKALE IAEVMATRAD QDLTYAKSGN GRFGSPDSDL YILQTIVEAC KEAKQTAAAN KYEAIFQKHI NAFG
|
| |