Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3971 |
Symbol | |
ID | 8727729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4768796 |
End bp | 4770583 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003388760 |
Protein GI | 284038830 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000911124 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATTT TTGAATTCCT GATACCCCGA TTAGCCAGCA CGCTACCCCG GCGTTCACTC CGCTCCGGCT GGCTTTCGGC CGTGCTTTTT CTTCCACTTT CGGTTGCCTC AAACACGCCC GGTCAGCCAC CCGCTGCTGC CACACCGAAG ACCGTGGTGC GGCAGACCGA GGTAAAGGAA TCCATGGTGC AAAAGGCCGT GGTGAGGCAG GCCGATGTCA TTATTTACGG AGGTACATCG GCGGCTGTTA CGGCGGCTGT TCAGGTAAAG AAAATGGGCA AATCGGTCAT CATCGTTTCG CCGGATAAAC ACCTGGGTGG CCTTTCGTCA GGTGGACTTG GCTTTACCGA TACCGGAAAC AAAGAAGTGA TCGGGGGCCT CTCGCGCGAG TTTTACCAGC GACTGTACCA GCACTATCAG CAGAAAGACT CCTGGAAATG GCAGAAGCCC GAAGAGTACG GCAACAAAGG CCAGGGTACC CCCGCCATTG ACGGTACCAG CCGGACCATG TGGATTTTTG AACCCCATGC CGCCGAACAG GTCTTCGAGG ATTTTGTCAA AGAATATAAA CTGACCGTTT ACCGCGATGA ATGGCTCGAC CGCTCGGCCA AGGGATTAAC CAAACAAGGC GGTAGCATAC GCTCGTTCCG GACGTTGAGC GGAGCCGTCT ATCAGGGAAA AATGTTCATC GATGTCACGT ACGAAGGCGA TCTGATGGCC GCTGCGGGTG TAAAATACCA CGTGGGCCGG GAAGCCAACA GCGTGTACGG CGAAAAATGG AATGGCGTGC AGAAAGGCGT TTTTCAGCAC GGACACCACT TCAAAACCAA CATCAGCCCA TATAAAATAC CGGGCGATCC GGCTAGCGGA CTTCTCCCTG AAATATCGGC AGAACAACCC GGCGAGAACG GAACCGGCGA CAATAAGGTC CAGACCTACT GCTTCCGGAT GTGCCTGAGC AACCACCCCG ACAACCGGGT GCCCTTTCCG AAACCCGAAG GCTACAACCC CGACCGCTAC GAACTGCTGG CTCGGGTATT TGCGGCCGGC TGGCGCGAAA CCTTCGATAA ATACGACCCT ATTCCCAACC GCAAAACGGA CACCAACAAC CACGGCCCGT TCAGTACGGA CTACCTCGGC AAAAACTACG ATTATCCAGA AGCGACCTAC GCCCGCCGGA AGCAGATCAT TAAGGACCAT GAACTGTACC AGAAGGGCCT GATGTACTTT CTGTCCAATG ACCCACGCGT TCCGGCCGAT GTTCACCAGG CCATGCAGCA GTGGGGCCTG GCGAAAGATG AGTTCACCGA TAACGGAAAC TGGCCGCATC AACTCTATAT TCGCGAAGCC CGACGCATGC TGGGTGTCTT TGTCATGAAA GAAGCCGACG CGCTTGGCAA AACAACCGTG CCGCAGCCCA TTGGCATGGG CTCCTATTCG CTGGATGCCC ACAATGCCCA GCGGTACGTT AAACCAGATG GTTTCGTACA GAACGAAGGC GACATTGGCG TGCATCCTGA CAAACCCTAT TCGATTGCTT ACGGGTCTAT CTTGCCCAAA GAAACAGAGT GCAACAACCT GCTGGTGCCG GTTTGTGTAT CCAGTTCGCA CATCGCCTAC GGCTCCATTC GGATGGAACC CGTATTCATG ATTCTGGGCC AGTCGGCAGC CACGGCGGCT GTGCTGAGCA TCGACAACAA AGTGAGTCCG CAACGGCTGC CCTACGAGAC GTTGAGCAGG GTGTTGTTAA ATGATAAACA GCGGTTAACG CTGAATGCAG CAAACTAA
|
Protein sequence | MNIFEFLIPR LASTLPRRSL RSGWLSAVLF LPLSVASNTP GQPPAAATPK TVVRQTEVKE SMVQKAVVRQ ADVIIYGGTS AAVTAAVQVK KMGKSVIIVS PDKHLGGLSS GGLGFTDTGN KEVIGGLSRE FYQRLYQHYQ QKDSWKWQKP EEYGNKGQGT PAIDGTSRTM WIFEPHAAEQ VFEDFVKEYK LTVYRDEWLD RSAKGLTKQG GSIRSFRTLS GAVYQGKMFI DVTYEGDLMA AAGVKYHVGR EANSVYGEKW NGVQKGVFQH GHHFKTNISP YKIPGDPASG LLPEISAEQP GENGTGDNKV QTYCFRMCLS NHPDNRVPFP KPEGYNPDRY ELLARVFAAG WRETFDKYDP IPNRKTDTNN HGPFSTDYLG KNYDYPEATY ARRKQIIKDH ELYQKGLMYF LSNDPRVPAD VHQAMQQWGL AKDEFTDNGN WPHQLYIREA RRMLGVFVMK EADALGKTTV PQPIGMGSYS LDAHNAQRYV KPDGFVQNEG DIGVHPDKPY SIAYGSILPK ETECNNLLVP VCVSSSHIAY GSIRMEPVFM ILGQSAATAA VLSIDNKVSP QRLPYETLSR VLLNDKQRLT LNAAN
|
| |