Gene Information |
Locus tag | Slin_4109 |
Symbol | |
ID | 8727868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4947115 |
End bp | 4948845 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | |
Product | Rhs element Vgr protein |
Protein accession | YP_003388895 |
Protein GI | 284038965 |
COG category | |
COG ID | |
TIGRFAM ID | |
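The length fields above are consistent with the coordinates: assuming 1-based inclusive positions, the gene spans End bp - Start bp + 1 bases, and the protein length is the number of codons minus one stop codon. A minimal Python sketch of that check follows; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(4947115, 4948845)
print(length_bp, protein_length(length_bp))  # 1731 576
```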
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0273732 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCAGTTT ACTCTCCTGT TGTCGAAACG ACCCTGCTCA TCGAGGGAAA AAAAATTCCA ACGTTCCACT CCATAACCTT ACAGCAGTCT ATTCATACCA CACATGAATT CCGGGTTGTT TTTGAACATG AGTCCGTTGA CGAATTAGTG GTGCTGTTCT CCGATCAACC TGAAAAATTG AATCGAAAAT CAATAGAGCT TACCGTCAAA GCAGCTGGTG AGGGGCCTCC GCTGCAGTTT AAGGGCGTGA TTACCCAAAC CGAATTAAAA CAGCAGGATG ATGGCTACTG GGGGCAAATG ATCATAAGCG GACACGGTCA TTGTGAATCC CTGCTGACGA TAGCCGGTAC CCAGACGTTC ACCGATTTAT CGATAACCGA TATTGTAAAT AAATGCCTGA GTAGCTACAC GCAGGTGAAA AATGTGTCTG GCAGCGTAAA ACCCGCCAAG CTACCGTTTT GTGTTCGCTA TACAGAGTCG GTCTGGCACT TCATAAAGCG GCTGGCATAC GATTTTGGGG CCTGGTTCTA TTACGATGGG TCAGTACTTC GATTCACGAC CAGCCCGGGT ACAACTTCTA CGTTAAACCT GACCTTTGGC GCCAATTTAA CGCACTTTCG CACGGGCGTC CGAGCGGTAC CCGCTTACTT TAAACAGTAC GACTATTTAG CCGAAGAAGA CAAACGGCTG GAATCGGAAG CAGCCAAAGA CAAAACTCCT TATGACAAGC CGGAAACGAG TGGTGTCATT ACGCCCCATC CGGCCCAGAC AGCTGCCGAT ATGCCCGATT ACCGCGACAG TCGCCATGCT TCCCTAACCG CCGAAGAAAA ATATATGGAA GGGCAAGCTC GGGTTCCGGG CCTGTTTCCG GGCAGTAAAA TCGTGGTTAA GGATTCTGAA AGAGGAAAAG GCGGGCAATC AGCCCCTTAC CTCGTCACCG ACATTGTTCA CTACGTAAGT GGTGTTGGCG AGTACACAAA CAGGTTTCGG GCCATTCCGG CCGATGTAGC CGCTATGCCC GTGCGCAAAT TGGTTCGCCC GCTGGCCCAA ACGCAGATTG GTCAGGTTAC TGACATTAAA GACCCCAAAG GCATGGGGCG GGTAAAGGTA AGGCTGCTTT GGATGAGTGG CTCGGAATCG ACCCCCTTTA TCCGGATGAC TCAGCCGCAC TTCGGGGTTC ATACGGATAA CAAAAAAACG CGCGGCTTTC AATTTGTGCC AACCATTGGC GACCAGGTCA TGGTCGGCTT TGAGTACAAC AACCCCGAAC GTCCCTTTGT GATTGGCGCC TTACCCCACG GTAAAAACAG TGGTATGGAC ACCAGTAAGC CCGATGAAGA AAAGCACATC AGCGTAGGAA GTGGCAGTAC GCTGACGTTT ATTGAAAAAC CCAGCGTGAA AGAAATTCAT CTGCAAGTCG ATGAAAAGAA CTTCGTCAAG ATCTCCGTAC CGAGTGCGGG TGGTGATATA ACGATCAACT CTTCGAAAAA TATTGTTGTC AAAGCCACCG CGAAAGTGAC CATCGAAGCT CCCGAAATCG TGTTATCCGG AAACACCATT ACCTTAGATG CCAAACAGGC GGTCAATATT AAAGGCACAC AGGTTAAAGT AGAAGCATCA GCCCAGATGA ACATCAAAGG AGCCATGACC GATGTGGAAG GGTCGGGTAC GCTAAACGTC AAAAGCTCAG GCATGACAGC CATTAAAGGA TCAATGGTTA TGATTAATTA G
Protein sequence | MPVYSPVVET TLLIEGKKIP TFHSITLQQS IHTTHEFRVV FEHESVDELV VLFSDQPEKL NRKSIELTVK AAGEGPPLQF KGVITQTELK QQDDGYWGQM IISGHGHCES LLTIAGTQTF TDLSITDIVN KCLSSYTQVK NVSGSVKPAK LPFCVRYTES VWHFIKRLAY DFGAWFYYDG SVLRFTTSPG TTSTLNLTFG ANLTHFRTGV RAVPAYFKQY DYLAEEDKRL ESEAAKDKTP YDKPETSGVI TPHPAQTAAD MPDYRDSRHA SLTAEEKYME GQARVPGLFP GSKIVVKDSE RGKGGQSAPY LVTDIVHYVS GVGEYTNRFR AIPADVAAMP VRKLVRPLAQ TQIGQVTDIK DPKGMGRVKV RLLWMSGSES TPFIRMTQPH FGVHTDNKKT RGFQFVPTIG DQVMVGFEYN NPERPFVIGA LPHGKNSGMD TSKPDEEKHI SVGSGSTLTF IEKPSVKEIH LQVDEKNFVK ISVPSAGGDI TINSSKNIVV KATAKVTIEA PEIVLSGNTI TLDAKQAVNI KGTQVKVEAS AQMNIKGAMT DVEGSGTLNV KSSGMTAIKG SMVMIN
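The sequences above are printed in space-separated blocks of 10 characters. As a rough sketch of how such a field could be cleaned, checked for GC content, and translated with translation table 11, the Python below builds the codon table from the NCBI amino-acid string in TCAG order. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the example uses only the first three blocks of the gene sequence rather than the full record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Alternative start codons allowed by table 11 are not special-cased here;
    this gene starts with ATG, so that does not matter for this record.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence field.
prefix = "ATGCCAGTTT ACTCTCCTGT TGTCGAAACG"
print(translate(prefix))  # MPVYSPVVET
```

The translated prefix matches the first block of the protein sequence field. Passing the full gene sequence field to `gc_fraction` and `translate` would allow the same comparison against the GC content and protein sequence fields.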