Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3894 |
Symbol | |
ID | 8727652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 4671964 |
End bp | 4674030 |
Gene Length | 2067 bp |
Protein Length | 688 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | |
Product | coagulation factor 5/8 type domain protein |
Protein accession | YP_003388683 |
Protein GI | 284038753 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.813544 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCGAC TCACACTTCT TGCCGCTTTC GTCAGTTTTT CTCTTTTATC GATAGCTCAG ACACCAGCCC CGTACGGTGC CGTTCCATCG CCCCGCCAGC TTGCCTGGCA TAAGCTCAAG TACTATGCCT TCGTCCATTT CAATATGAAC ACCTTCACCA ATGAAGAGTG GGGACACGGC ACCGAAACCC CCGATATGTT TAACCCCACT CAACTCGACT GTCGGCAGTG GGCGAAGGTG GCAAAAGAAG CCGGGATGGA AGGGATTGTC ATTACGGCCA AGCACCACGA CGGCTTCTGT CTGTGGCCGT CGAAATACAC GGAGCACTCG GTCAAAAACA GCAAATGGCG GAACGGGAAG GGCGATGTGC TGAAGGATCT GTCGGAAGCC TGTAAGGAGT ACGGTCTGAA GTTCGGCGTG TACCTGTCCC CCTGGGACCG CAATCACCCG GCCTACGGGA CGCCGGAATA CAACGAGGTG TTCAAGAAAA CCCTTCAGGA AGTGCTGACC CAGTACGGCG ATGTGTTTGA GGTCTGGTTC GACGGCGCAA ACGGAGAAGG GCCAAACGGC AAAAAACAGG TCTACGACTG GCCCGGCTTT ATCGAAACCG TACGCAAATA CCAGCCCAAC GCCGTTATCT TCAGCGATGC CGGTCCCGAC ATTCGGTGGG TAGGCAACGA AGATGGGTAC GCGGGCGAAA CCAACTGGAG CACGCTCAAC CGCGATAAGG TCTACCCGGC TTATCCAAAC TACTGGGAGT TGACGCTGGG TCACGAGGAC GGTACGCATT GGGTGCCTAC GGAGGTAAAC TGCTCCATCC GGCCGGGCTG GTATTACCAC GCCAGCGAAG ACAACAAAGA GAAGTCGCTG GAACACCTGG TCGATATTTA TTACAGCTCC ATCGGCCGCA ACGGCAACTG GCTCCTGAAC TTGCCCGTCG ATCGGCGTGG GCTGGTGCAT GAAAACGACG TAAAACAGCT CATGGCGCTG AAAGCCTACA CCGACAAGGC TACGCACAAC CTCGCCGGAG GGAAGAAAAT CACGAGCAAC AGTGTGTTCA GCAAGGCTTC GACCTTTGCC GCCGGTAACG TGCTCGATGC TAGCCGGGAT ACCTACTGGG CCGCTGCCGA AGGCGCCAAA CAGGCGACGC TGGATATTGA CCTGGGCAAG CCTACAACCC TCAACCGGCT GCTGATTGAA GAGTATATTG CGTTGGGCCA GCGGGTAAAG AAGTTCTCGG TAGCCGCCTG GCAGAAAGAT GGCTACCAGA CCATTGCCAG CGGAACGACC ATTGGGAACC GGCGGATTCT GCGATTCCCG ACCGTTACGA CCACCAAAAT TCGGGTGAGC ATCGACGAGT CGAAAGCCAG TCCGCTGATT CGGCATATCG AGATGTACAA CGCGCCCGAA CTGATCGTAA CGCCCGTGAT CAGCCGGAAT AAAGAGGGTA TGGTTACGAT TGTCTGCCCC AAAACGACCG ACCCGGTCAT TACCTACACC ACCGACGGCT CGGAACCGAC CGCCCAGAGT AAGCGCTTTA CCCAGGCCGT TGCTTTGCCG CAGGGCGGGG TTATAAAAGC CCGCGCCTTT GTCGATAACA TGAAAAAAGC GAGCAGCCCC GTAACGGCCG AATTCGACAT TAGTTCGGCG AAATGGACGG TTGTGTCGAC CGGCGACGCA GTACCCAAAA AAGAAGCTGT CCGGCTAATC GACGGCAACG CGGACTCGTT CTGGCAGCAA CGTAAACAGG CCGAAAGTCC AACATCGGTG GTGCTGGATT TGGGCGAGGA GCTGCCGCTG AAAGGCTTCA CGTACCTGCC GCGTCAGGAT GGGAAAAAGG CGGGTATCGT GTACCGATAT GCCGTTTCCG TAAGCCAGGA TGGGAAAACA TGGTCGGCAC CGGTAAGTCA GGGAGCGTTC AACAACATCA ATAATAACCC CGTTGGGCAA GCCGTCCGCT TCGACAAACC ACAAACGGCC CGGTTCCTGA AGTTCGACGC CCTCGAAACA ACCGAGGCCA GCGACGCCAC CGTATCCATT GCCGAACTGG GTGTACTGAC CCGCTGA
|
Protein sequence | MRRLTLLAAF VSFSLLSIAQ TPAPYGAVPS PRQLAWHKLK YYAFVHFNMN TFTNEEWGHG TETPDMFNPT QLDCRQWAKV AKEAGMEGIV ITAKHHDGFC LWPSKYTEHS VKNSKWRNGK GDVLKDLSEA CKEYGLKFGV YLSPWDRNHP AYGTPEYNEV FKKTLQEVLT QYGDVFEVWF DGANGEGPNG KKQVYDWPGF IETVRKYQPN AVIFSDAGPD IRWVGNEDGY AGETNWSTLN RDKVYPAYPN YWELTLGHED GTHWVPTEVN CSIRPGWYYH ASEDNKEKSL EHLVDIYYSS IGRNGNWLLN LPVDRRGLVH ENDVKQLMAL KAYTDKATHN LAGGKKITSN SVFSKASTFA AGNVLDASRD TYWAAAEGAK QATLDIDLGK PTTLNRLLIE EYIALGQRVK KFSVAAWQKD GYQTIASGTT IGNRRILRFP TVTTTKIRVS IDESKASPLI RHIEMYNAPE LIVTPVISRN KEGMVTIVCP KTTDPVITYT TDGSEPTAQS KRFTQAVALP QGGVIKARAF VDNMKKASSP VTAEFDISSA KWTVVSTGDA VPKKEAVRLI DGNADSFWQQ RKQAESPTSV VLDLGEELPL KGFTYLPRQD GKKAGIVYRY AVSVSQDGKT WSAPVSQGAF NNINNNPVGQ AVRFDKPQTA RFLKFDALET TEASDATVSI AELGVLTR
|
| |