Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4722 |
Symbol | |
ID | 8728486 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5749022 |
End bp | 5752006 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | Immunoglobulin V-set domain protein |
Protein accession | YP_003389499 |
Protein GI | 284039569 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.933088 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACATT TTGTACGAAT GGGGACGGCT CCCCGTAATC CGTTTCTTTA CCTGCTGCTC TGGGCAGGCC TTTGGCTGCT CTCCGCTCCC GCGCTGTTTG CGCAAACTGG TCTGCAACAG TCGCTGATTG CCAACCCCGA TAATGCCGGA ACTTACCAGG GCTACCAGGG CCAGCCGATT GTGTTTAACA ACGCGCTGTA TGGCTTGTAT CTGAACGAGA GTGGGGTCTA TCAATTAGCC AAATACAATG GCACGAGTTT GACCCTGATT GCCAATCCCG ATAATGCCGG AACTTACGAG GGCTATGATA GCGGTCTGAT TGTGTTTAAA AACGCGCTGT ATGGCGTTTA TAGGAACGCG AGTAGAGTCT ATCAATTAGT CAAATACAAC GGCACAAGTT CGACCTTAAT TGCCAACCCC GATGAGGCCC CAACTTACCT GGGCTATTAT GGCCAGCCGA TTGTGTTTAA CGACGCGCTG TATGGCAAGT ATATGAACAA GAGTGGGGCG TATCAATTAG TCAAATACAA TGGCACGAGT TTGACCCTGA TTGCCAACCC CGATAATGCC GGAAATTACC AGGGCTACCA TGGCGATCCG ATTGTGTTTA ACAACGCGCT GTATGGCCGG TATATGAACG CGAGTGGGGC CTATCAATTA GCCAAATCCA ATGGCACGAG TTTGACCCTG ATTGCCAACC CCGATAATGC CGAATTTTAC CAGGGCTATG TAGACGATCT AGACTATCCG ATTGTGTTTA ATAACGCGCT GTATAGCCAG TATCTGAACG CGAGTGGGGC CTATCAATTA GCCAAATACG ATGGCACGAG TTCTACTCTG ATTGCCAACC CCGATAATGC CGGAAGTTAC GAGGGCCACT CGATTGTGTT TAACAACACC CTGTATGGCC AGTATCTGAA CGCGAGTGGG GTTATTCAAT TAGCCAAATA CAATGGCACG AGTTCGACAC TGATTGCCAA CCCCGATAAT GCCGGAAGTT ACCAGCGCTC CCCGATTGTG TTTAACAACG CCCTGTATGG CCAGTATCTG AACGCGAGTG GGGTCTATCA ATTAGCCAAA TACGATGGCA CGAGTTCGAC CTTGATTGCC AACCCCGAAA ATGGCCCAAG TTACCGGGGC TATATGAGCG ATCCGATTGT GTTTAACAAC GCCCTGTATG GCAAGTATAT GAACGCGAGT GGGGTTATTC AATTAGTCAA ATACGATGGC ACGAGTTCGA CCCTGATTGC CAACCCCGAT AATGCCAAAG GTTGCGATGG CCACTCGATT GTGTTTAACA ACGCCCTGTA TGGCAAGTAT CTGAACAAGA GTGGGGTCTA TCAATTAGTC ACTGGGGTAC CCTGTGCATT GTCGCTGAGT ATCAACCCCT CCTCGCTAAC GATAACAGCG GGTGGCTCGG TCACGCTTAC CGCTTCCGGA GCTACGACCT ACACCTGGAG CAACGGCAGC ACGGCCAACC CGCTTATCGT CAGCAACGTC ACCAGTGCCA CGGCCTTTTC AGTGACGGGC GTAACGGGTA CGTGTTCGGC CACGGCCACG GCCAGCGTGA GCGTGGCCAC CATCACGGCG GGTACGACCT CGGGAACCAT CACGGCTTGC GCGGGTACGG CATCGGCATT GCCCGCCGTG CAGCAATTCA ACGTTTCGGG CAGCACTCTT TCAGGAAACA TCGTGGCTAG TGCCCCGCTC GGTTTCGAGC TTTCCACCAC TGCCAGCACT GGCTATGCGG CCTCTCTGAC GCTCACTCAA TCGAGTGGCG TAGTGGCCAA CACCACCATC TACGTGCGCT CGTCTGCCTC GGCCAGCGGG AACCTTTCGG GCAATGTGAG TCTGGCTTCG AGCGGAGCGA CGACGCAAAA CGTAGCCGTG AGTGGAACGG TTACCCCCCT GGCCACCATT ACGGCCCAGC CCGTGGCCAG TTCATCGGTG TGCGCGGGCA CGACGGTCAC CGTCTCGGTG AGCACCAGTG GCCCGGTGAG CAGCTATCAG TGGTATAAAG GCGGCACCCT GCTCAGCGGC CAAACCTCGG CTACGCTCAC CCTGACCAAC CTCAGCACCA CCGATGCGGG CAGTTATTCG GTCGTGGTCA CGGGCAATTG CAACAGCCTC ACCTCAACCG CCTTTAGCCT GACAATCGTT GCCCGGCCCG ACGCGCCCGC CCTGACCCCC GCCAGCTCTA GTCTGGCAGC GACTCTGACA CCCCTCTCGC TGACGGGCTT TGCACTGGCA ACCACCGGCA ATAGCCTCCA CTTCTTCCAA GCCGGAGGTA GTGAACTCAG CCCGCCCACC ATCAATATTA CCACTGCCGG GGTAATGAGC TTTTGGGTCG GCCAGACCAG CAACGCCAGC GGCTGCAAGA GTTCACTCAC GCCATTGAGT CTGACCATCA CGGCCACCCC CACCAGCCAG ACGGCTACCC CCACCAGCCA GACCGTTTGC CGCAGCACCA ACGTTACCCT GAACGTCACC GTGGAGGGAA CCGCCTACCA GTGGTACAAA AACGGTACTA CCCTAGCCAA CAAACTTACC GAACTCACCA GTGCCCAGCG CGGTACGACC ACCGCCACCC TGACACTGGT CAATTTGCAA ACCACCGCCG ACTACTACTG CAAAATCACT ACTCCCAACG GCGTTCAGAC TGTGGGGCCC CTGAAGGTGA GTGTCAACTT TGGCTGTTCG GCCCGGCCTG CGGCCGAGGA AGCAGACTTG CAACTATTGG TACTGGTCAG GCCAAACCCT ATCGTAGACG GCCACCTGCG GGCCCTGGTG AAGGGGGCTC AGGGGCAAGC CCTGAACGTA GCCCTCTACA GTCTGCAAGG GGAGTTGGTG AACCAGCAGG TCTGGGACTC AGCCCCGGCC GAGGTCAATC TAGATTGGGA TATCAGCCAG CGCAGCATGG GTGTGTTACT CTTGCGGGCC CAGACCCCAA CTCAGCAGCA AACCATCAGG CTTATCCAAA ATTAA
|
Protein sequence | MEHFVRMGTA PRNPFLYLLL WAGLWLLSAP ALFAQTGLQQ SLIANPDNAG TYQGYQGQPI VFNNALYGLY LNESGVYQLA KYNGTSLTLI ANPDNAGTYE GYDSGLIVFK NALYGVYRNA SRVYQLVKYN GTSSTLIANP DEAPTYLGYY GQPIVFNDAL YGKYMNKSGA YQLVKYNGTS LTLIANPDNA GNYQGYHGDP IVFNNALYGR YMNASGAYQL AKSNGTSLTL IANPDNAEFY QGYVDDLDYP IVFNNALYSQ YLNASGAYQL AKYDGTSSTL IANPDNAGSY EGHSIVFNNT LYGQYLNASG VIQLAKYNGT SSTLIANPDN AGSYQRSPIV FNNALYGQYL NASGVYQLAK YDGTSSTLIA NPENGPSYRG YMSDPIVFNN ALYGKYMNAS GVIQLVKYDG TSSTLIANPD NAKGCDGHSI VFNNALYGKY LNKSGVYQLV TGVPCALSLS INPSSLTITA GGSVTLTASG ATTYTWSNGS TANPLIVSNV TSATAFSVTG VTGTCSATAT ASVSVATITA GTTSGTITAC AGTASALPAV QQFNVSGSTL SGNIVASAPL GFELSTTAST GYAASLTLTQ SSGVVANTTI YVRSSASASG NLSGNVSLAS SGATTQNVAV SGTVTPLATI TAQPVASSSV CAGTTVTVSV STSGPVSSYQ WYKGGTLLSG QTSATLTLTN LSTTDAGSYS VVVTGNCNSL TSTAFSLTIV ARPDAPALTP ASSSLAATLT PLSLTGFALA TTGNSLHFFQ AGGSELSPPT INITTAGVMS FWVGQTSNAS GCKSSLTPLS LTITATPTSQ TATPTSQTVC RSTNVTLNVT VEGTAYQWYK NGTTLANKLT ELTSAQRGTT TATLTLVNLQ TTADYYCKIT TPNGVQTVGP LKVSVNFGCS ARPAAEEADL QLLVLVRPNP IVDGHLRALV KGAQGQALNV ALYSLQGELV NQQVWDSAPA EVNLDWDISQ RSMGVLLLRA QTPTQQQTIR LIQN
|
| |