Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1574 |
Symbol | |
ID | 8725308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 1902062 |
End bp | 1905061 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | glycosyl hydrolase, BNR repeat-containing protein |
Protein accession | YP_003386422 |
Protein GI | 284036492 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0267092 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGTAC ATTACACCCT CAACCGGCTA TTTTTAGGCC TGACATTCAT CTTTTCCGCC GTTACCCTAC ACGCGCAGCC TTTCTCGTCA AAGTTATTTG ATGCCATGAA ATGGCGGATG ATTGGTCCGC ATCGGGGCGG CCGGACGGTT GGCGCTACCG GTGTTCCTCA GCAACCCAAC GTCTTCTACA TCGGTGTAAA CAATGGTGGC GTCTGGAAAA CGACGGACTA TGGCCGTACA TGGGTGCCCA TTTTTGATGA TCAGCCAACT GGTTCTATCG GCGATGTCGC GGTAGCACCC TCAAATCCGG ATGTGGTGTA CGTTGCCAGT GGCGAGGGGC TGCAACGGCC CGACCTGTCG GTGGGGAATG GGATGTATAA ATCGACGGAT GCTGGGAAGA CCTGGGCGTT TCTGGGCCTT AAAGACGGGC AGCAGATTGG CGGTATCAGT ATCGATCCAA CCAACGAGAA CCGGGTTTTT GCGGCTGTAC TCGGTCATCC ATACGGCCCC AATACCGAAC GGGGTGTTTA CCGAACCGTC GATGGCGGCA AAAAATGGGA ACGGGTGCTG TATAAAGATG AAAATACAGG CGCTGTTCAG GTAACCATTG ATCCAAAGAA TCCGAATATC GTGTACGCCG ATCTTTGGGC GGCTCGTCAG GGACCCTGGG AAAACGGGCA GTGGCAAGGG CCGGAAAGTG GTCTGTTCAA ATCGACGGAT GGCGGGACAA CCTGGCAAAA ACTGACCAAC GGCTTGCCTA CGGTTGAGCA GGGCCTGGGA CGGATTGGCT TCTGCATTGC CCCGAGTGAG CCCAATCGGC TATATGCAAC GGTCGACGCG CCCGAACTCG GTGGGGTGTA TCGCTCCGAT AATGCAGGCC AGTCCTGGAC ACGAATCAAT AACGACCAGC GGCTTTGGGG GCGGGGGAGC GACTTTGCGG AGGTAAAAGC GCATCCAACC AATCCTGATA TTGTGTTTAT CGCCGATGTT GCCGCCTGGA AATCGACCGA TGGCGGCAAA ACCTGGAACG ACTTCCGCGG GGCACCCGGT GGCGACGATT ATCACCGACT ATGGATTAAC CCTAACGATC CCAATACCCT GCTTCTGGCG GGCGATCAGG GCGGTATCGT CACCGTTAAC GGCGGGCAGA CATTCAGTTC GTGGTACAAC CAGCCTACGG CGCAATTCTA CCATGTGAGT ACTGATAATA GCTTTCCGTA TAATGTTCTG GGTGGCCAGC AGGAGAGTGG TTCGGTCGGT ATAGCCAGTC GGGGAAACGA CGGGCAGATC ACGTTTCATG ACTGGCACCC GGTAGGCGTG GAAGAATACG GCTACGTGGC TGCCGACCCG CTCGATCCAA ATATTATTTA CGGCGGTAAA GTGACAAAAT TCGACAAACG AACCGGGCAG GTGCAGAACA TAGCCCCCGA AGCGGTTCGT TCGGGTAAAT ACCGCTTTGT CAGAACTGCT CCGGTACTCT TTTCGCCCAT TGACCCTAAA GAGCTTTACT TCGCCGGAAA CGTCCTGTTC AAGACCCGCG ATGGTGGCAA CAGTTGGCAG GTTATCAGCC CCGACCTGAC GCGGACGTCC TACCCCGACA TTCCCGAAAG CGTGGGCGTG TATCGCACGG CTGATATGAC TACTATGCCC GCCCGGGGTG TCATTTATAC TATCGCGCCT TCGCATAAAA CAATTAACAC AATTTGGGTA GGAACTGATG ATGGACTGAT TCAGATTACC CGTGATGGCG GTAAAATATG GAAAAATATA ACCCCGCCGG GTGTGGGTTC CTGGAGTAAA GTTTCGCTTA TCGACGCCGG TCATTTCGAC GACAACACCG CCTATGCCGC CGTGAATCGG ATTCGCTGCG ACGACCTGCG TCCTCATATT TACCGAACAC ACGACGGGGG TAAAACTTGG CAGGAGATTG TAAGCGGTTT GCCCAATGAC CCCATTAATG CCGTTCGTGA AGACCCCTCC CGGAAAGGGC TGCTGTTTGC CGGGTCGGAA ACCGCCGTGC ATGTTTCGTT CGATGATGGC GATCATTGGC AACCGCTGCG GCTCAATATG CCCGCCACCT CGATTCGCGA TCTGGTCATT AAAGATGATG ATCTGGTAGT AGGTACACAT GGTCGGTCAT TCTGGATTCT GGACGACATT ACCCCGTTAA GACAACTGAC CGCTGATCTG GCAAAGGCTG AAACGATCCT GTATAAACCC CAGCGGGCCT ACCGGGTCCG CTGGAACATG AACCCCGATA CGCCGTTGCC GCAGGAAGAA CCCGCCGGGC AGAATCCACC CGATGGCGCC GTTATTGACT ATTTTCTAAA GGAAAACGCC GGTCAGGTTG TAACGCTGAC AATTAAGGAA GCGGCTGGTG CGATTGTCAG GCAATTCAGC AGTGACGATA AACCCTACGA TGTGCCCGAT GTAAATCTGC CCGCCTATTG GATTCGGCCG CAACAGATTC TGTCGGGAGG GGCTGGTTCG CACCGGTTCA TGTGGGATCT GCATTACACA CCTATGCCCG GTCCACCTTC CTATCCAATT GCGGCAACTT ATCACCAAAC GGCCCCTGAG TTCACTTCGC CCTGGGTCAT GCCCGGTACC TACACGGTGA CTTTGACGGT GGGCGGCAAA TCGTACACCC AGCCCCTAGT CGTGACCATG GACCCTCGGG TAAAAACGAG TCTGGTCGCG CTGAAACAGC AACACGATCT GTCGGTCATT GTGTTCGAGG GGCGGAAAAG CGTCATGGCT TTACAAAAAG AAGCGCAGGA CTTCAAGGGC CGTTCGATGA CGGACACCCA AAAAAAGACA TTCAGTACAT TACAAATGAC CCTGAACGGA CTAAACAGAG CGTTTAGTTC GCTGTTTGGT ACCCTTCAGG ATGCCGATAT GCCACCCACT ACACAGGCTG TAGCTGCCGT GAAGGAAGCG CAATCCGTTT TTGGAAAACT GGTGGGTGAG TGGAAAGCCT GGAAAAGCAC AGTACGCTAA
|
Protein sequence | MIVHYTLNRL FLGLTFIFSA VTLHAQPFSS KLFDAMKWRM IGPHRGGRTV GATGVPQQPN VFYIGVNNGG VWKTTDYGRT WVPIFDDQPT GSIGDVAVAP SNPDVVYVAS GEGLQRPDLS VGNGMYKSTD AGKTWAFLGL KDGQQIGGIS IDPTNENRVF AAVLGHPYGP NTERGVYRTV DGGKKWERVL YKDENTGAVQ VTIDPKNPNI VYADLWAARQ GPWENGQWQG PESGLFKSTD GGTTWQKLTN GLPTVEQGLG RIGFCIAPSE PNRLYATVDA PELGGVYRSD NAGQSWTRIN NDQRLWGRGS DFAEVKAHPT NPDIVFIADV AAWKSTDGGK TWNDFRGAPG GDDYHRLWIN PNDPNTLLLA GDQGGIVTVN GGQTFSSWYN QPTAQFYHVS TDNSFPYNVL GGQQESGSVG IASRGNDGQI TFHDWHPVGV EEYGYVAADP LDPNIIYGGK VTKFDKRTGQ VQNIAPEAVR SGKYRFVRTA PVLFSPIDPK ELYFAGNVLF KTRDGGNSWQ VISPDLTRTS YPDIPESVGV YRTADMTTMP ARGVIYTIAP SHKTINTIWV GTDDGLIQIT RDGGKIWKNI TPPGVGSWSK VSLIDAGHFD DNTAYAAVNR IRCDDLRPHI YRTHDGGKTW QEIVSGLPND PINAVREDPS RKGLLFAGSE TAVHVSFDDG DHWQPLRLNM PATSIRDLVI KDDDLVVGTH GRSFWILDDI TPLRQLTADL AKAETILYKP QRAYRVRWNM NPDTPLPQEE PAGQNPPDGA VIDYFLKENA GQVVTLTIKE AAGAIVRQFS SDDKPYDVPD VNLPAYWIRP QQILSGGAGS HRFMWDLHYT PMPGPPSYPI AATYHQTAPE FTSPWVMPGT YTVTLTVGGK SYTQPLVVTM DPRVKTSLVA LKQQHDLSVI VFEGRKSVMA LQKEAQDFKG RSMTDTQKKT FSTLQMTLNG LNRAFSSLFG TLQDADMPPT TQAVAAVKEA QSVFGKLVGE WKAWKSTVR
|
| |