Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1972 |
Symbol | |
ID | 8725709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 2382278 |
End bp | 2383366 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | domain of unknown function DUF1738 |
Protein accession | YP_003386816 |
Protein GI | 284036886 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0738445 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0525568 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACCCGG TACCGCTCGT TCCACCAACA AGAATTTTCG CCAAGCCGTT GGCTTTCCCC GCCCAACCAA AATTCTTGCT GGTTCTCTTG CTACAGGCAC CGGGTTTGAC CATCCCCGTC AACCTGACCA GTGGAGAACA GACCATGACA ACGATTACTC ACCACACCGC AACCCAAACG ATACAGGAGC CGACCTTGAC CAAACCCACG ACACAATCAA ACGATGTCTA TGCTCGCATT ACCAACAAAA TTCTGGCCGA TCTCGAACAG GGCGAACTCA CCTGGCGCAA GCCTTGGAAT GCCGATCACT TGAGCGGTCA GGTCACGCGA CCTTTACGCT GGAACGGTAT TCCGTATTCC GGTATTAATA CGCTAATGTT GTGGGGAACT GCTGCCGAGC AGGGTTATAC CTCGCCGTAC TGGATGACTT ATAAACAGGC CAGCGAACTC AAAGCCAATG TTCGCAAAGG CGAGAAAGCA ACGCAGGTCG TCTATGCCGA TAAGTTCATG AAAGAAGATC AGGATGCCAA TGGCGAAATC ACAACCAGCC AAATCCCATT TCTGAAGTGT TACACAGTCT TCAATGCGTC GCAGATCGAG GGGCTGCCTG AGACGTATTT TCCGACGCCT GTACCAATTG GCACCGATGC TAAACAGCGT AATGCCGAAC TGGATGCATT TTTTGCCCAG ACCAAAGCCG ATATTTATAC CGGCACAAAT GCGTGTTACA TTCAAAGAAC GGATCGCATT CAAATGCCCC CGTTTGAAAG CTTTGAGAGT GTAAAAAGTT ATTATGCTGT TCTCGCCCAC GAGCTGACGC ACTGGACGAA ACACCCTGAC CGGTTAGACC GTGATATGGG TCGTAAACAC TACGGCGATG AAGGCTATGC GAAGGAAGAG CTAGTGGCCG AGCTGGGAGC CTGTTTCCTT GCTGCTGATC TGGGTTTTGA GCCTATGCCC GAAGTTCAGC ATGCAGCTTA CATCCAATCG TGGCTTCAAG CGTTGAAGGA TGATAAAAAA TTGATATTCA CGGCTGCCTC ACACGCACAA AAAGCGGTTG AATATCTGCT TGCTTTAACC TGTACATGA
|
Protein sequence | MYPVPLVPPT RIFAKPLAFP AQPKFLLVLL LQAPGLTIPV NLTSGEQTMT TITHHTATQT IQEPTLTKPT TQSNDVYARI TNKILADLEQ GELTWRKPWN ADHLSGQVTR PLRWNGIPYS GINTLMLWGT AAEQGYTSPY WMTYKQASEL KANVRKGEKA TQVVYADKFM KEDQDANGEI TTSQIPFLKC YTVFNASQIE GLPETYFPTP VPIGTDAKQR NAELDAFFAQ TKADIYTGTN ACYIQRTDRI QMPPFESFES VKSYYAVLAH ELTHWTKHPD RLDRDMGRKH YGDEGYAKEE LVAELGACFL AADLGFEPMP EVQHAAYIQS WLQALKDDKK LIFTAASHAQ KAVEYLLALT CT
|
| |