Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1877 |
Symbol | |
ID | 8725614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 2267066 |
End bp | 2268391 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | |
Product | Curlin associated repeat protein |
Protein accession | YP_003386721 |
Protein GI | 284036791 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.246502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAG CTGTATTATC CGGTATTGCA CTGATGTGGT TTGCCGCTGC CGCGTACGCT CAAACGAATA CCGCAAACCT CGACCAAAAT GGAATATCAC AGTCCGGTAC CATCAACCAA ACGGGGCAGG AGCAGACCGC CACGATTCTC CAGAAAGGCA CGGGGAATTT AAACAACATC GGGGTGATTC TGCAATCATC GAAGGTCGCT CAGACCGGTA TCATTAGCCA GATGGATGGT GCCAAAAGCA ATAGTGCCAA TATAAATCAG ACTGACCTGA TTCCAACCTC AAGCAGCACG GATCTCAACA CAGCCAGCAT TCAGCAGATT ACGGGCGCTA TCGGCGGTAG TGACTTAAAT ATTGCTCAGA TTACACAGAG TGGCCGGGGG GCTGCGGCCA GTATTGAGCA GAAAAATTAC TCGACTAGTA ACCAGGCCAG TATAGTCCAG GCGCTGGCCG GTGCCGGTAC AGCTACCATT CATCAGAATG GTAGTGCGTC GCAGAATACA TTCTCAGTAG GGCAGCGAGC AACCATCAGC CAGATGGGCG GAAACCACAC CGCTTCTATT GACCAAAGTG CTTACAGCGC AAACAACCAG TCGTACATAA CCCAAACAGG TGGCGGTGCT ACAAATGGCA ACACGGCGAT TACCCAGCAG AATAATACCT CGGGGCAAAA TATAATCACG ATGAATCAGT CGGGGTCTCT ACTAAACGCA ACAGTCGAGC AGTCATTTGA AAGTTACAAC AACAATGTAA AGCTTACGCA GTCGGGTTCG CAGAATAACG CGACTATTAC GGAGACGCTG GTATCGGATG GAAACTCAGT TGAATCGTCG CAGTCCTTTG TTTTCAATAA TTTATACGTG GAGCAGAAAA ATAGTTCTAT CAATAACAAA GTATATACGA ATCAGGGCGG ATTTATAGGT ACGATAACCG TTCGGCAAAA TGGAGCCAGC GAAAGTGTGG CAACCGTTAA TCAGAACACA AACTTTATTG GGTTATTTAA CCTGACTGAT CTAATGCAAA CGGGCCACAA CCAGGTAGTG ACAGTTTCTC AGGATAATAG TTCGAATACG ATTCGTTTTA AGCAGGAAGA TGGTGGCGGA ACCAAGGGAG GGAACGTGGC CACACTGACT CAGTTGGGTG ACACTAACGT AATTCAGGGT ATTGATGGTA CGGGTGGGTT TGGCCTGCAA AGCGGAACGA CAAACACGTT GACTATCTCC CAGAATTCAT CCGGTACTAT TTTGTCAAAC ATAGCCAATG TAAGTCAGGT TGGTACCGGC AACATTGGGT CGATCACCCA AGTAAGAGGT AACTAA
|
Protein sequence | MKKAVLSGIA LMWFAAAAYA QTNTANLDQN GISQSGTINQ TGQEQTATIL QKGTGNLNNI GVILQSSKVA QTGIISQMDG AKSNSANINQ TDLIPTSSST DLNTASIQQI TGAIGGSDLN IAQITQSGRG AAASIEQKNY STSNQASIVQ ALAGAGTATI HQNGSASQNT FSVGQRATIS QMGGNHTASI DQSAYSANNQ SYITQTGGGA TNGNTAITQQ NNTSGQNIIT MNQSGSLLNA TVEQSFESYN NNVKLTQSGS QNNATITETL VSDGNSVESS QSFVFNNLYV EQKNSSINNK VYTNQGGFIG TITVRQNGAS ESVATVNQNT NFIGLFNLTD LMQTGHNQVV TVSQDNSSNT IRFKQEDGGG TKGGNVATLT QLGDTNVIQG IDGTGGFGLQ SGTTNTLTIS QNSSGTILSN IANVSQVGTG NIGSITQVRG N
|
| |