Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4642 |
Symbol | |
ID | 8728406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5652264 |
End bp | 5654669 |
Gene Length | 2406 bp |
Protein Length | 801 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003389419 |
Protein GI | 284039489 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000162597 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCGCA ACTACCTCAC TATCTCGCTC AGAAATCTAT GGCGTAACCG GAAAGTTACG CTGATCAGCA CCGTCGGACT TTCCATCGGG CTGGCTTGCG GCCTTGTTAT CTTTTTGCTG GTCAGCTACA TGTTCAGCTT CGACCGCTAC CATACCAAAG CTGACCGAAC CTACTGGGTT GTTACAGACA TTCGGCAGGA GAACGTTGTG CCAACGGATG CCACGCCCCG GCCGATGGGC GATGTGCTTC GGGAGGAGCT TCCATTTGTG GAAACGGCGG CCCGGCTCGA AAACAGCCCC AGTCGGGTCA TGGCGGTACC CGATGGGAAA GGCGGATTTT CGAAGAAATT TGACGAATCG CGTAGCCTTT GCTTCACCGA ACCGCAGTTC TTCAGCGTGT TCGATTCGGA CTGGTTAAGC GGCAATCCAG AAACCGCTCT GGCAGCCCCG AACACGGTTG TTCTGACGGA ACGGTACGCT CAAAAATACT TTGGTTCAGC CAATCCAATG GGTAAGGTGC TGCGCTTCGA TAACCAGACC GACCTGACCG TTACAGGACT TATCAAAAAC CTGCCGTCCA ACACCAAGCT CCGGTACGAC GCTTTCATTT CCTACGCAAC CGTGCCTACT CTTTCGGGCG GAGGAGGCCA ACAGGCCATG CAGGACTGGA GCCGGGTATT TACCGTATGT TTCGTCACCC TCCGCCAGGG CACGCCCGTA GAACGGCTGC TCGATGCCTT TCCGGTTATC CGGAAAAAGT ACCTGACCAC GCCCGAAGCG AAAAAACTGG ACTTTCATGC CATTCCACTC CCGGACCTGG AGCATATGCC TCAATACGGT GGCCGGTCGC CAGGGCTCAT TTTGTACACC CTGATTATTG TCGGGCTGTT TCTGGTGCTG GCGGCCTGTA TCAATTTCAT CAACATTGCC ACCGCTGGTG CCCTGAAACG CGCCAAAGAA GTGGGCGTTC GGAAAGCGGT GGGCAGTTCG CGAGGGCAAC TCATCGGGCA ATTTATGATT GAAACAACGC TGGTTACGCT GGCGGCTGTT GCACTGGCGA TGCTGCTGGC CCACCTCTGT TTGCCCATGC TGAACAGTGT CCTGTCTGTT ATGCACACCG ATATCTCCAT TACAAACCTG TTCCATCCCG ACTCGCTGGT CTGGTTTGTC GCGTTGCTTG TTGGCGTTAT TCTATTGGCT GGCTTGTATC CCTCGCTGGT GCTGGCCCGT TTTAATCCGG TAGCGGCCCT GCGCGGGCGA CTCAGCACGC AACAGATTGG CGGGGTATTC GTTCGGCAGG GGCTGATCGT CACGCAATTT TTTATCACCC AGCTCTTTAT CATTGGCGTG GGCGTCATGC TGGCGCAGGT ACGGCACATG CAGCAGGCCG ATCTGGGGTT TCAGAAAGAA GCGATTTTGA CGGTGCCGGT GCCCGTTAGC AATGCCCTTA AGCAGGATGT TGTTCGTGCC CGGATGGCAC AGATAGCGGG TGTCGAAGCC GTGTCATTAG GTGCCGACCC GCCCGCAACC TACCGGCGAC TGCCCGTGCC GTTCACCTAC GACACCCACA CGCAGCCGGA AAAATTCCCC ACGGTGGTTA AAGTTGGCGA CAAAAACTTT GTGTCGCTGT ACGGAATCAG GCTGCTGGCC GGGCGCAACT TCCGAACCAA CGATACGACG AACAACGAAG CCCTCGTGAA CGAAACAATG GTAAGAGAAT TGGGGCTGCG CTCGGCCCGC GATGTACTCG GCAAACGCGT TAACCTGTGG GGTGGCGACA AAACGATTGT GGGTGTCGTG CGCGATTTTC ACCTGAGCGA TTTACACCAG GGTATTCCGC CCGCCACCAT TCTGAACTAC TACCGCGAGA ACCGAATGGC CTCACTAAAG CTCAATCCAA CCGATATACC GACAACCCTG AAGGCAGTTG AAAGTACCTG GAATGAGTTA TTCCCGGAAC AGGTATTCAA AGCCAATTTC GTCGACGATC TGCTGGCTAA CTTTTACATC ACCGAACACG TCCTGCTGGG GCTGGCCGAA GTGTTTTCGC TCATTGCCGT TCTGCTCAGT TGCCTGGGTT TATATGGTCT GGTAACGTTT ATGGCCGAAG CGAAGACGAA AGAGATTGGT GTTCGGAAAG TGCTGGGGGC CACTCCTGCT CAACTTGTGT GGTTGTTTGG TCGCGAGTTC AGCCGACTCG TTCTGCTTGG CTTTGTACTG GCGGCTCCGC TGGGCTGGTT TCTGATGAAC GGCTGGCTAC AGGGGTATGC CTACCGCATT AATTTCAGCG GCTGGCTGCT GGCCGCAACC CTCGTTATAG CCAGCCTGAT TACGGCCCTG ACAGTTGGCT ACGAATCGCT GAAAGCCGCC CGCATGAACC CGGCAAAAAG CCTTCGAAAC GAGTGA
|
Protein sequence | MFRNYLTISL RNLWRNRKVT LISTVGLSIG LACGLVIFLL VSYMFSFDRY HTKADRTYWV VTDIRQENVV PTDATPRPMG DVLREELPFV ETAARLENSP SRVMAVPDGK GGFSKKFDES RSLCFTEPQF FSVFDSDWLS GNPETALAAP NTVVLTERYA QKYFGSANPM GKVLRFDNQT DLTVTGLIKN LPSNTKLRYD AFISYATVPT LSGGGGQQAM QDWSRVFTVC FVTLRQGTPV ERLLDAFPVI RKKYLTTPEA KKLDFHAIPL PDLEHMPQYG GRSPGLILYT LIIVGLFLVL AACINFINIA TAGALKRAKE VGVRKAVGSS RGQLIGQFMI ETTLVTLAAV ALAMLLAHLC LPMLNSVLSV MHTDISITNL FHPDSLVWFV ALLVGVILLA GLYPSLVLAR FNPVAALRGR LSTQQIGGVF VRQGLIVTQF FITQLFIIGV GVMLAQVRHM QQADLGFQKE AILTVPVPVS NALKQDVVRA RMAQIAGVEA VSLGADPPAT YRRLPVPFTY DTHTQPEKFP TVVKVGDKNF VSLYGIRLLA GRNFRTNDTT NNEALVNETM VRELGLRSAR DVLGKRVNLW GGDKTIVGVV RDFHLSDLHQ GIPPATILNY YRENRMASLK LNPTDIPTTL KAVESTWNEL FPEQVFKANF VDDLLANFYI TEHVLLGLAE VFSLIAVLLS CLGLYGLVTF MAEAKTKEIG VRKVLGATPA QLVWLFGREF SRLVLLGFVL AAPLGWFLMN GWLQGYAYRI NFSGWLLAAT LVIASLITAL TVGYESLKAA RMNPAKSLRN E
|
| |