Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_0999 |
Symbol | |
ID | 8724729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 1214550 |
End bp | 1215758 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003385849 |
Protein GI | 284035919 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00052376 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.231875 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTCC CCGCCTGGAT TGCCCGTCGT TACTTCTTCT CCCGTAAAAA ACGCAGTTTT ATCAGCTGGC TGTCTATTTT ATCAATGCTC GGTGTGGGTG TTGGTACGAT GGCGCTAGTG GTGGTACTGT CTGTGTTCAA TGGGATGGAG GAGTTGAACA AACAGATATT CAGGACGTTT GAGGCCGATA TGACCATTGC GCCGAAACAG GGGAAGCGAT TTTTAGCAAC ACCCGCCCTG AAAAAAAGAC TGCAAAAGAT ACCGGGTGTA AATCTGCTGA CGACCGTTGC TCAGGATAAT GCGCTGGCCC GCTACGGCAA TGCGCAGACC GTTGTTCGGC TGAAGGGCGT CGATAATAAT TATCTTCAAC GTCAGCAACT CGACTCAGCT TTACTGGAAG GGAAATTACT GCTTCAAAGG CAGGGCGTAA ATTATGCTAT TGTAGCCGAT GGGGTTCGTA GTGACCTCAG CGTATCGCCA ATTGATATTC TGACTCCCCT CGAAATCCTG TATCCACAAA GCGGACAGTC GTTTAGCGTG CTCAATCCCG ATGCGTTTAA CCGCGAAGCG TTCACAGTTT CGGGCGTGTT TTTCATTGAG TCGAAGTACG ACAACTTTGT GCTGGCCCCC ATTACCTCCG CACAAACCCT GTTTGGCTAC CAGCCCGACG AAGTAACGAG TCTGGAAATC CAGCTCCGGC CCGGAACCAA CGAAGTCGAC GTAAAACAAG CGCTTCAGGA TGCGGTTGGT GATAAGCTGA TTGTACAAAG TCGGGATGAT CTAAACGTTG ATCTGTACAG GGCGATACGC GTCGAAAAGT TGTTTACCGC CCTGACGCTC GGATTCATCA TTCTGGTCGC ATCCATAAAT ATATTCTTCT CGCTTTCCAT GCTGGTCATC GAGAAGAAGG CAGATATCCG GATTCTGTAT GCGCTGGGTG CTACCCGGCC TATGGTACGT CGAATTTTTC TGACAGAAGG TGCCATTATT GCCCTGACCG GGGCCTTTGC CGGTCTGATA CTGGGCATTG GTATCTGCCT GGCGCAGGAA CGTTACGGCT TTATTCGTAT GGGTACCGAA AGCTCAATCA TCGACGCGTA TCCTGTACGC CTTGACACCA GCGATATTCT GCTGACAGGC GTGTTGGCCA TTGTAATGAC CATTCTGACT TCCTGGTTCC CAGCTCAGCG AGCCGCTAAC GTTCGTTGA
|
Protein sequence | MNLPAWIARR YFFSRKKRSF ISWLSILSML GVGVGTMALV VVLSVFNGME ELNKQIFRTF EADMTIAPKQ GKRFLATPAL KKRLQKIPGV NLLTTVAQDN ALARYGNAQT VVRLKGVDNN YLQRQQLDSA LLEGKLLLQR QGVNYAIVAD GVRSDLSVSP IDILTPLEIL YPQSGQSFSV LNPDAFNREA FTVSGVFFIE SKYDNFVLAP ITSAQTLFGY QPDEVTSLEI QLRPGTNEVD VKQALQDAVG DKLIVQSRDD LNVDLYRAIR VEKLFTALTL GFIILVASIN IFFSLSMLVI EKKADIRILY ALGATRPMVR RIFLTEGAII ALTGAFAGLI LGIGICLAQE RYGFIRMGTE SSIIDAYPVR LDTSDILLTG VLAIVMTILT SWFPAQRAAN VR
|
| |