Gene Information |
Locus tag | Slin_5020 |
Symbol | |
ID | 8728785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 6117397 |
End bp | 6118653 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003389796 |
Protein GI | 284039866 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
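The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 6117397
end_bp = 6118653

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1257 0 418
```

Under these assumptions the result agrees with the listed Gene Length (1257 bp) and Protein Length (418 aa).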
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTGT TCCGAATTAG TTGGAGTAAC CTGAAAGACA AACCCCTGAG CAGTTTTTTA AGCGGACTGC TCATGACGTT CGGTATTACG ATTATTTCGT TGCTGCTGTT GCTCAATAAG CAATTGGATG ATCAGTTTCG CAAAAACATA AAAGGGATCG ACATGGTGCT TGGTGCTAAA GGAAGCCCGC TGCAGCTGAT TCTGGCGAGC ATCTACCAGA TTGATTCGCC AACGGGTAAT ATTCCACTCG ACGAAGCCGA ACGCCTGACC CGCAACCCGA TGATCAAAAC GGCGATTCCC CTGTCGATGG GCGATAATTA CCAGTCATTT CGGATTGTGG GTACCAACAA AAAATACCTC GATCATTTTG GTGCTACCGT TGCTCAGGGG AAGCTTTTTG ACAAAGCCCT GGAAACCGTA ATTGGTCCAC GGGTGGCGGC CGTTACCGGC TTAAAACTTG GCGATACGTT CTCTGGCTCA CACGGCCTCG ACAAAGACGG CGACGTACAC GCCGATACCA AATATAAGGT GGTTGGTATC CTGAGCCCCA CCAATACCGT TGCCGACCAG CTGATTGTTA CGCCATTGTC CAGCGTTTGG GCCATTCATG AGCACCATGA GGAACACGAA GAGGGACACC ATGACGAAGA AACGCAGCCA GCCGGACCAA CGCTTGGCGG CCCGGCGCCC GATGTAGCAG AAGAACCCGG AGAGCCCCGG GAGATTACCA GTATGCTTAT CAAGTTTCGG AACCCGTTGG GGATGATGCT GGCGCGGGGC ATTAACAGCA ACTCCAAACT TCAGGCGGCA TTACCCAATA TTGAGATAAA TCGCCTGTTT TCATTGCTTG GTGTGGGTGT TGAAACACTG CGGGGCCTGG CTATCGTCAT CATGCTGATT TCCGGCATCA GCGTGTTTGT TTCGTTGTAT AACTCATTGA AGGAACGACG CTATGAAATG GCTCTGATGC TGTCGATGGG CGCTACCCGT GCACAGCTTT TCGGTATGTT GCTCCTCGAA GGACTGGTGC TGGCACTGAT CGGCTTCATC CTCGGCATAC TTCTCAGTCG CGTTGGCTTG TGGTTATTTT CCAGCAGTGT ATCGTCGGAA TACCATTATA ATCTGGCCGC ATTCGGTATT CTGCCCGAAG AGTGGGTTTT GCTCGGCGTT GCCATTCTGA TTGGCCTGCT GGCCGCTGCC CTACCCGCTC TGGGCGTCTA CCGCATGAAC ATCTCCAGAA CGCTGGCTGA AGAATAA
|
Protein sequence | MNLFRISWSN LKDKPLSSFL SGLLMTFGIT IISLLLLLNK QLDDQFRKNI KGIDMVLGAK GSPLQLILAS IYQIDSPTGN IPLDEAERLT RNPMIKTAIP LSMGDNYQSF RIVGTNKKYL DHFGATVAQG KLFDKALETV IGPRVAAVTG LKLGDTFSGS HGLDKDGDVH ADTKYKVVGI LSPTNTVADQ LIVTPLSSVW AIHEHHEEHE EGHHDEETQP AGPTLGGPAP DVAEEPGEPR EITSMLIKFR NPLGMMLARG INSNSKLQAA LPNIEINRLF SLLGVGVETL RGLAIVIMLI SGISVFVSLY NSLKERRYEM ALMLSMGATR AQLFGMLLLE GLVLALIGFI LGILLSRVGL WLFSSSVSSE YHYNLAAFGI LPEEWVLLGV AILIGLLAAA LPALGVYRMN ISRTLAEE
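The sequences above are printed in space-separated groups of ten letters. The sketch below is illustrative and not part of the original record. It shows one way to strip that grouping, compute a GC fraction, and translate a CDS. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in its permitted start codons).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which translation table 11 also uses); "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between the 10-letter groups and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three groups of the Gene sequence field above.
prefix = "ATGAATCTGT TCCGAATTAG TTGGAGTAAC"
print(translate(prefix))  # MNLFRISWSN
```

Passing the full Gene sequence string to `gc_fraction` and `translate` gives values that can be compared with the GC content and Protein sequence fields of the record.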