Gene Information |
Locus tag | Slin_4029 |
Symbol | |
ID | 8727787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 4837876 |
End bp | 4839303 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003388818 |
Protein GI | 284038888 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.500538 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.302353 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGTC GTACGTTTAT CCAACAATCT GCCGGGGCAA GTGCTGGCCT GATGGCTGGT CAGTCACTTT GGGCCGCACC AGCCGCCGAC TTTCCCGTTG TTCGGACAGC AGCCGCTCAG CGCAATTTCA CCAGCTCGGC TGTAGAGCAG ACCATCCAGC GAATGCATAA AACTATTCGC GACCCGGAAC TGGCCTGGCT TTTCGAGAAC TGCTTTCCCA ATACCCTCGA TACCACCGTG CAGGTGAGTA CCAGCAATGG CCAACCCGAT ACGTTTGTCA TTACCGGCGA CATCGACGCT ATGTGGCTGC GCGACAGTAC GGCGCAGGTA TGGCCCTATT TGCCGCTCAT TAAGCAGGAT AAACCCTTGC AGCAACTGAT TGCGGGGGTT ATCCGTCGGC AATCCCTGTG CATTCGGCGC GACCCCTACG CCAACGCCTT TTATGCCGAT GCCAGCAAAG AGGGCGAATG GAAGAAAGAC GTGACAGCCA TGAAACCCGG TCTGCACGAA CGAAAGTGGG AACTCGATTC GCTTTGCTAC GCCATCCGGC TGGGTTATCA CTACTGGAAA ACGACCGGCG ATACCAGCCC ATTCGATGCC GACTGGCTAC AGGCCATGCA ACTCGTTTTG CAGACCTGCC GGGAGCAGCA ACGTAAAACC AGCAGAGGGC CTTATAAATT CAGCCGCGAA ACGTCCTGGT CGACAGATAC CGTGCCGGGC GATGGCTACG GGAATCCAAC GCGTCCGATT GGGTTGATAA ACAGCATCTT CCGCCCGTCT GACGATGCAA CGGTTTTTCC GTTTTATGTA CCCTCGAACT GGTTTGCAGT GGTGTCGCTT CGGCAACTGG CTACGATGGT CGATCAGATT CGGCCCACGC CTGCACTGGC TGCCGGTTGC CGGGCTTTGG CCGATGAAGT GGAACGGGCG CTGAAACAGT ACGCCATTTA TACGCACCCC AAATACGGGA AGATGTACGC CATGGAAGTA GATGGGTACG GAAATCACCT GCTTCAGGAC GACGCCAACG TGCCCAACTT ACTGGCTTTG CCGTATCTGG GTGCCATGCC CGCCAGCGAT CCGATCTATA AAAATACCCG TCGCTTTGTG CTCAGTCCGG ACAACCCGTA TTTCTTCAAA GGAAAAGCGG CCGAGGGTGT CGGCAGTCCG CACACGCTGG TCAACAACAT CTGGACTATG AGCCTGACCA TGCGCGCACT CACTTCCACC GACGATCAGG AGATTCTGGC GCAGCTTCGG CTGCTGAAGA AAACCCATGC AGGCACGGGC TTCATGCACG AATCTTTCAA CCAGGACGAC CCCGCTAAAT TCACCCGAAA GTGGTTTGCC TGGGCCAATA CCCTCTTCGG CGAACTGATC CTGAAAGTGG CTAACGAACG TCCGCAGCTG TTGGATAAAG TCCTGTGA
|
Protein sequence | MNRRTFIQQS AGASAGLMAG QSLWAAPAAD FPVVRTAAAQ RNFTSSAVEQ TIQRMHKTIR DPELAWLFEN CFPNTLDTTV QVSTSNGQPD TFVITGDIDA MWLRDSTAQV WPYLPLIKQD KPLQQLIAGV IRRQSLCIRR DPYANAFYAD ASKEGEWKKD VTAMKPGLHE RKWELDSLCY AIRLGYHYWK TTGDTSPFDA DWLQAMQLVL QTCREQQRKT SRGPYKFSRE TSWSTDTVPG DGYGNPTRPI GLINSIFRPS DDATVFPFYV PSNWFAVVSL RQLATMVDQI RPTPALAAGC RALADEVERA LKQYAIYTHP KYGKMYAMEV DGYGNHLLQD DANVPNLLAL PYLGAMPASD PIYKNTRRFV LSPDNPYFFK GKAAEGVGSP HTLVNNIWTM SLTMRALTST DDQEILAQLR LLKKTHAGTG FMHESFNQDD PAKFTRKWFA WANTLFGELI LKVANERPQL LDKVL
|
| |
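The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon (the gene sequence above ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table of this record.
START_BP = 4837876
END_BP = 4839303
GENE_LENGTH_BP = 1428
PROTEIN_LENGTH_AA = 475


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches table:", computed_gene_length == GENE_LENGTH_BP)
print("protein length matches table:", computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: 4839303 - 4837876 + 1 = 1428 bp, and 1428 / 3 - 1 = 475 aa.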