Gene Information |
Locus tag | Sare_1905 |
Symbol | |
ID | 5708114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2198518 |
End bp | 2199696 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641271409 |
Product | thiamin pyrophosphokinase catalytic region |
Protein accession | YP_001536781 |
Protein GI | 159037528 |
COG category | [S] Function unknown |
COG ID | [COG4825] Uncharacterized membrane-anchored protein conserved in bacteria |
TIGRFAM ID | |
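
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(2198518, 2199696)
print(length)                  # 1179, matching the Gene Length field
print(protein_length(length))  # 392, matching the Protein Length field
```

Under these assumptions, the listed Start bp, End bp, Gene Length, and Protein Length values agree with one another.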
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.028315 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000109724 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTCTAC CTACGTTGCG CTGGACCCGA CCCGCGGAGC CGGGCCGGGT CGCCGGCACC GCCCGCCTGG ATCGTCGGAC CAAGCGCCTG GTTGGCCGGC TCCGGCCTGG GGACATCGCC GTGATCGACC ATGTCGACCT GGACCGGGTG GCCGCCGATT CGCTTGTCGC GGTCGGTGTT GGGGCCGTGC TCAACGCGAA GCCGTCGGTT TCGGGTCGTT ATCCCAATCT CGGCCCGGAG GTGCTCATCG AGGCTGGTAT CCCGCTCCTG GACGACTTGG GGGAGAGTGT CTTCGAGCGG ATCCAGGAGG GCGACACTGT CCGGATCGAG GGCAACACCG TCAATCTCGG CGAGGAGCCG GTGGCCCACG GTGTCTTGCA GGATACGGAG ACCGTCGGGA AGGCGATGGC TGATGCCCGG GAGGGGCTGT CGGTCCAGTT GGAGGCCTTC GCCGCGAACA CCATGGTCTA CCTGAAGCAG GAGCGGGACC TGCTGCTGGA CGGTGTGGGC GTTCCGGACA TCCGTACCGA GATTCAGGGG CGGCACTGCC TGATCGTGGT GCGTGGCTAC GACTACAAGG CTGACCTGGA TGTGCTGCGC CCGTACATTC GGGAGTTCAA GCCGGTCCTC ATCGGTGTCG ACGGTGGGGC GGACGCCCTG GTCGAGGCCG GCTACCCACC CGACCTGATC ATCGGTGACA TGGACTCGGT GACCGACGAC GTGCTGCGCT GCGGTGCCGA GGTCGTGGTA CACGCCTACC CGGACGGGCG GGCGCCGGGG CTGGCGCGGG TCAATGGTCT CGGGGTTCCG GCGGTCACCT TCCCCGCCGC GGCCACCAGC GAGGATCTGG CGATGCTGCT CGCCGATGAG AAGGGCGCGT CGTTGCTGGT GGCGGTCGGC ACGCACGCCA CGCTCGTCGA GTTCCTGGAC AAGGGGCGGG GTGGGATGGC ATCGACCTTC CTCACCAGGC TGAAGGTCGG TGGGAAGCTG GTCGACGCCA AGGGCGTCAG CCGGCTCTAC CGGCAGAGCA TCTCCGGATC CTCACTGCTG CTGCTGGTGC TCTCGGCGAT TGCCGCGATG GCCTCGGCCG TGGCGGTCTC CACCGTCGGG AAGGCGTACC TGGGCGTGGC CTCCGAGTGG TGGAACAATT TCGTGTTCCA GCTGGAGCGG CTCTTCTGA
|
Protein sequence | MRLPTLRWTR PAEPGRVAGT ARLDRRTKRL VGRLRPGDIA VIDHVDLDRV AADSLVAVGV GAVLNAKPSV SGRYPNLGPE VLIEAGIPLL DDLGESVFER IQEGDTVRIE GNTVNLGEEP VAHGVLQDTE TVGKAMADAR EGLSVQLEAF AANTMVYLKQ ERDLLLDGVG VPDIRTEIQG RHCLIVVRGY DYKADLDVLR PYIREFKPVL IGVDGGADAL VEAGYPPDLI IGDMDSVTDD VLRCGAEVVV HAYPDGRAPG LARVNGLGVP AVTFPAAATS EDLAMLLADE KGASLLVAVG THATLVEFLD KGRGGMASTF LTRLKVGGKL VDAKGVSRLY RQSISGSSLL LLVLSAIAAM ASAVAVSTVG KAYLGVASEW WNNFVFQLER LF
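
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch for removing that display formatting and computing a GC fraction follows. The helper names `strip_blocks` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, so its result is not the 68% reported for the full gene.

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace used for display blocks and normalise case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    cleaned = strip_blocks(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


# First two display blocks of the gene sequence from the record.
first_blocks = "ATGCGTCTAC CTACGTTGCG"
print(strip_blocks(first_blocks))  # ATGCGTCTACCTACGTTGCG
print(gc_fraction(first_blocks))   # 0.55
```

Passing the complete Gene sequence field to `gc_fraction` would give a value to compare with the GC content field.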