Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_4436 |
Symbol | |
ID | 8885638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 4729075 |
End bp | 4730265 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | Thiamin pyrophosphokinase catalytic region |
Protein accession | YP_003513174 |
Protein GI | 291301896 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.463172 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.30245 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCTAC CTACATTGCG CCGACACGAC CGGGCGAGCG ACTGGGCCAC CGGTGCCGTC GGCGGGTCCG CGAAGCTCGA CCGCCGCACC AAGCGGCTGG TGGGGCGGGT CAATTGCGGG GACATCGCCG TCGTCGACCA TATGGACATC GACCGGGTCG CCGCCGATTC GCTGGTGGCC GCCGGGGTCG GCGCCGTCAT CAACGCCAAG CCCTCGATCT CGGGCCGGTA TCCGAACCTG GGGCCCGAGG TGCTGGTGTC GGCGGGGATC GTGCTGATCG ACGAGGTCGG CGACGAGATC TTCGACCGCA TCGACGACGG CGACAAGCTG CGCATCGAGG GCGGCAAGGT GTTCCGGGGC GAGGAACTGC TGGCCGAGGG GCGACGCCTG GACGCCGCCG CGGTCGCCGC CGCCATGGCC GAGGCCCGGC AGGGGCTGGC GGTGCAGCTG GAGGCGTTCG CCGCCAACAC GATGGAGTAC CTGAACAAGG AACGCGACCT GCTGCTGGAC GGCGTCGGTG TGCCCGAAGT GGACACCGTC TTCGCCGACC GGCACTGTCT CATCGTGGTG CGCGGCTACG ACTACAAGGC CGACCTCGAC GTGCTGCGGC CCTACATCCG GGAGTTCAAG CCGGTGCTGG TGGGTGTGGA CGGTGGCGCC GACGCCCTGG TCGAGGCCGG TTACACCCCG GATCTGATCG TCGGCGACAT GGACTCGGCC TCCGACGAGG TGCTGCGCTG CGGCGCCGAA CTGGTCGTGC ACGCCTACAG CGACGGGCAC GCGCCCGGCA TGACGCGGCT GGAGCACCTG GGGGTGGCGG CGAAGGTGTT CCCCGCCTCG GCCACCAGTG AGGACATCGC GATGCTGCTG GCCGACGAGA AGGGCGCCAG CATGATCGTC GCGGTCGGCA CCCACGCCAC GCTGGTGGAG TTCCTGGACA AGGGCCGCTC CGGCATGGCC TCGACGTTCC TGACCCGGCT GCGGGTGGGC GCGAAACTCG TCGACGCCAA GGGAGTCAGC CGCCTGTACC GGCAGCGGGT TCCCGCCTCG GGTTCGTTCC TGCTGGCGTG TTCGGCGATC GCCGGGGTGG CCGCGGCCGT GGCCCTGTTC ACCGCCGGTA TGAGTTTCGC CGAGACCGGC GCCGACTGGT GGGACGGCAT GGTTTCCGCG ATTCAAAGGA TCTTCTCGTG A
|
Protein sequence | MRLPTLRRHD RASDWATGAV GGSAKLDRRT KRLVGRVNCG DIAVVDHMDI DRVAADSLVA AGVGAVINAK PSISGRYPNL GPEVLVSAGI VLIDEVGDEI FDRIDDGDKL RIEGGKVFRG EELLAEGRRL DAAAVAAAMA EARQGLAVQL EAFAANTMEY LNKERDLLLD GVGVPEVDTV FADRHCLIVV RGYDYKADLD VLRPYIREFK PVLVGVDGGA DALVEAGYTP DLIVGDMDSA SDEVLRCGAE LVVHAYSDGH APGMTRLEHL GVAAKVFPAS ATSEDIAMLL ADEKGASMIV AVGTHATLVE FLDKGRSGMA STFLTRLRVG AKLVDAKGVS RLYRQRVPAS GSFLLACSAI AGVAAAVALF TAGMSFAETG ADWWDGMVSA IQRIFS
|
| |