Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_1039 |
Symbol | |
ID | 8882224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 1099468 |
End bp | 1100475 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003509842 |
Protein GI | 291298564 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.763087 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATCACA CCCGCCCCAC CACTCTTGCC GACGTCGCCA AACGTGCCGG AGTCTCCGTG GCGACGGCCT CACGCGTGCT CAACGGCTCG CCGCACAAGG TCTCCGCCGG TCTGGCCGCC AAGGTGCGCG CCATCGCCGA CGAACTGCGC TACGCCCCCA ACGCGCAGGC ACAGGGCTTG GCGCGCAGCA CTTCCCGCGT GGTCGCGTTG CTGCTGCACG ACATCACCGA CCCGTATTTC GGGCAGATCG CGCAGGGCGT CCTTGCCGAG GCCGCCGAAC GCGACGTGTG CGTCGTCATC GCCGAGACCG GCATCGACCC CGACAACGAA CGCGACCAGC TGGCCTCGCT GGGAAGCCTG CGGCCCCGAG CGGCCATCAT GGTCGGCTCC CGCACCACCG AGACCGACGC CGAGCAGCGG CTGTCGCGGG CCCTGGAGGA CTTGCGCGCC ATCGGAACCG CCGTGGTCAA CGTCGGACAG CCGTCGCTGC CGGGCTCGTG CGTGCACCCG CTCAACCGCG AGGGCGCGAA GGAACTGGCC ACCCACCTGG CCGGGCTGGG ACACCGGCGG TTCGCGCTGG TCACCGGACC GGCCAGTCTG CGGGTGGTCG CCGAACGCCG CGAGGGTTTC GTCGACGGCC TGCCGGTCGA GGCCGGGGTG GAGATCATCG ACGCCCCGTT CAGCCGCGAC GGCGGCCACG ACGCCGGGAT GCGGTTCGCG GCCATGTCGA ACCGGGCCAG CGCCGTCTTC GTCACCAGTG ACGTGATGGC CTCGGGCTTC TACGCGGCGC TGCGGGAGTC CGGGCTGTCC ATCCCCGAGG ACGTCTCGGT GGCGGGCTTC GACGACGTCC CGGTGGCCGC CGACCTGTAC CCGGCGCTGA CCACGGTGCG GCTGCCGCTG TCGGGCATGG GCGCGCAGGC GCTGCGGCTG GCGCTGGACG GCGAGGAGGA GCAGACCGTC ACGATCGAAC CCGAGCTGAT CCCCCGGGCC AGCACCGCCC GGGCGTGA
|
Protein sequence | MDHTRPTTLA DVAKRAGVSV ATASRVLNGS PHKVSAGLAA KVRAIADELR YAPNAQAQGL ARSTSRVVAL LLHDITDPYF GQIAQGVLAE AAERDVCVVI AETGIDPDNE RDQLASLGSL RPRAAIMVGS RTTETDAEQR LSRALEDLRA IGTAVVNVGQ PSLPGSCVHP LNREGAKELA THLAGLGHRR FALVTGPASL RVVAERREGF VDGLPVEAGV EIIDAPFSRD GGHDAGMRFA AMSNRASAVF VTSDVMASGF YAALRESGLS IPEDVSVAGF DDVPVAADLY PALTTVRLPL SGMGAQALRL ALDGEEEQTV TIEPELIPRA STARA
|
| |