Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_1753 |
Symbol | |
ID | 8882943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 1836517 |
End bp | 1837737 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | putative RNA polymerase sigma-24 subunit, ECF subfamily |
Protein accession | YP_003510544 |
Protein GI | 291299266 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.620196 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.802735 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCACGC CGCCCGTGGA CGTCGCGGGC GTGTTCCAAC GCGAGCACGC CCGCGCCGTT TCCGTCCTCA TCGGAGTCTT CGGCGACATC GACATCGCCG AGGAGGCCGT CGCCGACGCC TTCACCGAGG CGGTGCGGCG CTGGCCCGAC ACGGGCCTGC CGCCCAGCCC GGCCGGGTGG ATCATCACCA CCGCCCGCAA CCGGGCCATC GACCGGCTGC GACGCGAATC GGTCCGGGAC GAGAAGCACG CGCAGGCGGC GCTCGTCCAC TCCCCCGACA CCCCACCCCC GGAGGGCCCC GTGCGCGACG ACCGGCTGCG CCTCATCTTC ACCTGCTGCC ATCCGGCGCT GGCCCCGGCG GCGCGGATCG CGTTGACGCT GCGGTTGCTG GGCGGACTGT CCACCGTCGA CATCGCCCGT GCCTTCCTGG TGTCGGAGGC GACGATGGCC CAGCGGCTGG TGCGCGCCAA GGGAAAGATC CGCGACGCCC GCATCCCGTA CCGGATTCCC GACGACGCCG ACCTGCCCGA CCGGCTGCGG TCGGTGCTGA CCGTGGTGTA CCTGATCTTC AACGAGGGTT ACGCCGCCGC GACCGGCACC GACCTGACCC GCGACGACCT GTGCGTCGAG GCGATCCGGC TGGGGCGGCT GCTGGTGGAG CTCATGCCCG ACGAACCCGA GGCGGTGGGG CTGCTGGCGT TGATGCTGCT GTCGCGGTCG CGGCGGGCCG CGCGCACCGG CCCGGACGGG TCACTGGTAC CGCTGTCCGA ACAGGACCGG TCGCTGTGGG ACCGCGAGCT GGTCGCCGAG GGGCAGGGGC TGGTCCGGTT GTGCCTGCGG CGGGATCGGC CGGGTCCGTT CCAGATCCAG GCCGCCATCA ACGCCGTCCA CAGTGATGCG TCCAGTGTCG CCGACACCGA CTGGCGGCAG ATCCTGACCC TGTACGACCA GTTGACGGTG CACGCGCCCA GCCCGGTGGT CGCGCTCAAC CGGGCCGTCG CGCTGGCCGA AGTGGCCGGA GCCGAGGCGG CACTGTCCGA AGTCGAGCGG CTGGCGCTGG AGAAGCACCA CCTGTACCAC GCGATCCGCG CTGACCTGTT GCGGCGCTTG GGAAGACGGG ACGAGGCCCG CGCCGCCTAC GACGCGGCGA TCGCCCGCTC GGCCAACGCG ACCGAGCGGG CGTATCTGAG TTCGCGACGT GGCGAAATCA GCGCCGGGTG A
|
Protein sequence | MTTPPVDVAG VFQREHARAV SVLIGVFGDI DIAEEAVADA FTEAVRRWPD TGLPPSPAGW IITTARNRAI DRLRRESVRD EKHAQAALVH SPDTPPPEGP VRDDRLRLIF TCCHPALAPA ARIALTLRLL GGLSTVDIAR AFLVSEATMA QRLVRAKGKI RDARIPYRIP DDADLPDRLR SVLTVVYLIF NEGYAAATGT DLTRDDLCVE AIRLGRLLVE LMPDEPEAVG LLALMLLSRS RRAARTGPDG SLVPLSEQDR SLWDRELVAE GQGLVRLCLR RDRPGPFQIQ AAINAVHSDA SSVADTDWRQ ILTLYDQLTV HAPSPVVALN RAVALAEVAG AEAALSEVER LALEKHHLYH AIRADLLRRL GRRDEARAAY DAAIARSANA TERAYLSSRR GEISAG
|
| |