Gene Information |
Locus tag | Tpau_3085 |
Symbol | |
ID | 9157256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3198985 |
End bp | 3200184 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | putative RNA polymerase, sigma-24 subunit, ECF subfamily |
Protein accession | YP_003648016 |
Protein GI | 296140773 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
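The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that is consistent with the listed 1200 bp gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3198985
end_bp = 3200184
protein_length_aa = 399

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons.
remainder = gene_length_bp % 3

# Assumption: the last codon is a stop codon and is not
# counted in the protein length.
expected_protein_aa = gene_length_bp // 3 - 1

print(gene_length_bp)       # 1200
print(remainder)            # 0
print(expected_protein_aa)  # 399
```

Under these assumptions the three fields agree: 1200 bp is 400 codons, and dropping the stop codon leaves the 399 aa listed for the protein.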
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.13548 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAACGCC GCGGCCTGCA CGGACTGATG GATGCCGGCG CGACGGTGCG GGCGGAGAAC GCCGCCCGCA CCTCGTACGG TCGACTGCTG GCCCTGCTCG CGGCCCGCAC CCACGACCTC GCCGCCGCGG AGGATGCGCT CGCCGACGCC TTCGAGCGCG CCCTCACGCG GTGGCCGGTC GACGGGGTGC CCGACGATCC CGACGCCTGG CTGCTCACCG TCGCACGCAA CCGCCAGCGC GACCACTGGA AGTCGGCGGC GCAGCGCAGC ACCGTCCCGC TCGATGACGT GCACGACGGC CCCGCTCTGC CGGACCAGCC CGACCAGCGC TTCGAGCTAC TCCTCGTGTG CGCCCACCCC GACGTCGCCG ACGCCGCCGT GCCGCTCATG CTCAACACCG TGCTCGGCTT CACCGCTGAA CAGATCGGCC GCGCATTCCT CATCCCCACC GCCACCATGG CTGCGCGACT GACCCGCGCC AAGAAGCGGA TCCAGCGCGA CCAGGTGCCC TTCACGATCC CCGAGGGCGC GGAGGTCGGC GCCAGGATCG ACGCGGTCCT CGAGGCCGTG TACGGCGCGT ACAGCCTTCA GTGGCCCACG CCGCCGCGCG AGAGACACGC TCTTCTGCTC GGCCTCGTCG AGACGATCAC CGATACCGCA CCCACCGTCG CCGAAGCCCA CGGTCTCGCG GCGACGCTGT ACCTGAGCAG CGCCCGGCTC CCCGCACGGC TCGCGGGCGA CGGTGGTTTC GTTCCACTGC CGCAACAGGA TCCGCGTCGT TGGGACCGAG ATCTCATCGC CCTCGGACAC CGTCACCTGC GCGCGGCGCA CGCACTGGGC ACCGTTGGAC GATTCCAGCT CGAGGCTGCT ATCGGCGCCG TGCACTGTGC CCGGGCACCC GGCGCGGCAC CGGATTGGCG CACGCTGCAC GGGCTGTACG GCTCGCTGCA GGCGATCGCA CCGACCCGCG GTGGGGCCGT CGCGCTCGCC GCGGTCACCG GTGAGCTCGA CGGTGCCGAG GCCGGCTTGG CCGCGCTCGA CGCGATACCG GATACCGAAC GGCTGCAATC GGCCTGGGCG TTGCGGGCCC ACCTCTTACG CCGCCTCGGT GACCCGCGCG CGGAATCCGC GTACGACAAG GCGATCTCAC TGACCACAGA TCCCGGCGAG CGGGCGTACC TCGCCGCGCG GCGCGGGTAG
|
Protein sequence | MERRGLHGLM DAGATVRAEN AARTSYGRLL ALLAARTHDL AAAEDALADA FERALTRWPV DGVPDDPDAW LLTVARNRQR DHWKSAAQRS TVPLDDVHDG PALPDQPDQR FELLLVCAHP DVADAAVPLM LNTVLGFTAE QIGRAFLIPT ATMAARLTRA KKRIQRDQVP FTIPEGAEVG ARIDAVLEAV YGAYSLQWPT PPRERHALLL GLVETITDTA PTVAEAHGLA ATLYLSSARL PARLAGDGGF VPLPQQDPRR WDRDLIALGH RHLRAAHALG TVGRFQLEAA IGAVHCARAP GAAPDWRTLH GLYGSLQAIA PTRGGAVALA AVTGELDGAE AGLAALDAIP DTERLQSAWA LRAHLLRRLG DPRAESAYDK AISLTTDPGE RAYLAARRG
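The relationship between the gene sequence and the protein sequence can be sketched in a few lines. The example below is a hedged illustration rather than the method used to build this record: it translates only the first 30 bases of the gene sequence, using the amino-acid assignments of NCBI translation table 11, and treats the initial GTG as a start codon that is read as methionine. The listed gene sequence begins with GTG and ends with TAG, so it appears to be given in coding orientation already and no reverse complement is applied here, even though the gene lies on the minus strand. The helper names `translate` and `gc_fraction` are hypothetical.

```python
# First three 10-base blocks of the gene sequence, as displayed in the record.
prefix = "GTGGAACGCC GCGGCCTGCA CGGACTGATG"

# Amino-acid assignments for translation table 11, codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Simplification: the first codon is always reported as M, as is done
    for a start codon such as GTG at the beginning of a bacterial CDS.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append("M" if i == 0 else residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


peptide = translate(prefix)
print(peptide)  # MERRGLHGLM
print(gc_fraction(prefix))
```

The translated prefix matches the first ten residues of the protein sequence above. Applying the same functions to the full gene sequence would be the natural way to check the listed protein length and GC content.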