Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1834 |
Symbol | |
ID | 9155984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 1917851 |
End bp | 1918843 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | RNA polymerase, sigma 70 subunit, RpoD subfamily |
Protein accession | YP_003646791 |
Protein GI | 296139548 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.918452 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACACC GCTTCAGCAC CACAGCCCCC GCAGCCGAGC GTGTCCGGCC CGAGTTGACG GAGGCCGATC TCGACGCTCA GAGCCCCGCA GCAGATCTGG TGCGGGTGTA CCTGAACGGC ATCGGCCGCA CCGCCCTGCT CACCGCGGAG GACGAGGTCG AATTGGCCAA GCGCATCGAG GCGGGGCTCT ACGCCCAGCA CCTGCTCGAT ACCAAGAAGC GGCTGTCGGC GTCGAAGAAG CGTGATCTGG CGTTCATCGT GCGCGACGGC CAGGCGGCCC GGCAGCACCT GCTGGAAGCC AACCTGCGCC TGGTGGTGTC GTTGGCCAAG CGCTACACCG GCCGCGGCAT GCCGCTGCTG GACCTGATCC AGGAGGGCAA CCTCGGACTC ATCCGCGCGA TGGAGAAGTT CGACTACGCC AAGGGATTCA AGTTCTCGAC CTACGCCACC TGGTGGATCC GGCAGGCCAT CACCCGCGGT ATGGCCGACC AGTCGCGCAC CATCCGGCTC CCCGTCCATC TCGTCGAGCA GGTCAACAAG CTGGCCCGCA TCCGCCGTGA GCTGCATCAG CAGCTGGGCC GCGAAGCCAC GGACGCCGAA CTGGCCGCCG AGTCCGGCAT TCCGGCGGAG AAGATCGCCG ACCTGATGGA CCACTCGCGC GACCCGGTGA GCCTGGACAT GCCCGTCGGC TCGGACGAAG AGGCCCCGCT GGGCGACTTC ATCGAGGATG CAGAGGCCGC GTCGGCCGAG TCCGCGGTGA TCTCCACGCT CATGCACAGC GACGTCCGGT CGGTGCTCGC CACCCTGGAC GAGCGGGAGC AGCAGGTGAT CCGGCTGCGG TACGGTCTCG ACGACGGTCA GCCGCGCACC CTCGACCAGA TCGGCAAGCT GTTCGGCCTG TCCCGTGAGC GGGTGCGCCA GATCGAGCGC GAGGTCATGA GCAAGCTCCG CAACGGCGAG CGCGCCGACC GGTTGCGCGC CTACGCCAGC TAA
|
Protein sequence | MAHRFSTTAP AAERVRPELT EADLDAQSPA ADLVRVYLNG IGRTALLTAE DEVELAKRIE AGLYAQHLLD TKKRLSASKK RDLAFIVRDG QAARQHLLEA NLRLVVSLAK RYTGRGMPLL DLIQEGNLGL IRAMEKFDYA KGFKFSTYAT WWIRQAITRG MADQSRTIRL PVHLVEQVNK LARIRRELHQ QLGREATDAE LAAESGIPAE KIADLMDHSR DPVSLDMPVG SDEEAPLGDF IEDAEAASAE SAVISTLMHS DVRSVLATLD EREQQVIRLR YGLDDGQPRT LDQIGKLFGL SRERVRQIER EVMSKLRNGE RADRLRAYAS
|
| |