Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4453 |
Symbol | |
ID | 4070936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5285124 |
End bp | 5286371 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637986492 |
Product | RNA polymerase ECF-subfamily sigma factor |
Protein accession | YP_593527 |
Protein GI | 94971479 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTCA GGGATGTCCC CACAGCAGTC GAAGAAGTGT ATCGCTCCGA GTGGGGGCGT GTCGTCGCGA CCCTCATTGG AATGCTCGGC GGAGACTTTG ATTTGGCCGA GGAAACCGCG CAGGAGGCCT TTGCCGCCGC CGTAACCCAG TGGGAGAAAG ACGGCGTTCC TGAGTACCCA CGGGCTTGGA TCATCTCTAC CGCGCGCCAC AAGGCCATCG ACCGGATCCG CCGCAAGGCT AAATTTGAAG AGGAACTCGA GCCGCGCCTC GAACAGGGAA CTCTCGACGT CGCGACCCCC CCACAGGAGT ACACCGCCGA GATTCCTGAC GACCGACTGC GCCTGATCTT CACTTGCTGC CACCCGGCGC TGGCCTTGGA CGCGCAGATC GCACTTACAT TGCGCACGCT TTGCGGACTC GAGACCGAAG AGATCGCGCG CGCCTTTCTC GTGCCAGTGC CGACGATGGC GCAACGAGTG GTCCGCGCCA AGAGCAAGAT TCGTGACGCC GGCATTCCGT ATGCCGTCCC CGAGACGAGC CAGATGGCGG AGCGCCTCGA TGCGGTGTTG CACGTGATTT ACCTGGTCTT CAACGAGGGC TACTCAGCGT CGTCCGGCGA GTCGCTCACG CGCGCCGATC TTTCTGAAGA AGCGATTCGG TTGGCGCGCA TCGTCGTAGA GTTGCTGCCC GATCCCGAAG CGCTGGGCTT GCTTTCGCTG ATGCTGTTGC ATGAATCGCG CCGCGCCGCA CGCACCTCCG AAGACGGTGA CATGATCCTG CTCAACGATC AGGACCGCAC GCTGTGGGAC CGCGCACTGA TCGCCGAAGG TACTGCGTTA GTCGAGCGTT CCTTCGCGCT GCGGCGTCCC GGGCCTTACT CGATTCAAGC TGCCATCGCC GCGGTCCATG CCGACTCTCC AACTCCCGAT GCCACCGACT GGCGCCAGAT CGTCGCTCTC TACGATCTCT TGTTGCAGGT TGTCGCCTCG CCCGTCATCG AATTGAATCG CGCTGTAGCG GTGGCCATGC GAGACGGTGC TCCGGCCGGC CTTGCCGTAA TCGATACGAT TCTCGCTCGC GGTGATCTCG CGAACTATCA CCTCGCATAC TCCGCCCGCG CCGACATGCT GCGTCGTATC GGCAAAAAAT CCGAAGCCCG TAAAGCTTAC GAACGTGCCC TGGCGCTGAC CCAGCAGGCA CCGGAACAGA GATTTTTGAG GAAGAGAATT GCGGAAGTAT CTGGGTGA
|
Protein sequence | MSLRDVPTAV EEVYRSEWGR VVATLIGMLG GDFDLAEETA QEAFAAAVTQ WEKDGVPEYP RAWIISTARH KAIDRIRRKA KFEEELEPRL EQGTLDVATP PQEYTAEIPD DRLRLIFTCC HPALALDAQI ALTLRTLCGL ETEEIARAFL VPVPTMAQRV VRAKSKIRDA GIPYAVPETS QMAERLDAVL HVIYLVFNEG YSASSGESLT RADLSEEAIR LARIVVELLP DPEALGLLSL MLLHESRRAA RTSEDGDMIL LNDQDRTLWD RALIAEGTAL VERSFALRRP GPYSIQAAIA AVHADSPTPD ATDWRQIVAL YDLLLQVVAS PVIELNRAVA VAMRDGAPAG LAVIDTILAR GDLANYHLAY SARADMLRRI GKKSEARKAY ERALALTQQA PEQRFLRKRI AEVSG
|
| |