Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1662 |
Symbol | |
ID | 4069810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2008874 |
End bp | 2010607 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637983670 |
Product | sigma-54, RpoN |
Protein accession | YP_590737 |
Protein GI | 94968689 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.125129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGCTTC TCCAGCCTAG ACTGAATCTC AAAGTGTCTC AGAAACAGAT CCTCACCCCG GGTCTGGTTC AGATGGTCAG CGTCCTTGCG CTCAATAAAA TGGAGCTCAA GGAGATGATC AATGAGGAGA TGATTGAAAA CCCGGTCCTC GAAGAACTCG ACGAAAATGT ACCCCTCATA GACGACATCT CCGCCAAGGA AGAACAACGC GACCGCGATT CTTCCCTGGC CACCGCTGAA GAAGCCCCCG CCACTCCTGA AGCCAAGGAC CCATTTGAAG AAATCGACTT CGGCTCCTTC TTCCAGGAAT ACCTCGATCC GGGTTACCGC AGTTCGAGCG AAATCGAAGA CGTAGAACGT CCTTCCTTCG AAAACTTCCT CTCCAAGCCA ACCACCCTCA CCGACCACCT CATGTGGCAG CTTGGCTCCA TGCATTTGAA GGACGACGTT CTCGCCGCTG CCGAACTCAT CATCGGCAAC CTCAACGATG AGGGCTATCT CACCGCCAGC GAAGACGAAC TCCTCGGCAT CTCCACCGAA GAAGCCGGTG CCGAACCTTC CAGCGAGAGT GCGAAAGCCG TCTCCACTGA TGTTGACCAG CAGATTTCCG CCCTCGAGAT GGCGGGCTTC GAAGCTGGAG AAAATATCGA AGTCACTGAA GAGTCGGATT CCCATTCCGA GATCGACGGC GGCAACACTG CCGTTCAGGT TGAAACCGCC GTCGAACCGC CGCGGCCTAC TCTGGTCCCC CCGCGTACCG CGCCAGCCGC CGTTCCGTTC TGTCGCGACG CCCTTCGTGA AGCGATCCAT ATCATCCAGA ATCTCGATCC CGTTGGGGTT GCGACGCAAG ACCTCCGCGA GTGCCTGCTG ATCCAGCTTC GCTACTTCGA AGGCCTGCCC CACAAGAACG GCAACGGCCA CATCGCGCAA GCAATTGACG ACGGCATACG CATGGTCAGC GATCACATGC ACGAACTCCA GAACAAGCAG TACAAGGAGA TCTCGAAAGC ACTCGGCCGT CCCATCGAAT CGATCACTGC AGCGTTAGAT TTCATCCGCA CCCTCGACCC CAAGCCCGGC CTCCGCTACA ACAAGCAGGA GACGCGTCTC ATCGAGCCCG ATGTCGCCTT CGTCAAACAA GGTGACGAAT ACATCGTCGT CATGAACGAC GAAGAAATCC CGCAGCTCCG CGTCAATCCC GGCTACAAGC GCCTCCTCAA TCGCGACGCC GCCGAGAAAG ACGTTCGCAA CTACGTCAAA GAGCGCTACA AGTCAGCCAT CCAGCTCATC AAGAACATCG AGCAGCGTAA ACAAACGATC CTCAAAGTCT GCTACTCGAT CATCAATCGC CAGCGCGACT TCCTCGACCA CGGCATTGAC CAGCTCAAGC CGATGATGAT CAAAGAAGTC GCCGAAGAAA TTGGCGTGCA TCCTTCAACC GTCAGCCGAG CCGTCGCCAA CAAATACGTT CACACCTCGC AAGGTGTCTA CGAACTCCGC TACTTCTTCA GTGAAAGCGT GAATGGTCCG GAAGGCGGTG CGACATCACT GCTCATCCTC AAGCGCCGCG TAAAGAAGCT AATCGAAGAA GAAGACCCGG CCCGCCCGCT AACCGACGAG CAAATCACCC GCATCCTGCA ATCCCAGGGA ATTCAAGTCA CGCGCCGCAC CGTCGCCAAA TACCGCGAAG ACATGCGAAT TCCCAGCACC CACCAGCGCC GCGTCAAAAG CTAG
|
Protein sequence | MVLLQPRLNL KVSQKQILTP GLVQMVSVLA LNKMELKEMI NEEMIENPVL EELDENVPLI DDISAKEEQR DRDSSLATAE EAPATPEAKD PFEEIDFGSF FQEYLDPGYR SSSEIEDVER PSFENFLSKP TTLTDHLMWQ LGSMHLKDDV LAAAELIIGN LNDEGYLTAS EDELLGISTE EAGAEPSSES AKAVSTDVDQ QISALEMAGF EAGENIEVTE ESDSHSEIDG GNTAVQVETA VEPPRPTLVP PRTAPAAVPF CRDALREAIH IIQNLDPVGV ATQDLRECLL IQLRYFEGLP HKNGNGHIAQ AIDDGIRMVS DHMHELQNKQ YKEISKALGR PIESITAALD FIRTLDPKPG LRYNKQETRL IEPDVAFVKQ GDEYIVVMND EEIPQLRVNP GYKRLLNRDA AEKDVRNYVK ERYKSAIQLI KNIEQRKQTI LKVCYSIINR QRDFLDHGID QLKPMMIKEV AEEIGVHPST VSRAVANKYV HTSQGVYELR YFFSESVNGP EGGATSLLIL KRRVKKLIEE EDPARPLTDE QITRILQSQG IQVTRRTVAK YREDMRIPST HQRRVKS
|
| |