Gene Acid345_1662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1662 
Symbol 
ID4069810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2008874 
End bp2010607 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content56% 
IMG OID637983670 
Productsigma-54, RpoN 
Protein accessionYP_590737 
Protein GI94968689 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.125129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCTTC TCCAGCCTAG ACTGAATCTC AAAGTGTCTC AGAAACAGAT CCTCACCCCG 
GGTCTGGTTC AGATGGTCAG CGTCCTTGCG CTCAATAAAA TGGAGCTCAA GGAGATGATC
AATGAGGAGA TGATTGAAAA CCCGGTCCTC GAAGAACTCG ACGAAAATGT ACCCCTCATA
GACGACATCT CCGCCAAGGA AGAACAACGC GACCGCGATT CTTCCCTGGC CACCGCTGAA
GAAGCCCCCG CCACTCCTGA AGCCAAGGAC CCATTTGAAG AAATCGACTT CGGCTCCTTC
TTCCAGGAAT ACCTCGATCC GGGTTACCGC AGTTCGAGCG AAATCGAAGA CGTAGAACGT
CCTTCCTTCG AAAACTTCCT CTCCAAGCCA ACCACCCTCA CCGACCACCT CATGTGGCAG
CTTGGCTCCA TGCATTTGAA GGACGACGTT CTCGCCGCTG CCGAACTCAT CATCGGCAAC
CTCAACGATG AGGGCTATCT CACCGCCAGC GAAGACGAAC TCCTCGGCAT CTCCACCGAA
GAAGCCGGTG CCGAACCTTC CAGCGAGAGT GCGAAAGCCG TCTCCACTGA TGTTGACCAG
CAGATTTCCG CCCTCGAGAT GGCGGGCTTC GAAGCTGGAG AAAATATCGA AGTCACTGAA
GAGTCGGATT CCCATTCCGA GATCGACGGC GGCAACACTG CCGTTCAGGT TGAAACCGCC
GTCGAACCGC CGCGGCCTAC TCTGGTCCCC CCGCGTACCG CGCCAGCCGC CGTTCCGTTC
TGTCGCGACG CCCTTCGTGA AGCGATCCAT ATCATCCAGA ATCTCGATCC CGTTGGGGTT
GCGACGCAAG ACCTCCGCGA GTGCCTGCTG ATCCAGCTTC GCTACTTCGA AGGCCTGCCC
CACAAGAACG GCAACGGCCA CATCGCGCAA GCAATTGACG ACGGCATACG CATGGTCAGC
GATCACATGC ACGAACTCCA GAACAAGCAG TACAAGGAGA TCTCGAAAGC ACTCGGCCGT
CCCATCGAAT CGATCACTGC AGCGTTAGAT TTCATCCGCA CCCTCGACCC CAAGCCCGGC
CTCCGCTACA ACAAGCAGGA GACGCGTCTC ATCGAGCCCG ATGTCGCCTT CGTCAAACAA
GGTGACGAAT ACATCGTCGT CATGAACGAC GAAGAAATCC CGCAGCTCCG CGTCAATCCC
GGCTACAAGC GCCTCCTCAA TCGCGACGCC GCCGAGAAAG ACGTTCGCAA CTACGTCAAA
GAGCGCTACA AGTCAGCCAT CCAGCTCATC AAGAACATCG AGCAGCGTAA ACAAACGATC
CTCAAAGTCT GCTACTCGAT CATCAATCGC CAGCGCGACT TCCTCGACCA CGGCATTGAC
CAGCTCAAGC CGATGATGAT CAAAGAAGTC GCCGAAGAAA TTGGCGTGCA TCCTTCAACC
GTCAGCCGAG CCGTCGCCAA CAAATACGTT CACACCTCGC AAGGTGTCTA CGAACTCCGC
TACTTCTTCA GTGAAAGCGT GAATGGTCCG GAAGGCGGTG CGACATCACT GCTCATCCTC
AAGCGCCGCG TAAAGAAGCT AATCGAAGAA GAAGACCCGG CCCGCCCGCT AACCGACGAG
CAAATCACCC GCATCCTGCA ATCCCAGGGA ATTCAAGTCA CGCGCCGCAC CGTCGCCAAA
TACCGCGAAG ACATGCGAAT TCCCAGCACC CACCAGCGCC GCGTCAAAAG CTAG
 
Protein sequence
MVLLQPRLNL KVSQKQILTP GLVQMVSVLA LNKMELKEMI NEEMIENPVL EELDENVPLI 
DDISAKEEQR DRDSSLATAE EAPATPEAKD PFEEIDFGSF FQEYLDPGYR SSSEIEDVER
PSFENFLSKP TTLTDHLMWQ LGSMHLKDDV LAAAELIIGN LNDEGYLTAS EDELLGISTE
EAGAEPSSES AKAVSTDVDQ QISALEMAGF EAGENIEVTE ESDSHSEIDG GNTAVQVETA
VEPPRPTLVP PRTAPAAVPF CRDALREAIH IIQNLDPVGV ATQDLRECLL IQLRYFEGLP
HKNGNGHIAQ AIDDGIRMVS DHMHELQNKQ YKEISKALGR PIESITAALD FIRTLDPKPG
LRYNKQETRL IEPDVAFVKQ GDEYIVVMND EEIPQLRVNP GYKRLLNRDA AEKDVRNYVK
ERYKSAIQLI KNIEQRKQTI LKVCYSIINR QRDFLDHGID QLKPMMIKEV AEEIGVHPST
VSRAVANKYV HTSQGVYELR YFFSESVNGP EGGATSLLIL KRRVKKLIEE EDPARPLTDE
QITRILQSQG IQVTRRTVAK YREDMRIPST HQRRVKS