Gene Acid345_4453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4453 
Symbol 
ID4070936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5285124 
End bp5286371 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content62% 
IMG OID637986492 
ProductRNA polymerase ECF-subfamily sigma factor 
Protein accessionYP_593527 
Protein GI94971479 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTCA GGGATGTCCC CACAGCAGTC GAAGAAGTGT ATCGCTCCGA GTGGGGGCGT 
GTCGTCGCGA CCCTCATTGG AATGCTCGGC GGAGACTTTG ATTTGGCCGA GGAAACCGCG
CAGGAGGCCT TTGCCGCCGC CGTAACCCAG TGGGAGAAAG ACGGCGTTCC TGAGTACCCA
CGGGCTTGGA TCATCTCTAC CGCGCGCCAC AAGGCCATCG ACCGGATCCG CCGCAAGGCT
AAATTTGAAG AGGAACTCGA GCCGCGCCTC GAACAGGGAA CTCTCGACGT CGCGACCCCC
CCACAGGAGT ACACCGCCGA GATTCCTGAC GACCGACTGC GCCTGATCTT CACTTGCTGC
CACCCGGCGC TGGCCTTGGA CGCGCAGATC GCACTTACAT TGCGCACGCT TTGCGGACTC
GAGACCGAAG AGATCGCGCG CGCCTTTCTC GTGCCAGTGC CGACGATGGC GCAACGAGTG
GTCCGCGCCA AGAGCAAGAT TCGTGACGCC GGCATTCCGT ATGCCGTCCC CGAGACGAGC
CAGATGGCGG AGCGCCTCGA TGCGGTGTTG CACGTGATTT ACCTGGTCTT CAACGAGGGC
TACTCAGCGT CGTCCGGCGA GTCGCTCACG CGCGCCGATC TTTCTGAAGA AGCGATTCGG
TTGGCGCGCA TCGTCGTAGA GTTGCTGCCC GATCCCGAAG CGCTGGGCTT GCTTTCGCTG
ATGCTGTTGC ATGAATCGCG CCGCGCCGCA CGCACCTCCG AAGACGGTGA CATGATCCTG
CTCAACGATC AGGACCGCAC GCTGTGGGAC CGCGCACTGA TCGCCGAAGG TACTGCGTTA
GTCGAGCGTT CCTTCGCGCT GCGGCGTCCC GGGCCTTACT CGATTCAAGC TGCCATCGCC
GCGGTCCATG CCGACTCTCC AACTCCCGAT GCCACCGACT GGCGCCAGAT CGTCGCTCTC
TACGATCTCT TGTTGCAGGT TGTCGCCTCG CCCGTCATCG AATTGAATCG CGCTGTAGCG
GTGGCCATGC GAGACGGTGC TCCGGCCGGC CTTGCCGTAA TCGATACGAT TCTCGCTCGC
GGTGATCTCG CGAACTATCA CCTCGCATAC TCCGCCCGCG CCGACATGCT GCGTCGTATC
GGCAAAAAAT CCGAAGCCCG TAAAGCTTAC GAACGTGCCC TGGCGCTGAC CCAGCAGGCA
CCGGAACAGA GATTTTTGAG GAAGAGAATT GCGGAAGTAT CTGGGTGA
 
Protein sequence
MSLRDVPTAV EEVYRSEWGR VVATLIGMLG GDFDLAEETA QEAFAAAVTQ WEKDGVPEYP 
RAWIISTARH KAIDRIRRKA KFEEELEPRL EQGTLDVATP PQEYTAEIPD DRLRLIFTCC
HPALALDAQI ALTLRTLCGL ETEEIARAFL VPVPTMAQRV VRAKSKIRDA GIPYAVPETS
QMAERLDAVL HVIYLVFNEG YSASSGESLT RADLSEEAIR LARIVVELLP DPEALGLLSL
MLLHESRRAA RTSEDGDMIL LNDQDRTLWD RALIAEGTAL VERSFALRRP GPYSIQAAIA
AVHADSPTPD ATDWRQIVAL YDLLLQVVAS PVIELNRAVA VAMRDGAPAG LAVIDTILAR
GDLANYHLAY SARADMLRRI GKKSEARKAY ERALALTQQA PEQRFLRKRI AEVSG