Gene Acid345_4558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4558 
Symbol 
ID4071503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5404860 
End bp5405831 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content62% 
IMG OID637986598 
Productlysine decarboxylase transcriptional regulator, CadC 
Protein accessionYP_593632 
Protein GI94971584 
COG category[K] Transcription 
COG ID[COG3710] DNA-binding winged-HTH domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0557135 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGGA GCACGACAAA CATCACGCCC CACATCTTCA TGCGCTCTCC GCGCCAACGG 
TCTGGCTGCT TCGGTTCCAC GCGTCCATTC TACGGCGTTG TAAACTTTGT TCGAATGTCG
GCAACAACGG GAACGCCGGA GCGTCGCGCC AAGTTCGGGC TTTACGAAGC CAACTTCGCC
AGCGGCGAGC TGCGCAAGAA CGGCGCCAAG CTGAAATTGC AGGAGCAGCC GTTTCAGGTG
CTGGCGCTGC TACTCGACCG CGCCGGGCAA GTCGTAACGC GAGAAGAGAT CCAGCAAAAG
CTCTGGCCCG CCGACACGTT CGTTGACTTC GATCACAGCC TCAACACGGC GATCAACAAA
TTGCGCGACG TGCTCGGCGA CACGGCATCG AACCCGCGCT TCATCGAGAC GCTCCCGCGT
CGCGGCTATC GCTTCATCGC CCCCGTCGAA TGGAATGAAC CTTCTGTCAT CCCGAGCGGA
GCCGCGCCAA GCGCGACGGA GTCGAAGGAC CCCTCTCGTC AAACCGAACC GTCAATCGCC
ACGATCGCGG AGCCCGAGCT TCCGCAAGCC GATCACCGCA TCACCCGGCT GCTCTTCGGC
GGATTGCAGC TGATGTACCT GATCTTCTAC GCCTGCGGCT TGTGGCGGCT CGACGAGATC
GGCGACATCG CCGAAGACCA CTTCCACGTT GCCGGTGGCA AAATCGAAGC GCTCGTGCTT
GTCACGGCAG TCATCGGGAT CGCGCTGCGC TTGTACACGC TGAGCGCGAC GGGCTTCAAC
TACCGCTTGC TCGGCGCGAA GTTCCGTAAG CTGATTCCGC TGGTCTATGT GCTCGACCTG
ATCTGGGCAT TCTCGCCCTT CCTGCTGCAG CCGCGCATCG GCCTTGGGCT GGCGTTTGCC
GCGTGCGCGG CACTGCTGTA TTCGCCGTTT GCGCAGCGTA TCCTGGCGAT GATCACGTGG
CCTGCGAAGT AG
 
Protein sequence
MMRSTTNITP HIFMRSPRQR SGCFGSTRPF YGVVNFVRMS ATTGTPERRA KFGLYEANFA 
SGELRKNGAK LKLQEQPFQV LALLLDRAGQ VVTREEIQQK LWPADTFVDF DHSLNTAINK
LRDVLGDTAS NPRFIETLPR RGYRFIAPVE WNEPSVIPSG AAPSATESKD PSRQTEPSIA
TIAEPELPQA DHRITRLLFG GLQLMYLIFY ACGLWRLDEI GDIAEDHFHV AGGKIEALVL
VTAVIGIALR LYTLSATGFN YRLLGAKFRK LIPLVYVLDL IWAFSPFLLQ PRIGLGLAFA
ACAALLYSPF AQRILAMITW PAK