Gene Acid345_0794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0794 
Symbol 
ID4068575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp982769 
End bp984943 
Gene Length2175 bp 
Protein Length724 aa 
Translation table11 
GC content60% 
IMG OID637982801 
Productlysine decarboxylase transcriptional regulator, CadC 
Protein accessionYP_589873 
Protein GI94967825 
COG category[K] Transcription 
COG ID[COG3710] DNA-binding winged-HTH domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGAAA CCGCCAATCG CCAGCAGTTG GTCCGCTTCG GGCCGTACTG CGCAAACCGG 
CAAACCGGTG AACTCTACAA GTTCGGCACC CGGCTAAAAA TTCAAGACCG GCCGTTTGAG
ATCCTGTCCA TCCTGCTGGA ATCTCCCGGC GAACTAGTCA CTCGCGAGGA TCTTCGCACC
CGCTTGTGGC CGGACGGCAC CTTTGTCGAC TTCGATAACA ACATCAGCAC CGCCGTGCAC
AAGCTGCGAA GCGTTCTCTC CGACTCGGCC GACGCCCCGC GATATATCGA GACCGTAGGA
GGCCGCGGGT ATCGCTTTAT CGGTACTGTC GAAAGCGACG TCCCTGCCGA TTCCGAATTG
GCTGCCTCGG TCCAGGCTCC CGCCAATAAA CCTGAAACCG ACTCGACGAG TGCGAGGGTT
GTCTCCGTAA TAGTGAAGAA GCCGGTCGCT CTCACCACGG CTCCGGCCAG CGTCGAAACC
CGATCACGTA GATGGTGGCT GCTCATCCCC GCACTTGTCT TGCTCGGATT GATCGCCGCA
GCAGCGTGGA TCTACTTCAC CCCACCCGCC GAACCACGCG TGGTAAAGAT CGTGCAGCTC
ACGCGCTCGG GCACGTTGCA TCCAAACCAA AACGTTGTCA CCGATGGTCC GCGCATCTAT
TTTCTTGATC GCACGGGTCG CAACTTCGTG TTGAAGAGCA TGGCCGCCGG TGGCGGATCT
CCCACCCCGG TGCAGATCGG GCTTCCGCAC TACGACGTCC AGGACATTTC CCACGACAAC
TCCGAATTGC TGCTCCGTGA GCTTACGGAG GAGCACGAAC TGCGACAGCT ATGGATTGCC
GATATTGTCG GAGGCTCGGC GAGGCGAGTA GGCGATGTGG TCGCGATGGG AGCTACGTTC
TCACCCGACG GACAATCCTT GCTCTATTCA CAGGACGGCA AAGTACTGAC CTGCGATCGC
CAGGGTGGCA ATGCTCGCGT ACTGTTCGAG ACCCCAGGCG AGGTGACCCG CATGCGCTGG
GCGCCCGCTG GCGACGTGTT GCGCATGGTC ATAATCAACT CAAGCGGCGT CGGAACCGCT
CTCTGGGAAA CCCGGCCAGG AGACTGGAAG TTGCGACGCG TGCTGTCCGA TCGAGCAATG
CAGGACGAAC CCTGGACGCT GAGTTGGAGC GCGGATAGCC AGTGGCTTGC GTTCAGCGCA
GTCACGCCCA GTGGTCGAGA GATCTGGATG CTTGGCCGCG ACCACTGGTG GAAACATGCG
CCGCCGCGTC CCACCCGACT CATCGGCGGC CCCATCGATT TTGATCTGCC GACATTCAAC
AAGAACGACA ACACTCTCTA CGCGGTGGGA TCGCACCGGC GAGGGCAATT GCTGCGCTAC
GACGACGCCT CGCACAACTT CGTCCCTTAC CTGGGCGGTC CGTCGGTGGA TCAACTCGAC
TTCACGCACG ACGGACAGTG GGTCACGTAT GTCACCTACC CCGAACACAT CCTCTGGAGA
TCACGAGTTG ACGGGACTGA CGCACAGAAA CTCACAGAGG CTCCCATGCG TGCCTTCGTG
CCAAAGTGGT CGCCGGATGG ACAGTCCATC GCCTTCCGCG CAAACTTTGC CGCGGGTAAA
CCGTGGCAAA CCTATGTGAT TTCGTCAAAG GGCGGAGTTG CCCAACTCGT TGCGAGTGAG
CTATCGAACG TCGGCGGTCC GGGCTGGACT GCGGACGGCC ATGGCTTGAT CTTAGTCGGA
GAACACGACG AGAGTTTGCG TATCTTCGAT CTTCACTCGG GAAGCAGCAA GTTCATTCCA
AATTCGCAAC ATATGTCGGA CCCCGTCGTA TCGCCAAGCG GACGATACCT TCTCTCCGGC
AATCCCGACG ACAACCGCAT CGATGTCCTC GACCTGCAGA CCGGCCAAAG ACGCCACCTC
GCCGACCAGG TGGATTACCC CGCATGGTCG CGCGACGAGA AGTTCGTCTA CTTCAATCGC
TTCTCAGCGT GGTCTCCCGC GATGTACCGA ATTCGAGTGG CCGACGGCGT TGTCGAAAAA
CTCTTCGATC TCGACGGCTT CGCGGCAACC GGCAGCTGGT CCACCTGGAG CGCACTGGCG
CCGGATGGCT CGGTGCTCCT GCTGCGCGAT CTTGGAGGAA CTGATCTCTA CGCGATTCAG
TGGGAAGCGC ACTGA
 
Protein sequence
MGETANRQQL VRFGPYCANR QTGELYKFGT RLKIQDRPFE ILSILLESPG ELVTREDLRT 
RLWPDGTFVD FDNNISTAVH KLRSVLSDSA DAPRYIETVG GRGYRFIGTV ESDVPADSEL
AASVQAPANK PETDSTSARV VSVIVKKPVA LTTAPASVET RSRRWWLLIP ALVLLGLIAA
AAWIYFTPPA EPRVVKIVQL TRSGTLHPNQ NVVTDGPRIY FLDRTGRNFV LKSMAAGGGS
PTPVQIGLPH YDVQDISHDN SELLLRELTE EHELRQLWIA DIVGGSARRV GDVVAMGATF
SPDGQSLLYS QDGKVLTCDR QGGNARVLFE TPGEVTRMRW APAGDVLRMV IINSSGVGTA
LWETRPGDWK LRRVLSDRAM QDEPWTLSWS ADSQWLAFSA VTPSGREIWM LGRDHWWKHA
PPRPTRLIGG PIDFDLPTFN KNDNTLYAVG SHRRGQLLRY DDASHNFVPY LGGPSVDQLD
FTHDGQWVTY VTYPEHILWR SRVDGTDAQK LTEAPMRAFV PKWSPDGQSI AFRANFAAGK
PWQTYVISSK GGVAQLVASE LSNVGGPGWT ADGHGLILVG EHDESLRIFD LHSGSSKFIP
NSQHMSDPVV SPSGRYLLSG NPDDNRIDVL DLQTGQRRHL ADQVDYPAWS RDEKFVYFNR
FSAWSPAMYR IRVADGVVEK LFDLDGFAAT GSWSTWSALA PDGSVLLLRD LGGTDLYAIQ
WEAH