Gene Acid345_3544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3544 
Symbol 
ID4069276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4192723 
End bp4195560 
Gene Length2838 bp 
Protein Length945 aa 
Translation table11 
GC content62% 
IMG OID637985567 
Productlysine decarboxylase transcriptional regulator, CadC 
Protein accessionYP_592619 
Protein GI94970571 
COG category[K] Transcription 
COG ID[COG3710] DNA-binding winged-HTH domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0810782 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACTTC CCTCCAATGA AGCCAAAGTC GTCGCCTTTG GCCTCTTCAA AGCTGACCTC 
AACGCCCGTT CCCTGACCAA GGGAGGCGCC TCCGTGCGTC TGCAGGACCA ACCTTTCGAG
GTGCTGGGTC TTCTCCTGGA GCGTCCCGGG GAAATCGTCA GCCGGGAGGA GATCCGCCAG
CGGCTTTGGT CGTCGGATAC GTTCGTCGAG TTCGACGATG GTTTGAATAC AGCTATAAAG
AAGTTGCGCA CCGCCCTGGG AGACAGCGCC CTCAACCCGC GCTTTGTCGA AACCGTTCCC
CGGCGGGGAT ATAGGTTTAT CGCCCCAGTG TCGGTTCTGT CGCCGGTGGC CACGCCCGAA
CCTGAGTCGC AAACTGAGGA TGTGGTCATC GCCTCCCGGG AGCGCTCGCA AATCACCATT
ACCCAGCCGC ATCATCCGGT TTTCTGGAGC ACCGTCGCGC TGGGGCTGGC GATCTCGTTC
CTCGCCGGCG GGTGGTACTA CCGCTCCCGC TCCGGATCAG ACTCGGTGGT AAGCGCTGCG
CCGATCCGCA TGCGTCCGGC GGTGGCTGTT CTTGGCTTCC ATGATTTGAC CGGCCGACGA
GATACTGCGT GGCTTTCCAC CGCCATCGCC CAGATGCTTT GCACCGAACT CGGCGCTGGC
GAGCAGTTGC GCATGGTTTC CGGCGAGCAA GTCGCGCGCG CTCGCAAGGA AGTGGCGTGG
GACCGCGAAG ATACGCTGGA AAAGGCGTCA CTTACCAAGC TGCGGTCGAG ATTGGGGTCG
GATTACGTCG TAATTGGCGC GTATACCGTG GTGGATTCGC CGGCCGGTCC ACAAATCCGA
CTGGATGTGC GCCTTCAGGA CGCCCGCGCC GGTGAAACGC TTCTGGAAGA ATCCGATACC
GGCAACCAGT CCGAGCTATT TGCGTTGGTT TCGCATACCG GTCAGCGTCT GCGCGACCGG
CTTGGCATTG ACCGCGCCAA CGGCGATCAC GACGCCCAGA TTCGCGCCTC TCTGCCGAAG
AACGTTGAGG CCGCACGCCT CTACTCCGAG GGTGTCGCGC AATTGCGCGA TGGCAACGCC
GTCCAAGCCC GCGATTTGTT GACGAAAGCC ATCGCCATCG AGCCAGAGCA CGCACTTTCG
CACGCTGCGC TCGCCGAGGC CTGGACGATG CTCGGTTATG ACTCGCGCGC CAGCCAGGAA
GTCCAACTCG CGTTCAATCT CTCGCAGGAC CTTTCGCGCG AGACCAAGCT CTCCATCGAA
GGCCGCTACC GCGCATTGGC CCACGACTAT CCCAAGGCAA TTGAGATTTA TCGCAAGCTG
CGCGACCTCT ATCCCGATAA CATTGATTAC GCGCTGGTTC TCGCGCGGGT GCAGAACAAG
GGTGGGTTGC CGAAAGACGC GCTCCTGACC ATCGCGACGA TGCGGCAGAT TCCCGGGACG
GCCAGCCATG ACCCGCGCAT AGATCTCGCC GAGGCGTCGG CTGCCGAGAC GCTCAGCGAC
TTCCAGCGGA GCCTGCAGGC GGCCTCCGTC GCCGCGGTTA AGGCTGAATC CGAGGGACGC
ACGCAAGTCG TGTTGGAGGC CCGCGCCAAG CAGATCTGGG ATTACGATCG CCTCGGCGAT
TTCGATAAAG CGATGGCCGC CGCCAAGAGC AGCGTGGAAC TAGCCAAGGC CAGTGGCAAC
GAACGCGAAA TGGCCACCAT GCAACACACC ATGGGTCATC TGGAATACGA CCAGGGTGAT
CTGCCGGGAG CCGTACGCGA TTACGAGGCC GCACTTCGCG AGTTCCGCAA ACTTGGGGCG
CTGTGGGACA CCGCCTCCTG CGCCCACAAT CTCGGCGTCG TCTACCAGGA CCAGGGACTG
ACTCAACAGG CGCGCACCTA TATGGAAGAG GCCCTGCGCA TTCAGCGTGA GATCAACGAT
GAACGCGGCG TCGCCTCCGA TCTCGACGAT CTCAGCAACA TCCTGCTCAG CACCGGAGAC
CTCGACGCAG CCCTGCGCAT GAAGCAGGAG GCGCTGCAAA TCTTCCAGAA GCTCGGCAAC
CGCATGGGCG AATCAATTAC CCTCGGCAAT CTTGCCGAGG TGTACCTCGC GAAAGGCGAT
CTCGCCGCCG CCAAGGACAG CTACGACCAT TCCTACGCGC TTAAACAGCA GATCGGTTAC
AAGCGCGGCT ATGGATACAG TCTCTCCGGC ATCGCCAACA TCCTTATATT GCAGGACAAA
CTGGCCGAGG CGCAGACCAC CGCCAACCAG GCATTGGCGG CGCGCCAGGA ACAGAAAGAT
GAGAGCAACA TCCAGCTGAG CCGCGCGCAA CTCGCGGAGA TCGCCCTTGA ACAGGGCCAG
AACGATGTCT CGGTTTCGCT CGCAGGCGAG ACGACGAAAT ACTTCGACAA GCAGCATGCC
CCCGCCAGTT CGACCTACAG CTACGCGGTC CTCGCGCGCG GATTAGCGCA GCAAGGCAAA
TCCGGCGAAG CGCTAACCAA CGCCCAACGT TCACTCACGA TCTCGAAACA AACCGGGGAC
CTGATCACTA ACTTCGGCTC GCAACTCGCG CTGGCAGAAT CGCAAGCTGC CGCCGGACAG
AAGCCCGCCG CCGTGCACAC GCTGCAAGCG TTGCAATCCC GCGCACACGC CGCCGGTGTC
GTTCGGTATG AACTCGAAGC CCGGTTACGC TTAGCCGAAC TACAGGGGGA CGGCGCCTCT
GCGCAGCGTC AACTTGATGA AGTACACCGC GACGCCGCGG CACGCGGATT CCTGCTGATC
GCGAGGAAGG CGGCGGGAAG TCGGGCGCAA ATCGCGGCAG TTGCGCACGT ACCGGATATC
CCGTCTGCGC GACATTAG
 
Protein sequence
MELPSNEAKV VAFGLFKADL NARSLTKGGA SVRLQDQPFE VLGLLLERPG EIVSREEIRQ 
RLWSSDTFVE FDDGLNTAIK KLRTALGDSA LNPRFVETVP RRGYRFIAPV SVLSPVATPE
PESQTEDVVI ASRERSQITI TQPHHPVFWS TVALGLAISF LAGGWYYRSR SGSDSVVSAA
PIRMRPAVAV LGFHDLTGRR DTAWLSTAIA QMLCTELGAG EQLRMVSGEQ VARARKEVAW
DREDTLEKAS LTKLRSRLGS DYVVIGAYTV VDSPAGPQIR LDVRLQDARA GETLLEESDT
GNQSELFALV SHTGQRLRDR LGIDRANGDH DAQIRASLPK NVEAARLYSE GVAQLRDGNA
VQARDLLTKA IAIEPEHALS HAALAEAWTM LGYDSRASQE VQLAFNLSQD LSRETKLSIE
GRYRALAHDY PKAIEIYRKL RDLYPDNIDY ALVLARVQNK GGLPKDALLT IATMRQIPGT
ASHDPRIDLA EASAAETLSD FQRSLQAASV AAVKAESEGR TQVVLEARAK QIWDYDRLGD
FDKAMAAAKS SVELAKASGN EREMATMQHT MGHLEYDQGD LPGAVRDYEA ALREFRKLGA
LWDTASCAHN LGVVYQDQGL TQQARTYMEE ALRIQREIND ERGVASDLDD LSNILLSTGD
LDAALRMKQE ALQIFQKLGN RMGESITLGN LAEVYLAKGD LAAAKDSYDH SYALKQQIGY
KRGYGYSLSG IANILILQDK LAEAQTTANQ ALAARQEQKD ESNIQLSRAQ LAEIALEQGQ
NDVSVSLAGE TTKYFDKQHA PASSTYSYAV LARGLAQQGK SGEALTNAQR SLTISKQTGD
LITNFGSQLA LAESQAAAGQ KPAAVHTLQA LQSRAHAAGV VRYELEARLR LAELQGDGAS
AQRQLDEVHR DAAARGFLLI ARKAAGSRAQ IAAVAHVPDI PSARH