Gene Acid345_2962 details

Gene Information

Locus tag: Acid345_2962
Symbol:
ID: 4068863
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3505921
End bp: 3507531
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 59%
IMG OID: 637984981
Product: citrate lyase, alpha subunit
Protein accession: YP_592037
Protein GI: 94969989
COG category: [C] Energy production and conversion
COG ID: [COG3051] Citrate lyase, alpha subunit
TIGRFAM ID: [TIGR01584] citrate lyase, alpha subunit
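
The coordinates and lengths in this record agree with each other, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a CDS length that includes one stop codon; both conventions are inferred from the numbers in the record, not stated by it. The function names `span_length` and `coding_codons` are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3505921
end_bp = 3507531
gene_length_bp = 1611
protein_length_aa = 536


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def coding_codons(cds_length_bp: int) -> int:
    """Amino-acid count, assuming the CDS ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1611
print(coding_codons(gene_length_bp))   # 536
```

Under these assumptions, 3507531 - 3505921 + 1 gives the 1611 bp gene length, and 1611 / 3 - 1 gives the 536 aa protein length.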


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0727414
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.222216
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGCG ATTCGAAAAT GGTATTGGAT GGCTCAGCTA CGGCGCTTGC TGAAGATGAA 
ACCGAGTTTG TTCGGAATGC CGCAGGGCGA CTCGTACCTA CCGTTGTGAA TGGCGTGCCG
CAGGTGCCAT TCCTTGGAGT GGGGAAGTAT CGGCCAGAAG GACGCAAGGC CGCGCCGCCG
GTGCGGAGCG CTGCAGACTA TCCAGAGGAT GGCGATAAAC GCGTTGCCGA TATCGAGACT
GCGCTGCACA AGTGCGGCAT TCGCGACGGG ATGACGATTT CGTCGCACCA TCATTTGCGT
GATGGAGATC GCGTGGCGCT CGAAGTCCTG CAAACAGCAG GACGCATGGG CGTGAAGAAC
CTGCGCTGGT TCCCGAGCGC GTCGTTCCCT GCGCAAGCGC CAGTGATCGA ACTGATGAAG
TCCGGTGTGG TTCATCACAT CGAGGGCAGC ATGAATGGCC CGCTCGGTGA CTATTGCACC
CAGGGCAACA TGGCCGGATT GGGCGTACTT CGCTCGCATG GTGGTCGCTT CCAGGCGATC
CAGGATGGCG AGGTGCACAT TGATATCGCG GTGATTGCGG CGCCTTCGGC CGATATGTTC
GGTAACGCCG ATGGTTCCCA CGGCAAGAGC GCCTGCGGAT CACTGGGCTT TGCGCTGGCT
GATTCGATGT ATGCCGACCG CGTGATCGTG GTGACCGACA ATCTCGTGCA GTTTCCGTGC
GTGCCGTGGC AGATCCAGGG CAACAACGTG GATTACGTCG TCGAGGTGCC GAGCATTGGC
GATCCGGCCA AGATCGTGAG CGGTACCACA CAGATCACGC GCTCGCCTGA TCGGCTGCGC
ATCAGCGAAT TGGTTGCGCA CTTCATGAAG GCGTCGGGGA TTTTGCGCAA CGGGCTCTCG
TTTCAAGCGG GGGCAGGTGG TATCGCGCTG GCGTTCGTGC AGTATCTGAA GCCGATGATG
AAAGAGGCGG GCATTAAAGC CAGCTTCGTG CGAGGCGGAT CGACGAAGTA TCTCGTCGAA
ATGCTGGAAG AAGGCCTGAC GGAATACATC CTCGATGGCC AGACGTTCGA TCTCGACGCG
GTGCGCTCGA TCGCTTCGAA TCCGCGGCAC GTAGCCACGT CGCCATTCAC CTCGTACAAC
TATCACGGCA AGGGCAACTT CGCTTCGATG GTGGACGCGT GCATCCTGGG CGCAACCGAA
GTGGACGTGA ACTTCAATGC GAATGTCGTT ACCCACTCGG ATGGACGGCT GCTGCATGGC
ATCGGAGGCT GGCAGAATTG TCTCGCCTCG CGATGCACGA TCCTCGCACT ACCGGCGTTC
CGCGACCGGA TTCCGGTCGT GGTGGATGAG GTTACGACCC TGACCGGGCC TGGCGAGTTG
ATTGATGTCG TCGTGACCGA GCGAGGGATC TGCATCAATC CGCGGCGGAC CGACCTGATT
GAGTCGGTGA AAGACTCCGA GTTGCCAGTC CTCGACATAC GAGAGCTAAA GAAAGAAGTC
GAAACGATTT GTGGTGGCAT ACCTGAGAAA ACGAAACCAA GCGATCAGCC TGTGGCCGTT
GTGAAGTGGG TGGACGGCAC CGTACTCGAC ACAGTTTGGA AGACTTATTA A
 
Protein sequence
MSGDSKMVLD GSATALAEDE TEFVRNAAGR LVPTVVNGVP QVPFLGVGKY RPEGRKAAPP 
VRSAADYPED GDKRVADIET ALHKCGIRDG MTISSHHHLR DGDRVALEVL QTAGRMGVKN
LRWFPSASFP AQAPVIELMK SGVVHHIEGS MNGPLGDYCT QGNMAGLGVL RSHGGRFQAI
QDGEVHIDIA VIAAPSADMF GNADGSHGKS ACGSLGFALA DSMYADRVIV VTDNLVQFPC
VPWQIQGNNV DYVVEVPSIG DPAKIVSGTT QITRSPDRLR ISELVAHFMK ASGILRNGLS
FQAGAGGIAL AFVQYLKPMM KEAGIKASFV RGGSTKYLVE MLEEGLTEYI LDGQTFDLDA
VRSIASNPRH VATSPFTSYN YHGKGNFASM VDACILGATE VDVNFNANVV THSDGRLLHG
IGGWQNCLAS RCTILALPAF RDRIPVVVDE VTTLTGPGEL IDVVVTERGI CINPRRTDLI
ESVKDSELPV LDIRELKKEV ETICGGIPEK TKPSDQPVAV VKWVDGTVLD TVWKTY
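
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any length or GC calculation. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the gene sequence is used as sample input, so its GC fraction is not expected to match the whole-gene value reported in the record.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied as displayed (six blocks of ten).
first_row = (
    "ATGAGCGGCG ATTCGAAAAT GGTATTGGAT "
    "GGCTCAGCTA CGGCGCTTGC TGAAGATGAA"
)

seq = clean_sequence(first_row)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this row only
```

Passing the full gene sequence text through `clean_sequence` should yield a string whose length can be compared against the Gene Length field.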