Gene Acid345_2963 details

Gene Information

Locus tag: Acid345_2963
Symbol:
ID: 4068864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3507528
End bp: 3508748
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 60%
IMG OID: 637984982
Product: citrate lyase acyl carrier protein
Protein accession: YP_592038
Protein GI: 94969990
COG category: [C] Energy production and conversion
              [G] Carbohydrate transport and metabolism
COG ID: [COG2301] Citrate lyase beta subunit
        [COG3052] Citrate lyase, gamma subunit
TIGRFAM ID: [TIGR01608] citrate lyase acyl carrier protein
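
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that adds no amino acid. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3507528
end_bp = 3508748
gene_length_bp = 1221
protein_length_aa = 406

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is the stop codon
# and does not contribute an amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1221 0 406
```

Under these assumptions the 1221 bp span is 407 codons, which gives the 406 aa protein length reported in the table.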


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.265448
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.540248
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGCAA CGGCAGAGGC GGTGGTTCGT GGCGAGTCTG GACGCGTCGG GAAAGACGTG 
CGTTCGGACC TGCACGTAGC ATTTGAGCCA CGGACATCGG GTGGGATCGA GATTGAGCTG
CAATCTCGCG TGGCGATCTA TTACGGCGAG AGCATTAAGA CACAGGCCCG CGAGGTGCTG
GCGCAATTGG GCGTGGAGCA CGCGTATTTG AAGATCACGG ATGAAGGCGC GTTACCGTTT
GTGATTGCGG CGCGGATAGA AGCGGCAGTG AAGCGGACGG GAGTCGGCAC GGCGCGGCGA
GCCTTGCCGG CGCAGATTGC ATTACCGAAA GCGAGCGCAA AGGATCGGGT GCGTCGGTCG
CGGCTGTATC TGCCAGGGAA TGAGCCGAAG TACTTCATTA ATGCTGGGCT GCATGCGCCC
GACGCGGTGA TTCTCGATCT TGAAGACTCG GTGCATGCGG CGGAAAAAGA TTCAGCGCGA
CTGGTGGTGC GGAATGCGCT GCGCGCAGTG GATTTCCTCG CGGCAGAACG GATGGTGCGG
ATTAATCAAC TTCCGATGGG ATTGGCTGAC CTCGAAGAGA TCGTGCCGCA GTCGCCGGAT
TTGATGCTGA TTCCTAAAGT CGAGACCGCG GAGCAGGTTG CGGAAGTTGA TGGGCGGATC
ACCGCGATTT GCCAGCACGA GAAGATCGAA CGGTCGATCT GGCTGATGCC GATTCTGGAG
TCGGCGCTAG GGATTGAGAA TGCGTTTGCG ATCGCGAAGG CGTCGCCACG GAATTGCGCG
CTGACGGTTG GCCTCGAAGA CTATACCGCC GATCTGGGAG TGGTGAAGAC GAAGGATGGC
GCAGAAACGT TGTATGCACG GCAGCGTCTG GTGAATGCCG CGAAGGCTGC GGGAATTACG
GCGATTGATT CGGTCTTCGG GGATGTGGGC GACATGGACG GTCTGAAGGC CTGGGGCGAA
CGTTCGGGGG CGATGGGCTT TGAAGGCATG GGCTGCATTC ATCCCCTGCA GGTGCGGGTG
ATTCACGCCG CATTTGCTCC GTCGCAACAG GAAATAGATA GGGCGCTGAA GATTGTGGCG
GCGTTTGATA ATGCGCAGCA GAAGGGGCTC GGCGTGGTCT CGCTGGGGTC GAAGATGATT
GATCCGCCGG TGGTTCATCG AGCAGTGAAA CTCGTGGACC GCGCGCGCAG TATGGGCCTG
GTGCGGGAGG AGACGAAATG A
 
Protein sequence
MSATAEAVVR GESGRVGKDV RSDLHVAFEP RTSGGIEIEL QSRVAIYYGE SIKTQAREVL 
AQLGVEHAYL KITDEGALPF VIAARIEAAV KRTGVGTARR ALPAQIALPK ASAKDRVRRS
RLYLPGNEPK YFINAGLHAP DAVILDLEDS VHAAEKDSAR LVVRNALRAV DFLAAERMVR
INQLPMGLAD LEEIVPQSPD LMLIPKVETA EQVAEVDGRI TAICQHEKIE RSIWLMPILE
SALGIENAFA IAKASPRNCA LTVGLEDYTA DLGVVKTKDG AETLYARQRL VNAAKAAGIT
AIDSVFGDVG DMDGLKAWGE RSGAMGFEGM GCIHPLQVRV IHAAFAPSQQ EIDRALKIVA
AFDNAQQKGL GVVSLGSKMI DPPVVHRAVK LVDRARSMGL VREETK