Gene Acid345_0153 details


Gene Information

Locus tag: Acid345_0153
Symbol:
ID: 4069738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 161506
End bp: 163062
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 59%
IMG OID: 637982153
Product: pyruvate carboxylase subunit A
Protein accession: YP_589232
Protein GI: 94967184
COG category: [I] Lipid transport and metabolism
COG ID: [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit
TIGRFAM ID: [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit
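
The start, end, and length fields above are mutually consistent, which a short calculation can confirm. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 161506
end_bp = 163062

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1557
print(protein_length)  # 518
```

Under these assumptions the results match the reported gene length of 1557 bp and protein length of 518 aa.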


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.27286
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00223112
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCAAAGC TCTTCAACAA AATCCTTATC GCCAACCGCG GCGAGATTGC GGTCCGCGTT 
ATTCGCGCCT GTCGCGAGAT GGGGATCGAG TCGGTCGTCG TCTATTCCGA CGCGGATCGC
CGCGCCTTGC ATGTTCGGAA GGCCGACTAC GCGTATCACA TTGGGCCGTC TGCGGCGAGC
GAGTCGTATC TGCGCATTGA CAAGATTCTC GATGTAGCGA AGAAGAGCGG GGCGGAGGCG
ATCCATCCCG GCTACGGATT CCTTTCGGAA AATGCACGGT TCGCGCGGGC GTGCGTCGCC
GCCGGAGTGA AGTTCATTGG CCCGACAGGT GACGCGATGG ATGCCATGGG ATCGAAGACG
AAAGCACGCC AGGCAATGGC CAAGGCGAAC GTTCCCATGG TGCCTGGCAA CGCCGAGGGC
CTGAAGAGCG CAGAAGAGGC TGAAGTCCTC GCTGAGCAAA TTGGATATCC GGTCATGCTC
AAGGCTGCAG CGGGTGGTGG TGGCAAGGGC ATGCGGATGG TGAAGTCGCG TGGGGAACTG
CGCTCGGCGC TGGAAGCCGC GAGCAGTGAG GCGCAGCGCT CCTTCGGCGA CAGCGAAGTC
TATCTTGAGA AATTCATCGT CAATCCGCGG CATATCGAGA TGCAGATCTT CGCGGATGAG
CATGGCAACA CGGTATGGCT TGGCGAGCGC GAATGCTCAG TACAGCGCCG CCATCAGAAA
GTCCTCGAAG AATCGCCATC ACCGATCGTA GATCCCGATA TGCGGCGGCG AATGGGAGAG
GTCGCGGTCC GCGTGGCACA GGTGGCGAAT TACACCAACG CGGGTACCGT CGAGTTCCTG
GTTGATGAGC AAAAGAACTT CTACTTCCTC GAAATGAACA CGCGGCTGCA AGTCGAACAT
CCGGTGACGG AACTGGTGAC CGGCATGGAT CTTGTCCATC TGCAGATTCG CGTGGCCTCC
GGCGAACTTC TACCATTCAA GCAGGAAGAT CTCTTGCTCC GCGGGCATGC GATCGAGTGC
CGCGTTTACG CGGAAGATCC GGATAACCAG TTCTTCCCGT CTCCCGGGCG GATCACTCGA
CTGATTTCAC CCTCCGGTCC GGGGATTCGT CGCGATAGCG GAATGTACGA AGGCTGGACC
GTACCGCTCG ACTACGATCC GTTGCTCGCT AAGCTAATTG CGTACGGCAG CGATCGCGAA
CAATGCATTC ATCGTTTGCA GCGCGCTTTG TACGAGTATT TCGTCGGCGG AATTAAGACG
AATATTCCAC TATTCCGACG CATTCTGAGC GACGCAGATT TTCAAGCCGG AAAGCTGCAC
ACCGGCTTCC TGGACCGGTT GTTATCGGAG CCGCATTTGC CGGCGCCGGA ACATGAGGAA
CGCACGCGCA TGGCTGCCAT CGCTGCTGCG ATCTTTGCCA GCACCGAGCC GCAAGTGACA
CTGGGTAAAC CGTTCGATGG AGTGACGTAC GCAGCGGCGG CGAAGCCAGT CAACGGTGCC
GCGAATGGCG AGAAGTCGGA ATGGAAATTC CAGGGACGCA CGGAGGCGCT TCGCTAA
 
Protein sequence
MAKLFNKILI ANRGEIAVRV IRACREMGIE SVVVYSDADR RALHVRKADY AYHIGPSAAS 
ESYLRIDKIL DVAKKSGAEA IHPGYGFLSE NARFARACVA AGVKFIGPTG DAMDAMGSKT
KARQAMAKAN VPMVPGNAEG LKSAEEAEVL AEQIGYPVML KAAAGGGGKG MRMVKSRGEL
RSALEAASSE AQRSFGDSEV YLEKFIVNPR HIEMQIFADE HGNTVWLGER ECSVQRRHQK
VLEESPSPIV DPDMRRRMGE VAVRVAQVAN YTNAGTVEFL VDEQKNFYFL EMNTRLQVEH
PVTELVTGMD LVHLQIRVAS GELLPFKQED LLLRGHAIEC RVYAEDPDNQ FFPSPGRITR
LISPSGPGIR RDSGMYEGWT VPLDYDPLLA KLIAYGSDRE QCIHRLQRAL YEYFVGGIKT
NIPLFRRILS DADFQAGKLH TGFLDRLLSE PHLPAPEHEE RTRMAAIAAA IFASTEPQVT
LGKPFDGVTY AAAAKPVNGA ANGEKSEWKF QGRTEALR
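
The relationship between the gene sequence and the protein sequence can be checked by translating codons directly. The sketch below is an illustration, not the database's own method. It builds a codon table from the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard genetic code (the two tables differ only in the start codons they allow). The function name `translate` and the variable names are hypothetical. The example translates only the first and last lines of the gene sequence shown above.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)

# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCAAAGC TCTTCAACAA AATCCTTATC GCCAACCGCG GCGAGATTGC GGTCCGCGTT"
last_line = "GCGAATGGCG AGAAGTCGGA ATGGAAATTC CAGGGACGCA CGGAGGCGCT TCGCTAA"

n_terminus = translate(first_line)
c_terminus = translate(last_line)
print(n_terminus)  # MAKLFNKILIANRGEIAVRV
print(c_terminus)  # ANGEKSEWKFQGRTEALR
```

Both fragments agree with the start and end of the protein sequence listed above. The last line can be translated on its own because every preceding line holds 60 bases, which keeps it in frame.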