Gene Caul_0587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0587 
Symbol 
ID5898042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp642215 
End bp644257 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content67% 
IMG OID641561069 
Productdehydrogenase E1 component 
Protein accessionYP_001682218 
Protein GI167644555 
COG category[C] Energy production and conversion 
COG ID[COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit
[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGGGA CGAACCCACG CGCGACAAAG GAAGACGCCG CGAAAGCTGC GTTTCTGTCC 
GAGATGTTTG GCAAGATTTG CTTTGTGCGC GCCTTTGAGG AGGAGGCGCT GCGACTGACT
CAGGCGAACC CGCCGCGCGT GGCCGGTTCG ATGCACCTCT GCGCAGGACA AGAGGTCGTA
CCCGTGGCGG CCATGGAGGC CTTGGGGGAC GAAGACCAGG TCGTCTGCAC CTACCGGGGA
CATGGTTGGG CGCTGGCGGC GGGTCTCGAT CCGGAGGCGG TCATGGCGGA GATCTGCCAG
CGGTCGACCG GCTTGAACGG CGGGCGCGCG GGGTCCGCCT ATATGATGGC CCCGCACACC
CGTTTCATTG GCGAGAACTC AATCGTCGGC GCCGGCACGA CGATCGCGTG CGGCGTGGCG
ATGGCTAACC GCCTACGAGG CCGGGACAAC GTCGTCATGG TCACCATCGG CGATGGCGCG
ATGAATCAGG GCTCCGTCCA CGAGGCGATG GCTTTCGCTG CGGTGCGAAA GCTGCCTGTC
ATCTTCGTGG TTGAAAATAA CGGCTGGTCC GAACTCACGC CGACGTCGGA CATGTTCCAC
GCTGAGCGAC TGGCGGTGCG AGGCAAAGCG TATGGCATCC CATCCGCCAC CATTTCCGGA
ACCGATCCGG TGGTGGTGCG CGACAGCTTC GCCATGGCGG CCGCTCATGC GCGGGCCGGC
AATGGACCGT CGCTCATCGA GTGCACGGTT CCTCGGCTGT GGGGGCACTA CAATCGGGAT
ATCGAGCACT ACCGGTCCAA GGCCGATCGC GCCGAGGCCA CCGCGCGTGA TCCCTTAGTC
CTGCTTGCCG CCCGCCTCCA GCAGGACGGC GTCATGACTG ACGACGAGGT TGCGGCGATC
CGGAAGTCGC AAGAGGACGC CGCGCGCGCA TTGGTCTTGC GGGTCATGGC CTCCCCGGCG
CCCAGCCCGG CCGACGCACT CCAACCGATC CACGGCCAAA CGACGGAGGA TCGAAAAGCG
CGGGCGCCCG AATCCCGTTC GATGAGTTAC GTCGAGGCGG TGAACGCCGC GCTCCGCGCC
GAGCTGGAGG AAGACGAACG CACCGTCCTC TATGGCGAGG ATGTCGGTAA GAGCGGCGGC
ATCTTTGCAG CAAGCCGTTA TCTGCAACGC GACTTCGGCG CAGACCGCGT ATTCGACACG
CCGATCGCGG AAAACGCGAT CCTAGGCTCG GCGGTGGGCG CGGCCCTTGG CGGCCTGAAA
CCCATCGTTG AGATCATGTG GGCCGACTTC ATCTTTGTCG CCCTCGACCA GCTGGTGAAC
CAGGCCGCAA ACGTCCGCTA TATCACTGCG GGCAAGTCCA GCGTGCCGCT GGTCGTGCGG
ACCCAGCAAG GGGCGACGCC GGGCTCCTGC GCGCAGCATT CGCAATCGAT CGAAGCTATT
CTCGCCCACG TGCCGGGTCT CAAGGTCGCC TTGGCGGCGA CCCCGCACGA CGCTTATACG
CTGCTGCGAG CGGCCGCTGC CGATCCCGAT CCTTGCGTGG TGATCGAAGC GCGCGCGCTC
TATGCCGACA AGGGCGAAGT GGAGATCGCT GCGACTGCGG AACCCGCGGG CCGCGCACGG
TTGCGCCGCT CCGGCGCCGA CCTCGCCATT ATCACCTGGG GGACCATGGT CGGCCCCGCC
TTGGCGGCGG CCGAGCGCCT GGCCGCGGCT GGATGCGACA CGGCCGTTTT GGATCTTCGC
TGGCTGGCGC CCCTCGACGA GGCCGCTCTC CTGGAGGTCG TGCGCAAGGC GGGCGGCCGG
GTCCTGGTGG TGCACGAGGC CGTAAGGACC GGTGGCTTCG GGGCTGAAAT CGTCGCCCGC
CTACACGAAG CCCTGACTGG CGAGATGGCG TTGCGCATCC GGCGGGTAAC GACGCCCGAC
ACGCGGATAC CTGCGGCGCC GTCGCTTCAG GCAGCCCTCA TCCCGGATGC CGACAGCATC
ATCGCCGCCG CTCTCGCCCT GACCGGCAAG CCGTACGATG TGACCCAGGA GACCGTCGCA
TGA
 
Protein sequence
MPGTNPRATK EDAAKAAFLS EMFGKICFVR AFEEEALRLT QANPPRVAGS MHLCAGQEVV 
PVAAMEALGD EDQVVCTYRG HGWALAAGLD PEAVMAEICQ RSTGLNGGRA GSAYMMAPHT
RFIGENSIVG AGTTIACGVA MANRLRGRDN VVMVTIGDGA MNQGSVHEAM AFAAVRKLPV
IFVVENNGWS ELTPTSDMFH AERLAVRGKA YGIPSATISG TDPVVVRDSF AMAAAHARAG
NGPSLIECTV PRLWGHYNRD IEHYRSKADR AEATARDPLV LLAARLQQDG VMTDDEVAAI
RKSQEDAARA LVLRVMASPA PSPADALQPI HGQTTEDRKA RAPESRSMSY VEAVNAALRA
ELEEDERTVL YGEDVGKSGG IFAASRYLQR DFGADRVFDT PIAENAILGS AVGAALGGLK
PIVEIMWADF IFVALDQLVN QAANVRYITA GKSSVPLVVR TQQGATPGSC AQHSQSIEAI
LAHVPGLKVA LAATPHDAYT LLRAAAADPD PCVVIEARAL YADKGEVEIA ATAEPAGRAR
LRRSGADLAI ITWGTMVGPA LAAAERLAAA GCDTAVLDLR WLAPLDEAAL LEVVRKAGGR
VLVVHEAVRT GGFGAEIVAR LHEALTGEMA LRIRRVTTPD TRIPAAPSLQ AALIPDADSI
IAAALALTGK PYDVTQETVA