Gene Caul_1870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1870 
Symbol 
ID5899325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2005527 
End bp2006756 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content66% 
IMG OID641562360 
Product3-methyl-2-oxobutanoate dehydrogenase (2-methylpropanoyl-transferring) 
Protein accessionYP_001683497 
Protein GI167645834 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACC GTCCTGCCCT CCGCCTCCAC ATCCCCGAAC CCCAAGCCCG CCCGGGTGAC 
CAGCCCGCGT TCGATCGCGC GCTCATTCCT GAGCCGGGCG ACACCCGCCG GCCCAAGACG
GCGGCCGCCG AGGCCGACAT GCGGGACCTG CCGTATGGTC TCGTGCGGGT GCTCAACGAC
GCCGGCGAGG CTTCAGGACC CTGGAACCCG AACCTGCCGG TCGAGACGCT TTTGGCCGGT
CAGCGGGCGA TGTTGCTGAC CCGCGCCTTC GACGAACGCC TGTTCCGCGC GCACCGTCAG
GGCAAGACCA GTTTCTACAT GAAGTCGACC GGCGAGGAGG CCATCGGCGC GGCCCAGTCC
CTGTTCCTGG ATCGCGACGA CATGTGTTTC CCGACCTATC GGGTCCTGAG TTGGCTGATG
GCGCGGAACT ATCCGCTGAT CGACCTGTGC AATCAGATCT TCTCAAACGC CAATGATCCC
TTGAAGGGCC GACAGCTGCC GATCCTGTAT TCGGCTCGCA AGTACGGCTT CTATTCGCTG
TCGGGCAACG TCGGCAGCCG TTTTGGCCAC GCGGTCGGCT GGGCCATGGC ATCGGCGTTC
AAGGGCGGGG ATTCGATCGC CTTGGCCTAT ATCGGCGAAG GCACCACGGC CGAGGGCGAC
TTTCACGAGG CGCTCACCTT CGCCAGCGTC TATCGCGCGC CGGCCATCCT GTGCGTCACC
AACAATCAGT GGGCCATTTC CAGCTTCTCC GGCATCGCCG GCGCCAACGA GACCACGTTC
GCGGCCAAGG CGCTGGCCTA CGGCCTGCCG GGCCTGCGGG TGGACGGCAA CGACTTTCTG
GCCGTCTGGG CGGCGACCGA ATGGGCGGCG GAGCGGGCGA GGCTGAACCT CGGGGCGACC
CTGATCGAAC TCTACACCTA CCGTGCATCC GGGCATTCGA CGTCGGATGA CCCGACCAAA
TACCGCCCGG CGGACGAGGC CGAGGCCTGG CCTCTGGGCG ACCCGGTCGA GCGGCTGAAG
ACCCATCTGA TACGGCTCGG CGCCTGGGAT GAGGAACGCC ATGCCGCCCT GATCGCCGAG
CTCGACGCCG AGGTTCGCGC CGCCGTCAAG GAAGCCGAGG CGGTCGGCAC GCTCGGCAAG
TCCAAGCCGA GCGTCAAGGA GATGTTCGAG GGCGTCTTCA AGGATCCTGA TTGGCGCGTC
ACCGAACAGC GCCGCGAGCT GGGGATCTGA
 
Protein sequence
MKNRPALRLH IPEPQARPGD QPAFDRALIP EPGDTRRPKT AAAEADMRDL PYGLVRVLND 
AGEASGPWNP NLPVETLLAG QRAMLLTRAF DERLFRAHRQ GKTSFYMKST GEEAIGAAQS
LFLDRDDMCF PTYRVLSWLM ARNYPLIDLC NQIFSNANDP LKGRQLPILY SARKYGFYSL
SGNVGSRFGH AVGWAMASAF KGGDSIALAY IGEGTTAEGD FHEALTFASV YRAPAILCVT
NNQWAISSFS GIAGANETTF AAKALAYGLP GLRVDGNDFL AVWAATEWAA ERARLNLGAT
LIELYTYRAS GHSTSDDPTK YRPADEAEAW PLGDPVERLK THLIRLGAWD EERHAALIAE
LDAEVRAAVK EAEAVGTLGK SKPSVKEMFE GVFKDPDWRV TEQRRELGI