Gene Caul_4838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4838 
SymbolsdhA 
ID5902300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5236105 
End bp5237892 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content68% 
IMG OID641565358 
Productsuccinate dehydrogenase flavoprotein subunit 
Protein accessionYP_001686456 
Protein GI167648793 
COG category[C] Energy production and conversion 
COG ID[COG1053] Succinate dehydrogenase/fumarate reductase, flavoprotein subunit 
TIGRFAM ID[TIGR01812] succinate dehydrogenase or fumarate reductase, flavoprotein subunitGram-negative/mitochondrial subgroup
[TIGR01816] succinate dehydrogenase, flavoprotein subunit, E. coli/mitochondrial subgroup 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCCT ACAAGTTCAT CGACCACAAG TTCGACGTCG TCGTCGTCGG CGCCGGCGGT 
TCGGGGCTCC GCGCCGCGCT GGGCTGCGCC CAGGCCGGGC TGAAGACCGC CTGCGTGACC
AAGGTGTTCC CGACGCGCAG CCACACGGTC GCCGCCCAGG GTGGGATCTC GGCCTCGCTG
GGCAACATGG GCCAGGACGA CTGGCGCTGG CACATGTTCG ACACCGTCAA GGGCTCGGAC
TGGCTGGGCG ACCAGGACGC CATCGAATAT CTGACCCGCA ACGCGCCGGC CGCGGTCTAT
GAGCTGGAGC ACTGGGGCGT GCCGTTCTCG CGCACCGTGG ACGGCAAGAT CTATCAGCGC
GCCTTCGGGG GCATGACCAA GAACTTCGGC GAAGGCCCGA TCCAGCGCAC CTGCGCGGCG
GCCGACCGCA CCGGTCACGC CATGCTGCAC ACGATGTACG GCCAGTCCCT GGCCCACGAC
ACCGAGTTCT TCATCGAGTA TTTCGCCCTC GACCTGATCA TGGAAGACGG CGTCTGCCGG
GGCCTCACCG CCTGGAAGCT GGACGACGGC ACCCTGCACC GGTTCCAGGC TCAGATGGTC
ATCCTGGCCA CCGGCGGTTA CGGCCGCGCC TACTTCTCGG CCACCTCGGC CCACACCTGC
ACGGGCGACG GCAACGCCAT GGCCCTGCGC GCCGGCCTGC CGCTGCAGGA CATGGAATTC
GTGCAGTTCC ACCCCACCGG CATCTACGGC GCCGGCTGCC TGATCACCGA AGGCGCCCGC
GGCGAAGGCG GCTACCTGAC CAATTCGGAA GGCGAGCGCT TCATGGAGCG CTATGCGCCG
TCCGTGAAGG ACCTGGCCCC GCGCGACATG GTCAGCCGGG CCATGACCAT CGAGATTCGC
GAAGGGCGCG GCGTGGGCCC CAACAAGGAC CACATCTTCC TGCACCTGGA CCATCTGGAC
CCGAAGATCC TGCACGAGCG CCTGCCCGGC ATCTCCGAGA CCGCCAAGGT CTTCGCCGGC
GTTGACGTGA CCAAGGCGCC GATCCCGGTG CTGCCGACCG TCCACTACAA CATGGGCGGC
ATCCCGACGA ACTATCACGG CGAGGTCGTC ACCAAGTCGG GCGACAATCC CGACCAGGTG
ATCCCCGGCC TGATGGCCGT GGGCGAGGCG GCCTGCGTCT CGGTGCACGG CGCCAACCGC
CTGGGCTCCA ACAGCCTGAT CGACCTGGTG GTGTTCGGTC GCGCCGCGGC CCTGCGCTGC
GCCGAGATTC TCAAGCCCCT GGCCACCCAG CCGGAGCTGA AGGACAGCAT GACCGACGGC
CACCTGGCGC GCTTCGACCG CTACCGCAAC GCCAATGGCC ATCAGCCGAC CGCCGCCCTG
CGCCTCGAAA TGCAGAAGGC CATGCAGGAA GACGCCGCGG TGTTCCGCAC GGGCGAAACC
CTGGTCGGCG GCGCCCAGCG CCTGGCCGTG GTCTGGGAAA AGGCCAAGGA CATCAAGGTC
AACGACCGCG GGATGGTGTG GAACACCGAC CTGATGGAGA CCCTGGAGTT CGACAACCTG
ATCGGCCAGG CCGTCGTGAC CGTGGCCGGG GCGGTCAACC GCACCGAAAG CCGCGGCGCC
CACGCCCGCG AGGACTTCTC CACGCGGGAC GACGCCAACT GGATGAAGCA CACCCTGGCC
TGGCTCGACC CGTCGACCGG CCAGGTGAAG ATCGATTTCC GGCCGGTGCA CAACTACACC
ATGTCCAAGG ACATCGACTA CATCCCGCCC AAGCAGCGCG TGTACTGA
 
Protein sequence
MSAYKFIDHK FDVVVVGAGG SGLRAALGCA QAGLKTACVT KVFPTRSHTV AAQGGISASL 
GNMGQDDWRW HMFDTVKGSD WLGDQDAIEY LTRNAPAAVY ELEHWGVPFS RTVDGKIYQR
AFGGMTKNFG EGPIQRTCAA ADRTGHAMLH TMYGQSLAHD TEFFIEYFAL DLIMEDGVCR
GLTAWKLDDG TLHRFQAQMV ILATGGYGRA YFSATSAHTC TGDGNAMALR AGLPLQDMEF
VQFHPTGIYG AGCLITEGAR GEGGYLTNSE GERFMERYAP SVKDLAPRDM VSRAMTIEIR
EGRGVGPNKD HIFLHLDHLD PKILHERLPG ISETAKVFAG VDVTKAPIPV LPTVHYNMGG
IPTNYHGEVV TKSGDNPDQV IPGLMAVGEA ACVSVHGANR LGSNSLIDLV VFGRAAALRC
AEILKPLATQ PELKDSMTDG HLARFDRYRN ANGHQPTAAL RLEMQKAMQE DAAVFRTGET
LVGGAQRLAV VWEKAKDIKV NDRGMVWNTD LMETLEFDNL IGQAVVTVAG AVNRTESRGA
HAREDFSTRD DANWMKHTLA WLDPSTGQVK IDFRPVHNYT MSKDIDYIPP KQRVY