Gene Caul_3607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3607 
Symbol 
ID5901062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3890239 
End bp3891528 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content70% 
IMG OID641564118 
Productgluconate 2-dehydrogenase (acceptor) 
Protein accessionYP_001685232 
Protein GI167647569 
COG category[C] Energy production and conversion 
COG ID[COG2010] Cytochrome c, mono- and diheme variants 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.105717 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTGAAAC GTGTCCTCAT CGGATTGCTG GCGGTCGTGG CCGTAGGGCT GGTCGGCTTC 
GGCGTCTTCG CCTGGCGTCC CGCGATCGGC AAGATCGCGC CGCCCCCGCC ATCGGCCTTC
TCGCCGGACC TGGTCGCGCG GGGCGAGGTT CTCGCGGGGG CCGGCTACTG TTCGACCTGC
CACACCACCA AGGGCGGCCA GCCGTTCGCC GGCGGCTATC CGATGAAGAC CAGTTTCGGC
GTGATCTATT CGACCAACAT CACCCCCGAC GCCAAGACCG GCATCGGAAC CTGGTCGGAG
GCCGCGTTCC GCCGGGCGAT GCATCAGGGC GTGGCCCGCA ACGGCGCGCA CCTGTTTCCG
GCCTTCCCCT ACGATCATTT CACCAAGCTG TCCGACGCCG ACGTCTCGGC GCTCTACGCC
TACATGATGA CCCGCCCGGC GGTGGTCGCC CCGGCCAAGC GCAACGGCAT ACCGTTCCCG
CTCAACATCC GCGCTCTGCA GGCGGGCTGG AAGCTGTTGT TCTTCAAGCC TGGACGCTTC
GTGCCGGACA GCGGCAAGAG CGCCGAATGG AACCGCGGCG CCTATCTGGC CCAGGGCGTC
AGCCACTGCG GCGCCTGCCA CACGCCGCGC GGGGCGCTGG GGGCCGAAAA GCGCGACAAG
GCCTTCGCCG GGGCTCCGAT CGACAACTGG ATCGCCCCGC CCCTGACCGC CGCCAATCCC
TCGCCGGTCG CCTGGGACCA GGCCGAACTG GTCGCCTATC TGCGCACCGG CGTCAGCCTC
TATCACGGCG TCGCCGCCGG CCCGATGGCG CCGGTGGTCC ACGATGGACT CGTCAGATTG
CCGGACGCCG ACATCCAGGC CCTGGCGACC TATTTCGTGG CCGTCGACGG GGCGGCGAGC
CGGAGCGCTA GCCTGGCGAC GGCCCTGCAA AAGGCGGCGA CGGCCGACCG GCTGAACGTC
GGGACCCAGA TCGACCCGGC CGCGCGGCTC TACACCGCCG CCTGCGCCTC CTGCCACTAT
AACGGCGGCG GTCAGCCCAA CCCGCTGCGG CCGGACCTGG CGCTCAACAG CGCGGTCAGT
CTGGATGACC CGACCAACCT GATCCGGGTG GTGCTCTACG GGGTCAGCGC CAGGGACGGC
GCGCCGGGCG TGGTGATGCC CAGCTTCAAC CGGTTCAGCG ACGCCGACGT CGCGACGTTG
GCGGCCTATC TGCGCGCCAC CCGCACCGAC AAGCCGGCCT GGCCGAAACT GACCGACAAG
GTCGCGGCGA TCCGCGCGCA GGGAAGGTGA
 
Protein sequence
MLKRVLIGLL AVVAVGLVGF GVFAWRPAIG KIAPPPPSAF SPDLVARGEV LAGAGYCSTC 
HTTKGGQPFA GGYPMKTSFG VIYSTNITPD AKTGIGTWSE AAFRRAMHQG VARNGAHLFP
AFPYDHFTKL SDADVSALYA YMMTRPAVVA PAKRNGIPFP LNIRALQAGW KLLFFKPGRF
VPDSGKSAEW NRGAYLAQGV SHCGACHTPR GALGAEKRDK AFAGAPIDNW IAPPLTAANP
SPVAWDQAEL VAYLRTGVSL YHGVAAGPMA PVVHDGLVRL PDADIQALAT YFVAVDGAAS
RSASLATALQ KAATADRLNV GTQIDPAARL YTAACASCHY NGGGQPNPLR PDLALNSAVS
LDDPTNLIRV VLYGVSARDG APGVVMPSFN RFSDADVATL AAYLRATRTD KPAWPKLTDK
VAAIRAQGR