Gene Caul_2167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2167 
Symbol 
ID5899622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2350360 
End bp2352033 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content67% 
IMG OID641562658 
Productelectron-transferring-flavoprotein dehydrogenase 
Protein accessionYP_001683793 
Protein GI167646130 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins)
[COG2440] Ferredoxin-like protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.068758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0582717 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAG AGCTCGAACG CGAGTCGATG GAATACGACG TCGTCATCGT CGGCGGCGGA 
CCGGCGGGCC TGTCGGCCGC CATCCGGCTC AAGCAACTGG CGGCCCAGGC GGGGACCGAG
GTCTCGGTCG CGGTGCTGGA GAAGGGTTCC GAGGTCGGCG CGCACATCCT GTCGGGCGCG
GTGATCGACC CCAAGGGCCT GGCCGAGCTG TTCCCCGACT GGAAGGAACG CGGCGCGCCG
CTCGAGACGC CGGTCACCAA GGACGTCTTC CGGCTTCTGG GGCCGTCGGG CGACATGGCG
CTGCCGATGT TCGCCATGCC GCCGTTCATG CACAACCACG GCTGCTACAT CGCCTCCTTG
GGCAATGTCT CGCGTTGGCT GGCCGCCCAG GCCGAGGAAC TGGGCGTCGA GATCTATCCG
GGCTTCGCGG CCTCGGACCT GGTCTGGAAC GCGGACGGCT CGGTCAAGGG CGTCGTCGTC
GGCGTGGTGG GCGTGGCCAA GGACGGCCAT CACAAGCCCG ACTACAATCC CGGCATGGAA
CTGCACGGCA AGTACGTGTT CATCGCCGAG GGCGTGCGCG GCTCGCTGGC CAAGCAACTG
ATCGCCAAGT TCGACCTGAG CGCCGGCAAG TCGCCGCAGA AGTTCGGCAT CGGCATCAAG
GAACTGTGGC AGGTGCCGCC CGAGAAGCAT CAGCCGGGCC TGGCCGAGCA CACCACCGGC
TGGCCGCTGG ACAACCAGAC CGGCGGCGGC AGCTTCATGT ACCACTTCGG GGACAACTAC
GTGGCCATCG GCTACGTGGT GCACCTGAAC TACAAGAACC CCTGGCTGTC GCCGTTCGAC
GAGTTCCAGC GCTTCAAGCA CCATCCGTCG GTCAAGCCGC ACCTGGAAGG CGGCAAGCGC
ATCGCCTACG GCGCCCGGGC CATCACCGAG GGCGGCTATC AGTCGGTGCC GAAACTGACC
TTCCCGGGCG GGGCGCTGAT CGGCTGCTCG GCCGGCTTCG TCAACGTGCC GCGCATCAAG
GGCAGCCACA ACGCCATGAA GACCGGCATG TTGGCCGCCG ACGCCGCCTT CGCCGCGCTC
GCCGCCGGCC GGGCCAGCGA CGAGCTGCTC GCCTACCAGT CCGCCTACGA AGGCTCGTGG
GTCGCCAAGG AACTCAAGAT CGTCCGCAAC GCCAAGCCGC TGCTGGGCAA GTTCGGCACC
GCCCTGGGCG GGGCGCTGGG CATGTTCGAC ATGTGGGTCA ACCACCTGAC CGGCGGCTTC
TCGTTCTTCG GCACGATGAA GCACGAGAAG ACCGACGCGG CCTCGACCGG CCTGGCCAAG
GACTACAAGC CGCTGGTCTA TCCCAAGCCC GACGGGGTGA TCAGCTTCGA CAAGCTGAGC
TCGGTGTTCA TCTCGGCCAC CAACCACGAG GAAGACCAAC CGGCCCACCT GACGCTGAAG
GATCCGTCGA TCCCGATCGC GGTGAACCTG CCCAAGTACG GCGAACCGGC CCGGCTCTAC
TGCCCCGCCG GCGTCTACGA GGTGCTCTAT AACGAGCAGG GCGCCGATCC GCGCTTCCAG
ATCAACGCCC AGAACTGCGT CCACTGCAAA ACCTGCGACA TCAAGGATCC GTCGCAGAAC
ATCGTCTGGA CGACGCCCGA GGGCGGCGGC GGACCCAACT ATCCGAACAT GTAG
 
Protein sequence
MSEELERESM EYDVVIVGGG PAGLSAAIRL KQLAAQAGTE VSVAVLEKGS EVGAHILSGA 
VIDPKGLAEL FPDWKERGAP LETPVTKDVF RLLGPSGDMA LPMFAMPPFM HNHGCYIASL
GNVSRWLAAQ AEELGVEIYP GFAASDLVWN ADGSVKGVVV GVVGVAKDGH HKPDYNPGME
LHGKYVFIAE GVRGSLAKQL IAKFDLSAGK SPQKFGIGIK ELWQVPPEKH QPGLAEHTTG
WPLDNQTGGG SFMYHFGDNY VAIGYVVHLN YKNPWLSPFD EFQRFKHHPS VKPHLEGGKR
IAYGARAITE GGYQSVPKLT FPGGALIGCS AGFVNVPRIK GSHNAMKTGM LAADAAFAAL
AAGRASDELL AYQSAYEGSW VAKELKIVRN AKPLLGKFGT ALGGALGMFD MWVNHLTGGF
SFFGTMKHEK TDAASTGLAK DYKPLVYPKP DGVISFDKLS SVFISATNHE EDQPAHLTLK
DPSIPIAVNL PKYGEPARLY CPAGVYEVLY NEQGADPRFQ INAQNCVHCK TCDIKDPSQN
IVWTTPEGGG GPNYPNM