Gene Caul_3935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3935 
Symbol 
ID5901397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4259376 
End bp4260533 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content68% 
IMG OID641564456 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001685558 
Protein GI167647895 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0946647 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.167315 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCTGA ACCTCCCAAA CCAAGACCCC GACGCCGCCG ATCCTGATGC GAATTGGGGC 
CTGCCCGGCT GGCTCTATGA CAACGCCCGC TTCTTCCGCG AGGAACAGGA CAAGGTCCTG
CGTCCGTCGT GGCAGATCGT CTGCCATCTG AACGATATCC CCAAACCCGG CGACTTCCAC
ACCTTCGACT TTCTGGGCGA GAGCACGATC GTCGTGCGCG GCAAAGACGG CGGGGCGCGG
GCCTTCGCCA ATGTCTGCCG GCACCGGGCG GCGCGGCTGC TGGACGGCCC CAGCGGCCAT
TGCGCGCGCG TGGTCTGCCC CTACCACGCC TGGACCTACG ACCTGGACGG CCGGCTGATC
GGCGTGCCGC ATCGCGAGAC TTACCCTGCC TTGAAGATGG AAGAACAGGG GCTCCACACG
GTGCAGGTCG AGGTCTATCG CGGCTTCGTC TTCGTGCGGC TCGAGGGCGA CGGCCCCAGC
GTGGCCGAGA TGATGGCGCC CTACGACCAC GAGCTGGAGC CCTATCGCTT CGAGGACATG
GTCCCGTTCG GCCGCGTCAC CCTGCGACCG CGCGCGGTGA ACTGGAAGAA CATCAGCGAC
AACTATTCAG ACGGCCTGCA CATCCCGGTC GCCCATCCCG GCCTGACCCG GCTGTTCGGG
CGGGGCTATG GCGTCGAGGC CGCCCCCTGG GTCGACAAGA TGTGGGGCCA GCTGATCGAG
GAGCCGTCGC GCAATCCGTC CGAGCGGATG TACCAGCGGG TGCTGCCCGA CGCCCTCCAC
CTGCCGCCCG AGCGCAAGCG GCTGTGGACC TATTTCAAGC TCTGGCCCAA CCAGGCGTTC
GACATCTATC CTGACCAGGT GGACTTCATG CAGTTCATCC CGGTCTCGCC CACCCAGACG
ATGATCCGCG AGATCGCCTA CGCCCTGCCC GACGACCGGC GGGAGATGAA GGCGGCGCGG
TACCTCAACT GGCGCATCAA CCGCCAGGTC AACGCCGAGG ACACGCAGCT GGTGGCCCGC
GTGCAGCAGG GCATGGCCTC GCGGAGCTTC ACGGCCGGGC CGCTGGCGGA CTCGGAGGTC
AGCTTGCGGA GCTTCGGCCG CAAGATGCGG GCGCTGATCC CGGAAGCGCG GCTGCACCGG
CCGCCGGAGG GGTGGTGA
 
Protein sequence
MDLNLPNQDP DAADPDANWG LPGWLYDNAR FFREEQDKVL RPSWQIVCHL NDIPKPGDFH 
TFDFLGESTI VVRGKDGGAR AFANVCRHRA ARLLDGPSGH CARVVCPYHA WTYDLDGRLI
GVPHRETYPA LKMEEQGLHT VQVEVYRGFV FVRLEGDGPS VAEMMAPYDH ELEPYRFEDM
VPFGRVTLRP RAVNWKNISD NYSDGLHIPV AHPGLTRLFG RGYGVEAAPW VDKMWGQLIE
EPSRNPSERM YQRVLPDALH LPPERKRLWT YFKLWPNQAF DIYPDQVDFM QFIPVSPTQT
MIREIAYALP DDRREMKAAR YLNWRINRQV NAEDTQLVAR VQQGMASRSF TAGPLADSEV
SLRSFGRKMR ALIPEARLHR PPEGW