Gene Caul_1917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1917 
Symbol 
ID5899372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2058332 
End bp2059708 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content64% 
IMG OID641562407 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001683544 
Protein GI167645881 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGACC GGACACGGAC GACGCTTTCC GATGGAACCA CCATCGACGA CTTGATCAAC 
GTGCAGACAC GAGAGGTTCA ACTTCGCACC ATCTCGGACC CGGAACTGTA CGACATCGAG
ATGCGCAAGG TGTTCGGAAG GACCTGGCTG CTTCTCGGCC ATGAGAGCGA GATTCCGAAC
TCGGGCGACT TCATCACGCG TCTGATGGGC TCCGACCCGG TGCTGATCAC GCGCCAGAAG
GACGGCTCGA TCAAGGTCAT GCTGAACGTC TGCCCGCACC GCGGCATGCG CGTCTGCACC
AGCGACGCCG GCAACGCCAA GGTTCACACC TGCATCTATC ACGGCTGGGC CTTCAAGAAC
GACGGCGAGT TCATCGGCGC GCCGGTTGCC GGCGAGCAGA TGCACGGGAC CATTTTGCCC
AAGGAAAAGC TCGGCCTGAC CCAGGCGCGC AGCCATCTCT ACGGCGGCCT GATCTTCGCC
ACCTGGAATC TGGAAGGCCC CTCGTTCGAC GAGTTCCTCG GCGACATGAA GTGGTACTAT
GACATGCTGT TCGAGCGCAC CGACAAGGGC CTCGAGGTCC TCGGCCCGCC GCAGCGCTTC
ACCATCAAGG CCAATTGGAA GACGGCGTGC GAACAGTCCG CCGCCGACGG CTTCCACACC
CTGACCCTGC ACCGCTGGCT GGGCGAGTTC GCCAAGTTCG GCGATGGCGA CCTGACCACC
TCGATGTACG GCACCGAGGT CGGCTCGCTG CAAGGCCACG CACTGCGCTG CATGCCCGTC
GCCAACAAGT TCAAGCAGGC CGCCGGCTTC CGTGACGGCA ATCTGACGCT TGAGGAAAAG
CTGGCCATCG TGCCGCCTCC CGGCATCACC AAGGAAATGC TGCCCGAGCT GATCCGCAAC
CTGACCCCGG AACAGCTCGG CCTGCTGGTC GACAGCCCGC CGCAAGTCGG CGGCATGTTC
CCCAACGTGC TGATCGCCTT CATCTATGTG CCGCAGCCCG ACGGCGAGAT CATCGGCCTG
ACCTCCCTGC ACACCTACAT CCCCAAGGGT CCGGACGAGC TGGAGTTCTG CAACTTCATC
ATGGCCGAAA AAGACGCGCC GGAAGAGCAC AAGCAGAGGG CGCTGCAGTT CGCGGTGCGA
ATGCTGGGCA CCTCGGGCAT GGTGGAGCAG GACGACTCCG ACACCTGGCC CCACATCACC
CAGACGGCCA AGGGCGCGGC CGCGCGCAAC GTCACGATGA AGTACCAGGC CGTCTGCACC
ACCCCTCCTC GGCCCGATTG GCCCGGTCCA GCCCTGGTCT ATGAAGGCTT CACCAAGGAC
GACACCCAGT GGAACTGGTG GCTGGCCTAC CGCGACCTGA TGAACAGCAC CCTCTGA
 
Protein sequence
MLDRTRTTLS DGTTIDDLIN VQTREVQLRT ISDPELYDIE MRKVFGRTWL LLGHESEIPN 
SGDFITRLMG SDPVLITRQK DGSIKVMLNV CPHRGMRVCT SDAGNAKVHT CIYHGWAFKN
DGEFIGAPVA GEQMHGTILP KEKLGLTQAR SHLYGGLIFA TWNLEGPSFD EFLGDMKWYY
DMLFERTDKG LEVLGPPQRF TIKANWKTAC EQSAADGFHT LTLHRWLGEF AKFGDGDLTT
SMYGTEVGSL QGHALRCMPV ANKFKQAAGF RDGNLTLEEK LAIVPPPGIT KEMLPELIRN
LTPEQLGLLV DSPPQVGGMF PNVLIAFIYV PQPDGEIIGL TSLHTYIPKG PDELEFCNFI
MAEKDAPEEH KQRALQFAVR MLGTSGMVEQ DDSDTWPHIT QTAKGAAARN VTMKYQAVCT
TPPRPDWPGP ALVYEGFTKD DTQWNWWLAY RDLMNSTL