Gene Caul_1974 details


Gene Information

Locus tag: Caul_1974
Symbol:
ID: 5899429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2119372
End bp: 2120775
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 63%
IMG OID: 641562463
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001683600
Protein GI: 167645937
COG category: [P] Inorganic ion transport and metabolism
              [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
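
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the Protein Length excludes the stop codon; neither convention is stated in the record, and the helper names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2119372
end_bp = 2120775
gene_length_bp = 1404
protein_length_aa = 467


def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)       # coordinates agree with Gene Length
print(computed_protein == protein_length_aa)   # Gene Length agrees with Protein Length
```

Under these assumptions, 2120775 - 2119372 + 1 gives 1404 bp, and 1404 / 3 - 1 gives 467 aa, which matches the listed values.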


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTGGCCC CGCTGAACCG AGAAACCCTG TCGGATGGAA CCGCGATATC CGACCTGATC 
AACCTGGACA CGCGCGAGGT CAAGATGCGC GCCCTGTCCG ATCCGGAACT CTTTGCACTG
GAGATGGAGC GCATCTTCGC CAAGACCTGG CTGTTCCTCG GCCACGAAAC CGAAATCCCC
AATCCCGGTG ACTTCGTCAC CCGCGACATG GGCTCGGACG GGGTGATCGT CGCCCGGGAT
CGCGAGGGCC AGATCCACGT CTCGTTGAAC GTCTGCCCCC ACCGCGGGAT GAAGATTTCG
ACCTTGGAGG CCGGCAACAC CCTGGCGCAC GTCTGCATCT ACCATGGCTG GGCCTTCAAG
CCGAACGGCG ACTTCGTCGG CGCTCCGGTT CGCAGCGAAT GCATGCAGGG CAAGATGCTG
ACGGATGAGC AGTTGTCGCT GAAAAAGGCC CGGATCGCGA TCTACGGCGG GCTGATCTTC
GCCACCTTCA ATATTGACGG CCCGAGCTTC GACGAGTTCC TCGGCGACGC GAAATGGTAT
TTCGATACTC TGTGGAACCG CACGGCCGGA GGCATGGAAG TGCTTGGGCC GCCCCAACGC
TTCATCATCC GAGCCAATTG GAAGACGGCC TGCGAGCAGT CGGCCTCAGA CGGCTTCCAT
ACCCTGACCC TCCACCGGTG GCTGGGCGAG GTCGGCCCCT ACGCCAAGAA GCCCGAGGCG
GAAGGCCAGG GCGCCGACCT GGCCGCCGAG ATGGGCGGAT GCGAGGTCTG GACCGATGGC
GGCCACACCA TGCGCTGCAT CGACCTGGAC CGTAAGATTC GCCGCATCAC TGGGCGCGAT
CCGTCCGAAC TGTCCGCCGC CGAGAAGCTC GCACTGCTGC CTCCCCCAGG CATGACCCCG
GAGATGGTGC CGGAACTGCT GGAACGTTTC GACGACGACC ATCTTCGCTT GATGGCCTGG
CGGCCGCCTC AGGTCGGCAA TTTCTTTCCC AATGGCCTGT TCGAGTTCAT CTACCTGCCG
CAGCCGGACG GGACTGTGGC CGGCGCCATG GCCCTGCACG CCTATGTGCC CAAGGGCCCC
GACAAGCTCG AATTCATGAA CTGGATTTTC GCGGAGAAGG ACACCCCGCC TGCGTTGAAG
GCCCGCATGC TGCGCCAGTC GATCCAGCTT CTGGGCACCT CGGGGATGGT CGAACAGGAC
GACTCGGACA CGTGGCCGCA CCAGACCATC GTCGCCAAGG GTGCGGTTTC CAAGGATATC
ACCATGAAAT ACCAGGCCCT CTACGAGACG GGCCGGCCCG CCAACTGGCC CGGCCCGGGT
CATGTCGGTG AAGGTTTCAC CAAGGACGAT ACCCAGTGGC AGTGGTGGAA GTCCTGGTAC
GACCTGATGG TCGTCGACGC CTGA
 
Protein sequence
MLAPLNRETL SDGTAISDLI NLDTREVKMR ALSDPELFAL EMERIFAKTW LFLGHETEIP 
NPGDFVTRDM GSDGVIVARD REGQIHVSLN VCPHRGMKIS TLEAGNTLAH VCIYHGWAFK
PNGDFVGAPV RSECMQGKML TDEQLSLKKA RIAIYGGLIF ATFNIDGPSF DEFLGDAKWY
FDTLWNRTAG GMEVLGPPQR FIIRANWKTA CEQSASDGFH TLTLHRWLGE VGPYAKKPEA
EGQGADLAAE MGGCEVWTDG GHTMRCIDLD RKIRRITGRD PSELSAAEKL ALLPPPGMTP
EMVPELLERF DDDHLRLMAW RPPQVGNFFP NGLFEFIYLP QPDGTVAGAM ALHAYVPKGP
DKLEFMNWIF AEKDTPPALK ARMLRQSIQL LGTSGMVEQD DSDTWPHQTI VAKGAVSKDI
TMKYQALYET GRPANWPGPG HVGEGFTKDD TQWQWWKSWY DLMVVDA
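
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG is one of the accepted start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation, not the database's own pipeline. The function name `translate_cds` is hypothetical, the codon table is built from the standard compact amino-acid string (table 11 shares its amino-acid assignments with the standard code), and the start codon set is the one NCBI lists for table 11.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS (or an in-frame 5' fragment of one) into protein."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    # The initiating codon is read as methionine regardless of its usual meaning.
    protein = "M" + "".join(CODON_TABLE[c] for c in codons[1:])
    # Drop a single terminal stop codon if the fragment includes it.
    return protein[:-1] if protein.endswith("*") else protein


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "GTGCTGGCCC CGCTGAACCG AGAAACCCTG TCGGATGGAA CCGCGATATC CGACCTGATC"
print(translate_cds(first_row))
```

Run on the first row of the gene sequence, this yields MLAPLNRETLSDGTAISDLI, the first 20 residues of the listed protein sequence. Passing the full 1404 bp sequence should reproduce the whole protein under the same assumptions.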