Gene Caci_4970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4970 
Symbol 
ID8336324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5682962 
End bp5683948 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content68% 
IMG OID644958069 
ProductRieske (2Fe-2S) domain protein 
Protein accessionYP_003115671 
Protein GI256394107 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.22227 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.214971 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCGTG AGGTGACCGG CGCGAGTGCC GAACCACATC AGCACCCGTG GGGCTGGTAT 
CCCGTCGCGT TCAGCCGTGA AGTGACGTCC GGCAAGGTCA TCAGCCGGAA GTTCGCCGAG
TCCGAGATCG TCATCTACCG GACCGAGACC GGCCGGGCCC ACATCGTCAG CCCCTACTGC
CCGCATCTGG GTGCTCACAT GGGCGGCGCC CGCGTCGACC GCGAGTTGCT GGTGTGCCCC
TTCCACGCTT TCGCCTTCGC GCCCGACGGC CGCTGTGTCC GCACCGGATA CGGCACGCCG
CCACCCCGCG GCGCACAGCT CACCTCGACG CCGGTCCGCG AAAGGAACGG TTTCATCTTC
GCCTGGCACC ATCAGAACGG GTCCCCGCCG TCCTGGGAGG TCGAGCCGTT CGACTTCAGC
GGATGCGGCC GAGGCGCGGT CTGGTCTCGC ACGTTCCAGG GAAACCAGAT CGACTTCCTG
GAGAACACCG CGGACATCGG GCACTTCTCC ACCCTGCACC ACGTCCAGGC GACGCTCGTG
GGCACGCCGC GCATGGACGG CCACCGCTAC GCCACCGACA TCGACCTGTC CGGCTTCTAT
CGCAGCGAGC TGGTCACCCA CGCCCACGTG GAAGTGTTCG GCCTCGGATA CGTCACCGTG
CGCTTCGACA TGCCGAAGCT CGGCATCAGC GCCATCGAGT TCGCCGGACT CACTCCGGAA
GGCGGCGGCG CCATGACGCT GCGTCGCATC ACCCACGGAC GCCTCGCCGG CACTCGACTG
CCGCGCGCGG CAGCCTGGGC CCGGCGTCCG GCCTCCGATC TCCTGGGCTT CGCGCTGAAG
GTGGCGGGCA ACTCTCAGGT CACGGCCGAT GTGCGCATGT GGAGTCGCCG TGTCGTCACC
GAGACTCCCA AGCTCGCCCA AGGCGACGGT CCCATCGCCC CCGCCCGCCG CTGGGCTCAG
CGGTTCTACC AGGAACAGGA AGCGTGA
 
Protein sequence
MTREVTGASA EPHQHPWGWY PVAFSREVTS GKVISRKFAE SEIVIYRTET GRAHIVSPYC 
PHLGAHMGGA RVDRELLVCP FHAFAFAPDG RCVRTGYGTP PPRGAQLTST PVRERNGFIF
AWHHQNGSPP SWEVEPFDFS GCGRGAVWSR TFQGNQIDFL ENTADIGHFS TLHHVQATLV
GTPRMDGHRY ATDIDLSGFY RSELVTHAHV EVFGLGYVTV RFDMPKLGIS AIEFAGLTPE
GGGAMTLRRI THGRLAGTRL PRAAAWARRP ASDLLGFALK VAGNSQVTAD VRMWSRRVVT
ETPKLAQGDG PIAPARRWAQ RFYQEQEA