Gene Caci_5030 details

Gene Information

Locus tag: Caci_5030
Symbol:
ID: 8336384
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5763828
End bp: 5765402
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 68%
IMG OID: 644958129
Product: X-Pro dipeptidyl-peptidase domain protein
Protein accession: YP_003115731
Protein GI: 256394167
COG category: [R] General function prediction only
COG ID: [COG2936] Predicted acyl esterases
TIGRFAM ID: [TIGR00976] putative hydrolase, CocE/NonD family
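
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive on the replicon.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the final codon is a stop codon and is not counted.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(cds_lengths(5763828, 5765402))  # (1575, 524)
```

With the values listed for Caci_5030 this reproduces the reported 1575 bp gene length and 524 aa protein length.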


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0204968
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTGTTCG ACATCCGCGT CGAATCAGGG CTCGAAGCGA CGATGCGCGA CGGGACGATC 
CTGCGCGCCG ACGCCTACCG CCCGATGGGG AGCGGACCGT GGCCCGTGCT GCTGGTCCGC
ACCCCGTACG ACAAGCAGAA CGCAGAGGTG CTCTCACGCC TCGATCCGCA AGGCGCCGCC
GGCCGCGGCT ATCTGGTCAT CGTCCAGGAC TGCCGCGGCC GGTTCGCCTC CGACGGTGTG
TGGGAACCGC TGCTGCACGA CGGCCCAGAC GGCTACGACA CGATCGTCTG GGCCGCGAGC
CTGCCCGCCT CGAACGGCCG GGTCGGAACG TACGGACCCA GCTACCTGGG CTATACCCAG
CAGGCGGCGA GGGCGGCGCA ACCACCTGGG TTGTGCGCCA GCGTTCCGGC GTTCACCTGG
TCGGATCCGA ACGACGGGCT GATGGCGCGC GGCGGCGCCT ACGAACTCGG GCTGATGACG
CACTGGACTT TGTCGCTCGG GTTCGATGTC TTGGCACGCC GGTACGCCGA CGCTCCGCAG
GAGTTGGCAT CCCGGCTCGC GGCGCTGAAC GGTGCGCTGG AGGACTTCCG GTCGCGGGTC
GTCTGGGACT CGCCTGCAGA GGACCTGCCT GTGCTCCGGC GCCTGGGGCT GACGACGCCA
AAGCCGACCA GTGCTCCGCA CCAGCGCTCG GCTGCGATAC CGACCCTGAC CATCGCAGGC
TGGTTCGACT GCTTCCTCCA AGGGAGCCTC GACAACCACG TCGCAGCGAC AGCCAGCGGC
GCGCCGACGG CACTGATCGT CGGTCCGTGG ACGCACGACG ACCAGAGCAG CCAGGGCGGC
GCGTCCCTCA ACGCACGCGA ACTCGACTTC CTCGATCGGC ACCTCAGGCC CGACTCCAGC
GTCCAGGCGC CCGAGTCACC CGAGTCACCC GAGTCACCCG TGCAGGTATT CGTGATGGGC
ACTGACGAAT GGCGCCGCTT TCCGTCCTGG CCATCCCAGA GCACTGAGAG CTCCTGGTAT
CTGCACCCTG ATGCATGCCT GGCGCCTCTC TTGCCGCCGA ATTCACCGCC GGACTCCTTC
GACCACGACC CCGACGACCC CGTCCCCACC CTCGGGGGCG CCATCCTCCT CGGCCCCGAT
TTCCCTTCCG GACAGTGCGA CCAGGCGCAG ATCGAGGAGC GCGACGACGT CCTGATCTAC
ACCAGCGAAC CGATGAAAAC CTCGCTCGAA GTCATCGGCC GCGTCCGCGT AGAACTGTTC
GCCACATCGA CGGCACCGAG CACCGACTGG ATCGCACGCC TCTGCGACGT CGACGAACAC
GGCGTCTCCC GCAACATCAC CGACGGCATC CTCCGCGCGC CATCAGCCGA GCCCCAGCGT
CAGCCTCAGA AACACACGAT CGACCTGTGG TCCACAGCCC ACGCATTCCT CCCCGGCCAC
CGCATCCGCC TCCAGATCAC CTCTACCTGC TTCCCCCGCT GGGCCCGCAA CCCCGCCTCG
TCCACCGCGC GCCAAACCGT GCACCACGGC AACGCAACAC CGTCACGGCT CATCCTCCCG
AGGACGCCGG CTTAA
 
Protein sequence
MVFDIRVESG LEATMRDGTI LRADAYRPMG SGPWPVLLVR TPYDKQNAEV LSRLDPQGAA 
GRGYLVIVQD CRGRFASDGV WEPLLHDGPD GYDTIVWAAS LPASNGRVGT YGPSYLGYTQ
QAARAAQPPG LCASVPAFTW SDPNDGLMAR GGAYELGLMT HWTLSLGFDV LARRYADAPQ
ELASRLAALN GALEDFRSRV VWDSPAEDLP VLRRLGLTTP KPTSAPHQRS AAIPTLTIAG
WFDCFLQGSL DNHVAATASG APTALIVGPW THDDQSSQGG ASLNARELDF LDRHLRPDSS
VQAPESPESP ESPVQVFVMG TDEWRRFPSW PSQSTESSWY LHPDACLAPL LPPNSPPDSF
DHDPDDPVPT LGGAILLGPD FPSGQCDQAQ IEERDDVLIY TSEPMKTSLE VIGRVRVELF
ATSTAPSTDW IARLCDVDEH GVSRNITDGI LRAPSAEPQR QPQKHTIDLW STAHAFLPGH
RIRLQITSTC FPRWARNPAS STARQTVHHG NATPSRLILP RTPA
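
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of translation table 11 match the standard code, and that the first codon (here GTG) is an alternative start codon read as methionine. The names `CODON_TABLE` and `translate_cds` are hypothetical; the input is only the first 60 bases of the gene sequence listed above.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna):
    """Translate a CDS given 5'->3' on the coding strand; stop at the first stop codon."""
    dna = "".join(dna.split()).upper()          # drop the spacing used in the listing
    usable = len(dna) - len(dna) % 3            # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        # Assumption: the first codon is a valid start and is read as Met.
        aa = "M" if i == 0 else CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_60_bases = ("GTGGTGTTCG ACATCCGCGT CGAATCAGGG "
                  "CTCGAAGCGA CGATGCGCGA CGGGACGATC")
print(translate_cds(first_60_bases))  # MVFDIRVESGLEATMRDGTI
```

The output matches the first 20 residues of the protein sequence above (MVFDIRVESG LEATMRDGTI).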