Gene Caci_5541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5541 
Symbol 
ID8336901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6391408 
End bp6393441 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content69% 
IMG OID644958645 
ProductPeptidyl-dipeptidase Dcp 
Protein accessionYP_003116241 
Protein GI256394677 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.481391 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCATCG AGAATCCCTT TTTCCCGGCC AGCACCCTGC CCTACGAGCT GCCGCCGTTC 
GCCGCCATCC GGGAGGAGCA CTACACCCCG GCGTTCGACG CCGGGTTCGC CGAGCACCTG
GCCGAGATCA AGGCGATCGC CGAGAACCCG GAGCCGGCGA CGTTCGAGAA CACGATCGTC
GCCATGGAGC GCGCCGGCGC GCTCCTGCGC CGGGTCGCCG CGGTCTTCTT CGCGCAGACC
TCCTCCGACA CCACCGACGG CATCCAGGAC ATCGAGCAGG ACGTCATCCC GCGCCTGGCC
GCCCACTCGG ACGCGATCAC CCTGGACGCG GCGCTGTTCG CGCGCATCGA CGACCTCTAC
GAGCGCCGCG CGGATCTCGG GCTGAGCGAG GTCGAGGTAC GCCTGTTGGA GAAGCACCAC
CTGAACTTCG TGCTCGGCGG CGCGAAGCTC TCCGCCGCCG ACAAGGACCG CCTCAAGGAG
CTCAACGAGC AGCTCGCCGC GCTGTCCACG GACTTCGACC GGAACCTGCT GGCCGCGAAC
AAGGCCGGAC AGCAGGTCTT CGACTCCGCC GAGCAGCTGG CCGGGCTGTC GGCGGACGCG
GTCGCGGCGG CGAAGGAGAA CGGCGAGGCG GTCGGGCTGC CCGGCAAGTA CGTCATCTCC
CTGAAGAACT TCTCGAACCA GACGCAGCTG GCCTTCCTGG ACGACCGCGA GGCGCGGCAC
GCACTGCTGA CCGCGTCGCT GGAGCGCGCT TGGGACACCA ACGGCCCGAT CGCCGTGCAG
ATCGCGAAGC TGCGGGCCGA GCGCGCGGCG CTGCTGGGCT ACAGCTCCTA CGCCGAGTAC
GCGGTGCAGG ACCGGACGGC CCAGAGCACC GAGGCGGTCG AGGACCTGAT GTCGCGGCTG
ATCCCGGCTG CGGTGGCGAA CGCCGCCAAG GAGGCCAAGG CGTTGCGCGC GCATCTGCCG
GCCGGGCAGA GCCTGGAGGC GTGGGACTGG ACGTACTACT CGGAAAAGGT GCGCCTGGCC
GAGTACGACG TCGACTCCGA GGCGCTGCGG CCGTATCTGG AGCTGGACCG CGTGCTGATC
AACGGCGTGT TCCACGCCGC GGAGCTGGTG TACGGGATCA CCTTCAAGGC TCGTCCGGAC
CTGGTCGCCT ACCACCCGGA CGTGCGCATC TGGGAAGTGT TCAACACCGA CGGCTCCGGC
ATCGGACTGT TCCTCGGCGA CTTCTACGCG CGCGGCTCCA AGCGCGGCGG CGCGTGGATG
ACGAACTACG TGGACCAGTC CGGTCTGCTC GGCCAGGCGC CGGTCGTGGT GAACAACCTG
AACCTCGCCA AGCCGCCGGC CGGCGAGCCC ACGCTGTTGA CGTGGGACGA GGTCCGCACA
CTGTTCCACG AGTTCGGACA CGCCCTGCAC GGCCTGTTCT CCGACGTGGA GCACCCGACC
TTCTCCGGCA CGAACACGCC GCGCGACTTC GTTGAGTACC CGTCCCAGGT GAACGAGATG
TGGGCGGAAT GGCCGGAAGT ACTGGCGAAC TACGCTAAGC ATTACCGCAC TGGCGAGCCG
GTCCCGGCGG AACTGCTGGA GCGCATGGCC GAGGCGGAGA AGTTCGGGCA GGGCTTCGCC
ACCGTGGAGA TCCTCGGCGC GGTGATGCTG GACTGGGCTT GGCACAAGCT GGCAGCCGGC
GAGGACCCGG GCGACGCCAA GGAGTTCGAG GCCGCCGCGC TGACGCACTA CGGACTGCTG
GTTCCCGAGA TCCCGTCGCG GTACCGCACC AGCTACTTCG CGCACATCTG GGGCAACGAC
TACAGCGCCG GGTACTACTC GTACCTGTGG AGCGAGGTCC TGGACAAGGA CACGGTCGAC
TGGTTCAAGG ACGGCGCGGC GCAGGGCCGG ACGATCCGCG AGAGCGGCGA GGCGTTCCGG
CGTGCGGTGC TCTCGCGCGG CGGGAGCGTG GATCTGATGG CGGCGTTCGC GCAGTTCCGG
GGGCGCGCGC CGGAGGTCGG TCCGATGCTG CGGGCCCGAG GGCTCGAGGG CTGA
 
Protein sequence
MTIENPFFPA STLPYELPPF AAIREEHYTP AFDAGFAEHL AEIKAIAENP EPATFENTIV 
AMERAGALLR RVAAVFFAQT SSDTTDGIQD IEQDVIPRLA AHSDAITLDA ALFARIDDLY
ERRADLGLSE VEVRLLEKHH LNFVLGGAKL SAADKDRLKE LNEQLAALST DFDRNLLAAN
KAGQQVFDSA EQLAGLSADA VAAAKENGEA VGLPGKYVIS LKNFSNQTQL AFLDDREARH
ALLTASLERA WDTNGPIAVQ IAKLRAERAA LLGYSSYAEY AVQDRTAQST EAVEDLMSRL
IPAAVANAAK EAKALRAHLP AGQSLEAWDW TYYSEKVRLA EYDVDSEALR PYLELDRVLI
NGVFHAAELV YGITFKARPD LVAYHPDVRI WEVFNTDGSG IGLFLGDFYA RGSKRGGAWM
TNYVDQSGLL GQAPVVVNNL NLAKPPAGEP TLLTWDEVRT LFHEFGHALH GLFSDVEHPT
FSGTNTPRDF VEYPSQVNEM WAEWPEVLAN YAKHYRTGEP VPAELLERMA EAEKFGQGFA
TVEILGAVML DWAWHKLAAG EDPGDAKEFE AAALTHYGLL VPEIPSRYRT SYFAHIWGND
YSAGYYSYLW SEVLDKDTVD WFKDGAAQGR TIRESGEAFR RAVLSRGGSV DLMAAFAQFR
GRAPEVGPML RARGLEG