Gene Caci_4253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4253 
Symbol 
ID8335607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4828475 
End bp4829554 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content70% 
IMG OID644957356 
ProductMembrane dipeptidase 
Protein accessionYP_003114958 
Protein GI256393394 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.655109 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.32173 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAACAA CTGAAGGCCT CGACGCCTCG ACGCTGGAGC TGCACCGGCG CGTGGTGGTT 
GCCGATACCC ACAATGACCT GCTCTGCTCG GTCGTGCTGC GGCCGGTGGC GCAGTGGTCC
GATTACTTCC GCGCGCAATG GTTGCCGCAG TTGCGGGCCG GCGGCGTGGA CGTTCAGGTG
CTGCCGGTGT TCATCGATGA CTCCTTCCGT CCTGAAGGTG CTCTGCGCCA GACGTTGCGG
ATGATCGAGG CGGCGCACCG GATTGCCGAG GGCAACGCCG ATGAGGTCAG CCTGTGCCTG
GATGGCGCCG ACATCGATCG CGCCCTGGAC GCCGGGCGGA TCGCGCTGGT CCTCGCGCTG
GAAAGCGCAC CTGGCATCGA CGCCGACATC GAACTGCTCA CCACCTTGTA CCGCCTCGGT
GTCCGCATCG CCTCCCTAGC GCACTTCGGG CGCACGCCGC TCGCTGACGG CTCGGCGGAG
GACGCGGCCG GGAGCCGGCT CACCGCTGCC GGCGTCGAGG CGTTCGCGGA GATGGAACGC
ATGGGCATGG TGTTCGACGT CTCCCACCTC GGTGCGGCGG GCGTGGACCA TGTCCTGGAG
TTGGCGACCC GGCCGCTGCT CGCCACGCAT TCCTCCGCTC GCGCGCTGTG CGACCACCAC
CGCAACCTCA CCGACGCGCG CCTGGCGGCC ATCGCGGCCG GTGGCGGCGT GGTCTGCGTG
AACTTCTTTC CCGGCTTCGT CGATGCCCAC GAGCCCTCCG TGTCCCGCCT CGTCGACCAC
ATCGAGCACA TCGGCAAGGT CGCCGGTACC GACCATGTCG GCATCGGGCC GGACTTCGTC
GTCGAGGTGC TGCGCGACGT GACGCCTGGC GGCGTGGAGA TCGGCCTGAT GGCCGGCTGC
GATCCGTTCG ACACGCTGCC GGGACTGCCC GGACCTGCGG GATTGCCGCT GCTCACCGCC
GAACTGCTGG CCCGAGGCGT GGACGAGGCA GTGATCGCCG CGACGCTCGG TGGCAATGTC
CTGCGACTGT TCCGCGCCGA GCTCGGCGTG CCCGCGGAGC GTCGGGGAGC CGCCGCGTGA
 
Protein sequence
MGTTEGLDAS TLELHRRVVV ADTHNDLLCS VVLRPVAQWS DYFRAQWLPQ LRAGGVDVQV 
LPVFIDDSFR PEGALRQTLR MIEAAHRIAE GNADEVSLCL DGADIDRALD AGRIALVLAL
ESAPGIDADI ELLTTLYRLG VRIASLAHFG RTPLADGSAE DAAGSRLTAA GVEAFAEMER
MGMVFDVSHL GAAGVDHVLE LATRPLLATH SSARALCDHH RNLTDARLAA IAAGGGVVCV
NFFPGFVDAH EPSVSRLVDH IEHIGKVAGT DHVGIGPDFV VEVLRDVTPG GVEIGLMAGC
DPFDTLPGLP GPAGLPLLTA ELLARGVDEA VIAATLGGNV LRLFRAELGV PAERRGAAA