Gene Caci_5497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5497 
Symbol 
ID8336857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6337216 
End bp6339354 
Gene Length2139 bp 
Protein Length712 aa 
Translation table11 
GC content62% 
IMG OID644958601 
ProductGCN5-related N-acetyltransferase 
Protein accessionYP_003116197 
Protein GI256394633 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0755902 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0749364 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCCGAA GATTATCCGG TCTCAGCCGG AGCCTTCACA GAAGGGAAGG TGGGTACGAC 
AATGCGCGCA TGTCTAGTGA GTCGCAGCAA CCCACTTCGC GCACCATAAC GATTGCGTCG
GCGCGGCGAG GTGATTCTGC TGAGGTAGAC GCCGCCGTGC TGATCGGCAA CGCGGCGCGC
GACACCCTGG GACATCTTCC GTTTGCCGCC TACAGGGAGG CTGCGCACAA CGGCTGTCTG
CTTCTGGCGC GAGATGGCGG AGAGATCGTC GGTTACGCGC TCTACGGCCT TACACGCGAC
CATGTGCGAC TGACCCACCT GTGCGTCAGG CCCGATTACC GTGGGCTCGG CATCGGCCGC
AGACTTGTCG AACGTATTGT TGACGAACGG GCCGACTACC CGGGGATCAA GGCGAGTTGC
CGCCATAGCT ATGGCCTGGG TGAAATGTGG GTGAGCTTGG GCTTCAGCCA GATCGGCGAA
CGGCTTGGTC GAAGTGACGA TGGGCATATT CTGGTGATTT GGTGGCGGGA CCATGGGCAC
CCGAACCTGC TAAGCCGCCA AGAGGAGCTT GTGCTCGTGC GCGCCGCGGT GGACTTGAAC
GTGGTGCGGG CTCTGGCCGG CCCACAGCGT CAGGACACCC TCGATGTTCA GGCCCTATTG
GACGACCAGA GCGTCGACAG GCTGGAGTTG ATTCGCACCC CTGCCCTAAA CGCAGAGATC
GACACCATTG AGAGCACGCT GCGAGCGCAG TGCACCAAAA GCATTCGCAA CATGGCGGCT
GTGCGGAGCA AAACAGCCGC TGTTGGCGAG GCCAGGGCTG CGATCATCGC GGCAATCCCT
CAGGCGTCCC CTGGGTACCC GCTGACGAGG CAGGACGAGT TCGACCTTCG GCACGCGTGT
GACGCGATCG CTGCCGGATT GAATGTGCTC ATCACCAGCG ATCGTCGGAT GATCAGGACG
TTGGGCGCAG CGTGCGAGCG CCTGTACGGT TTGCGGATCA TGCTCCCCGT TGACGTCATC
ACGCATCTCG ATGAGTTGGT TCGGGCGGAG GCTTATCGCC CGGCCGGGCT ACTCGGCACG
GCGTGGAACT CCCGCTTGCT TGGCGCCGGC GATGCGGCTC GGGCTATGCA CCTAGTGGGA
GATGGTGAGA GGCCCAAGGC GTTCCAGGAC TTAATAAAAT CGCTGGCACG CGACCAGCAC
GAGCGGATCG GCATCGTTGA CGCAGCTGGC CACATGGCGG CCGCGTACTG CGCCTACGCG
TGCGGCGACG AGTTGGTTGT CCCATTAGCG CGCGTAGCCC CTGTGCGGCT CAGCGAGACT
CTGGCGCGTC AGATAATCTT CCTTCTCCGC CAGATGGCAA GAGGGAATGC TGTCTCGACG
ATTCGGCTGG CCGACGCCCA CGCAAGCCGT TCGCTCCGGT TGGCGGCTCT CGAAGATGGA
TTCCAGCCGG TCGATGGAGG GCTGGTTGCC CACTCTCTCA ACGTTTGTGG AACCGCAGAG
GAACTCAGCC GTCGAGCAAT CCTTGCGGCG CGTTACGCAG GTCTGAGTGA GCCGCCATTG
CTACGATCAG CCATGCCATC AGTCGCCGCT GCCGAGATCG AGCGTGTTTG GTGGCCGGCC
AAGGTCGTCG ACAGCGCGCT GACCTCGTAT CTCATCCCTA TAAAGCAGAT CTTTTCGAAC
GACTTGCTTG GGGTTCCAGA GACGCTCGAC GGCCGTGACG ATCAGTTGGG CCTGAGCCGG
GAGCATGTCT ACTACCGAAG CCCTCGTGGA CCGCGACCGG CAGCACCGTC TCGTTTGCTG
TGGTACATGA GCGCTGGCGG GCAGACCCGT TCGCAGAGCC CAGGCGTGAT TGCCTGTTCG
CAGCTTGACG CTGTCGTGAC TGGCGTACCT CAAGAGCTGC ACAGTAGGTT TCGGCACCTG
GGCGTGTGGA ACGAACAACA AGTCGTCGCA GCGGCCCATG AAGGACAGGT GCAAGCGCTG
GTGTTCACGA ACACGGAGAT CCTGCCGCAA CCCGTGGGGC TTCGAGAGCT GCGCGAACTC
TCAAAGCGGT ACGGGGAGCA GGCAACTCCG CAGGGACCGG TACGGATCTC GGCTGAGTAT
TTCGGCGCCA TCTACCTGGC AGGCCAGAGC CAATCGTGA
 
Protein sequence
MFRRLSGLSR SLHRREGGYD NARMSSESQQ PTSRTITIAS ARRGDSAEVD AAVLIGNAAR 
DTLGHLPFAA YREAAHNGCL LLARDGGEIV GYALYGLTRD HVRLTHLCVR PDYRGLGIGR
RLVERIVDER ADYPGIKASC RHSYGLGEMW VSLGFSQIGE RLGRSDDGHI LVIWWRDHGH
PNLLSRQEEL VLVRAAVDLN VVRALAGPQR QDTLDVQALL DDQSVDRLEL IRTPALNAEI
DTIESTLRAQ CTKSIRNMAA VRSKTAAVGE ARAAIIAAIP QASPGYPLTR QDEFDLRHAC
DAIAAGLNVL ITSDRRMIRT LGAACERLYG LRIMLPVDVI THLDELVRAE AYRPAGLLGT
AWNSRLLGAG DAARAMHLVG DGERPKAFQD LIKSLARDQH ERIGIVDAAG HMAAAYCAYA
CGDELVVPLA RVAPVRLSET LARQIIFLLR QMARGNAVST IRLADAHASR SLRLAALEDG
FQPVDGGLVA HSLNVCGTAE ELSRRAILAA RYAGLSEPPL LRSAMPSVAA AEIERVWWPA
KVVDSALTSY LIPIKQIFSN DLLGVPETLD GRDDQLGLSR EHVYYRSPRG PRPAAPSRLL
WYMSAGGQTR SQSPGVIACS QLDAVVTGVP QELHSRFRHL GVWNEQQVVA AAHEGQVQAL
VFTNTEILPQ PVGLRELREL SKRYGEQATP QGPVRISAEY FGAIYLAGQS QS