Gene Caci_6020 details

Gene Information

Locus tag: Caci_6020
Symbol:
ID: 8337383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 6938527
End bp: 6939546
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 71%
IMG OID: 644959124
Product: Acetamidase/Formamidase
Protein accession: YP_003116718
Protein GI: 256395154
COG category: [C] Energy production and conversion
COG ID: [COG2421] Predicted acetamidase/formamidase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.218038
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAACT CGCTCCCGGT CCTGACGTTC ACCCCCTCCG AAGGCGACTA CGTCTGGACC 
TTCGGCGGCG CCCCGCCGCT GGCCCGGGTG AAGCCCGGCG ACACCCTGGA GCTGTTCACC
GAGGACTGCT TCGCCGGCCG GGTCCGCTCC GAGAAGGACC TGGTCACCGA GGTCTGCGAG
TTCCCGTTCC TGAACCCGCA GACCGGACCC TTCTACGTCG AGGGCGCCGA ACCCGGCGAC
ACGCTGGCCG TGCACTTCGT CTCCATCGAG CCGGCCCGGG ACTGGGCCGC CTCGACCACC
GTGCCGCTGT TCGGCGCGCT GACCTCCACC CACACCACCG CCACGCTCCA GCCGCCGCTG
CCGGAGCGGG TCTGGATCTG GCAGCTGGAC CGCGAGCGCC GGACCTGCCT GTTCCGCGCC
CACGACAGCG ACATCGAGGT CGAGCTCCCG ATGGACCCGA TGCACGGGAC CGTCGGCGTC
GCCCCGGCCA ACCTCGAGGT CCGCTCAGCC CTGGTCCCCG ACGCCCACGG CGGCAACATG
GACACCCCCG AGATGCGCGC CGGCGTGACC TGCTACCTCG GCGTGAACGT CGAGGGCGCA
CTGTTCAGCC TCGGCGACGG CCACGCCCGC CAGGGCGAGG GCGAGACCTG CGGAGTGGCG
GTCGAGACGG CGATGAACTC GGTGATCACC ATCGAGCTGA TCAAGGGCGT CCCCACCCCC
TGGCCGCGCC TGGAGTCCGA CACCCACATC ATGACCGCCG GCTCCGCGCG ACCGCTGGAG
GACGCCTTCC GGATCGCCCA GCTCGACCTG GTGCAGTGGG TCGCGCGCGA CTACGGGCTC
AGCGAGCTGG ACGCCTACCA GCTGGTGACG CAGGGGGTGG AGTCGCCGCT GGCGAACGTG
TGCGACACGA ACTACACGTC GGTCGCCAAG ATGCGCAAGG CTTTCCTGCC TTCGGGCACG
TCGGCGTACG GCGTGCACGA GACGCTGCGG TCGCGGGCGG CGGCTTATCT GGCGGGGTGA
 
Protein sequence
MANSLPVLTF TPSEGDYVWT FGGAPPLARV KPGDTLELFT EDCFAGRVRS EKDLVTEVCE 
FPFLNPQTGP FYVEGAEPGD TLAVHFVSIE PARDWAASTT VPLFGALTST HTTATLQPPL
PERVWIWQLD RERRTCLFRA HDSDIEVELP MDPMHGTVGV APANLEVRSA LVPDAHGGNM
DTPEMRAGVT CYLGVNVEGA LFSLGDGHAR QGEGETCGVA VETAMNSVIT IELIKGVPTP
WPRLESDTHI MTAGSARPLE DAFRIAQLDL VQWVARDYGL SELDAYQLVT QGVESPLANV
CDTNYTSVAK MRKAFLPSGT SAYGVHETLR SRAAAYLAG
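
Consistency check (illustrative)

The length fields in the Gene Information section can be cross-checked against the coordinates and against each other. The sketch below is not part of the database record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the unspliced CDS ends in a single stop codon. The function names are hypothetical helpers chosen for this example. The GC helper ignores the spaces that group the bases into blocks of ten, so a sequence block can be pasted in as displayed.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 6938527
END_BP = 6939546
GENE_LENGTH_BP = 1020
PROTEIN_LENGTH_AA = 339


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between sequence blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Coordinates versus the reported gene length.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # Gene length versus the reported protein length.
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

To compare against the reported GC content, pass the full gene sequence shown above to `gc_percent` as a single string.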