Gene Caci_5049 details


Gene Information

Locus tag: Caci_5049
Symbol:
ID: 8336403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5794124
End bp: 5795812
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 70%
IMG OID: 644958148
Product: CHAP domain containing protein
Protein accession: YP_003115750
Protein GI: 256394186
COG category:
COG ID:
TIGRFAM ID:
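
The coordinate and length fields above are consistent with each other. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Values copied from the Gene Information fields above.
start_bp = 5794124
end_bp = 5795812

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1689 562
```

Both results match the Gene Length (1689 bp) and Protein Length (562 aa) fields.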


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCACAA ACCGTAAGAA CTTTGCCATC AGCCTGCTGG CCGCGACGCT GCTGGCGCCC 
ATGGCCTCGG TACTGACCGC CGGGTCGGCC TCGGCCACGA CGGTCGGCGC CACGATCGCC
GCCGTCGCCG ACGGGCAGAT CGGCAACGGG TCGACCGGCC CGTGCGGGCT CGGGTCCCGC
TATCTGGGGT ACTCCGCGCC CAACGGACCG ACCAACCAGC ACAACAGCTG CGCCTCGCCC
GGCAGCAACT CCGGCCAGTC CCAAGCGTGG TGCGCCGACT TCGCGGGCTG GGTGTGGAAC
GAGGCCGGCG TCACCGTCGA CGGCACCCTG AACGACCTGG CCAGCAGCTT CTACGACTAC
GGACAGAGCC ACGGCACATG GTCCTCGACC CCGCACGTCG GTGACGCGGT CTACTTCGAC
AGCTCCATCA AGGGCGGTTA CGGCCACGTC GCGATCGTCA CCGCGGTCAA CAGCGACGGC
ACCTTCACCG AGGTCGGCGG CAACGAGAGC AGCCTGGTCG GCAGCGGCAA CAACTGGAAG
TCGGCGCCCG GGAGCCAGGT CGTCTGGAGC GACAAGGTCT ACGACGGCAA CGGCAACTGG
ACCGGCGGCT ACGCGGACGT CAAGGTCATG GGCTTCTCCA GCCCGGTCGG CGGCACCACC
CCGCCCCCGC CGCCGTCCAG CACCCCGCAC AGCGGCTCCT CCGGCCTGAC CGCCGCCTCC
AACGGCGGAT ACACGACCGC GTGGAAGGGG ACTGACGGCT ACCTGTGGCT GGCCAACGGC
AACGGCGCGG CGATCTCCTC CAAGGGCAAC CCGTGGCTGC TCGGCGTCGC GGCGAACACG
ACTCCTTCGA TGGCGACCCT GTCCGACGGC TCGTGGGTCT CCGCCTGGCA GGGCAGCGAC
GGCTACCTGT GGCTGGCCAC CGGAACCGGC ACCGCGATCA CCGCCAAGGG CAACCCGTTC
CTGCTCGGGG TCGCACCCGG CACCAGCCCG TCGATCGTCG CGCTGCCCAA CGGCGGCTGG
GAAGTGGCGT GGAAGGGTCA GGACGGCTAC CTGTGGCTGG CCACCGGCTC GGGGACGAAC
ATCACCGCCA AGGGCAATCC GTTCCTGCTC GGCGTCTCCG GCACGACCAG CCCGGCTCTG
GCGGCGCTCC CCAACGGCGG CTTCCAGGCC GCGTGGAAGG GCGGGGACGG CTACCTGTGG
CTCGCCTCCG GCTCCGGCGT CAACATCACC GCCAAGGGGA ACCCGTTCCT GCTCGGCGTG
GTCAACAATC CGGCGCTGGT GACGATGCCC GACGGCAGCT TCGAGACCGC GTGGAAGGGC
GGCGACGGAT ACCTGTGGCT CGCCTCCGGA ACCGGCGCCA CGATCACCGC GAAGGGCAAC
CCGTTCCTGC TCGGCGTCTC CGGCGACACC AGCCCGTCCA TCGCAGCACT GCCCAGCGGC
GGCTTCGAGA CCGCTTGGAA GGGCGGCGAC GGATACCTGT GGCTGGCCAC CGGTTCGGGT
GCGAACATCA CGGCGAAGGG GAACCCGTTC CTGCTCGGTG TGGCGAACAA CCCCGAGCTC
GTGACCAAGG CCGACGGCAG CTTCCAGGCC GCGTGGAAGG GCGGCGACGG CTACCTGTGG
CTCGCCTCCG GCTCCGGCAT CAACATCACC GCCAAGGGCA ACCCGTGGCT GCTGGGCGTC
GCTTCGTAA
 
Protein sequence
MRTNRKNFAI SLLAATLLAP MASVLTAGSA SATTVGATIA AVADGQIGNG STGPCGLGSR 
YLGYSAPNGP TNQHNSCASP GSNSGQSQAW CADFAGWVWN EAGVTVDGTL NDLASSFYDY
GQSHGTWSST PHVGDAVYFD SSIKGGYGHV AIVTAVNSDG TFTEVGGNES SLVGSGNNWK
SAPGSQVVWS DKVYDGNGNW TGGYADVKVM GFSSPVGGTT PPPPPSSTPH SGSSGLTAAS
NGGYTTAWKG TDGYLWLANG NGAAISSKGN PWLLGVAANT TPSMATLSDG SWVSAWQGSD
GYLWLATGTG TAITAKGNPF LLGVAPGTSP SIVALPNGGW EVAWKGQDGY LWLATGSGTN
ITAKGNPFLL GVSGTTSPAL AALPNGGFQA AWKGGDGYLW LASGSGVNIT AKGNPFLLGV
VNNPALVTMP DGSFETAWKG GDGYLWLASG TGATITAKGN PFLLGVSGDT SPSIAALPSG
GFETAWKGGD GYLWLATGSG ANITAKGNPF LLGVANNPEL VTKADGSFQA AWKGGDGYLW
LASGSGINIT AKGNPWLLGV AS
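
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a rough sketch, not the tool that produced this record. It assumes translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical:

```python
from itertools import product

# Amino acids for the 64 codons, with bases ordered T, C, A, G
# (first base varies slowest, third base fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above.
first_line = "ATGCGCACAA ACCGTAAGAA CTTTGCCATC AGCCTGCTGG CCGCGACGCT GCTGGCGCCC"
print(translate(first_line))  # prints: MRTNRKNFAISLLAATLLAP
```

The printed residues match the first 20 residues of the protein sequence, and passing the full gene sequence to `gc_percent` gives a value to compare with the GC content field.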