Gene Caci_0616 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_0616 
Symbol 
ID8331945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp718584 
End bp720947 
Gene Length2364 bp 
Protein Length787 aa 
Translation table11 
GC content68% 
IMG OID644953768 
ProductMicrobial collagenase 
Protein accessionYP_003111393 
Protein GI256389829 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00460764 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.253983 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCACC GAACACGCCT GCCCGGAATC GCCGCCGCGG TGCTCGCCGC GGTGGCGGTC 
GCCGGGATGA CCGGGGTGGG AGCGGCCTCC GCCGCCCCGC GTCCGGCCGC CGCCTCGGTC
GCCCCGACCG TGAGCCACGC GAACACGAAC ACGAACACGA ACAAGAACGC CACGCCGAAG
GGTTCCGCCG ACCCGGCCAT CGCCGGGAAG TCCGTGGTCG CGAAGACCGG AAAGCCGGTC
GGCGACCTCG CGCCGCGCGC ACCGATCGGC GGCAGCGCCG ACGGAACCAA GACCGGCGTT
GCGCAGACCG GCGCCACGCA AACCAGCGCC GCAAAGCAGT CCCCGGCCGC CAAGACCCCC
GCCAAAACCC CCGCAGCCCA GTCCTGCACC CCCGCGGACT TCGCCTCCCG CTCCGGCAGC
ACCCTCAGCG CCTTCGTCAA GGCATCGACC ACCGACTGCA TCAATACCCT GTTCTCGGTC
ACCGGCTCCA ACGCTTCGGC GATCTTCAGC GAAGCCAAGA TGGAGACCAT CGCAGGGTCC
TACAAGACCG CCGCCGCCAG CTACCCGGGC ACCGACGCGA CGAGCGTCCA GCAGCTCGTG
CTGTTCCTGC GCGCCGGGTA CTACGTGCAG TACAACAGCA ACGGCTCGAT CCCGGCCTAC
GGCCAGGCGC TCGCCACGCT GGTCGAGGGC GGCCTCGACG CCTTCTTCGC CGGCTCGCAC
GCCCTGGACG TCAACGACAA CAACGGCCCG GTCCTCGGTG AGACGATCAT CCTCACCGAC
AGCGCCGACG AACAGGCCCG CTACCTCAGC ACCTACCAGC GGATCCTGAA CGCCTACACC
AGTTCCTACA ACGCCTACTC GACGATGCTG AACGCGGTGA ACGACGTCTA CACGCCGCTG
TTCCGAGGCC ACCAGTTCCC GGCGTTCGTC ACGGCCGTCA CCGCGAACCC GAGCATCATC
GACACCCTGA ACAGCTTCGC GCTGAACCAC AAGAACCTGC TGGGAGGGGA CAACTCCTAC
CTGGACTCCA ACGCCGGGCT GGAGATGAGC CGGTTCGTTC AGCACACGGC GCTGCAGGCC
AAGGTGCGTC CGCTGATGAA GGGTCTGCTC GCGGCCTCGT CGATGACCGG ACCGACGGCG
CCGCTGTGGG TCGCGGTCGC CGGCAACGCC GACTACTACG ACCAGGCGAA CTGCGCCTAC
TACAACGTGT GCGATCTGGC CGCCAAGCTC ACCGCCGCCG TGCTCACCAC GACGACGCAC
TGCGACGCCA GCCACACCGT GCTGTCCCAG GCGCTGACCG CCTCCGACAC CAGCGCCGTG
TGCGCGAGCA TCCTGGGCCA GTACTCGTAC TTCCACACCG TGGTGCACGA CAGCGGCCCG
ATTCCCGGGC AGTACGACCA GAACTTCGTG CTGACCGTGT TCGCCTCGCC CACGGACTAC
CAGACCTACG CCGGACCGAT CTACGGCGTG GACACGGACA ACGGCGGCAT CACCCTGACC
GGGGATCCGA CCGATCCGTC CAACATCGTC CGCTCGATCA TGTACCAGTG GGACACCGAC
AACGGCTTCG TGGCGCGCGT GTGGAACCTG AACCACGAGT TCACCCACGC GCTGGACGCC
GAGTACGACA CCAAGGGCGA CTTCACCGCC GAGATCGTGG TCCCGGACAT CTGGTGGATC
GAGGGCGTCG CGGAGTACGT CTCGTACAGC TACCGCGATG TCACCGACAC CGAGGCGGTG
AGCGAGGCGG CGACCCACCG GTACGCGCTG AGCACCCTGT GGCAGAGCTC GTATGACAAC
AGCGACGAGA CACGCACCTA CCCCTGGGGC TACCTCGCCG TCCGCTACAT GATGGAGCGG
CACCCCGCCG ACATCGCCAC ACTCCTGGCG AAGTTCCGCG TCGGCGATTA CCAGGGCGCG
TACGCCTTCT ACGGCACGAC CATCGGCACG GCGTACGACG CCGACTTCAA TTCCTGGCTC
GACCAGTGCG CGGCCGGCGC CTGCCAGGCC GGCGGCGGAA CCACGCCGCC GCCCCAGAAC
TGCTCCGACC CCGACACCCG GGCGATGGAC CAGAACTGCT CGCGCACCGG CGAGTCGGCG
GCGGCCGGCG CGATCGACTA CTTCTACATC GACATTCCCG CCGGGACGTC GTCGCTGACC
ATCACCACCA CCGGTGGCAG CGGCACGGCG TACCTGCTGT ACAACCCCTC GACGTGGGCG
ACCCCCACCG CGTACACGCA GGGCTCGTTG AACAACGGCA CGACACAGAG CCTGACGATC
ACCGATCCGC CATCCGGCTA CCGGTACATC AGCCTGTACG GGCAGACCGC CTTCAGCGGG
GTGACCATCA CCACGTCCTA CTGA
 
Protein sequence
MRHRTRLPGI AAAVLAAVAV AGMTGVGAAS AAPRPAAASV APTVSHANTN TNTNKNATPK 
GSADPAIAGK SVVAKTGKPV GDLAPRAPIG GSADGTKTGV AQTGATQTSA AKQSPAAKTP
AKTPAAQSCT PADFASRSGS TLSAFVKAST TDCINTLFSV TGSNASAIFS EAKMETIAGS
YKTAAASYPG TDATSVQQLV LFLRAGYYVQ YNSNGSIPAY GQALATLVEG GLDAFFAGSH
ALDVNDNNGP VLGETIILTD SADEQARYLS TYQRILNAYT SSYNAYSTML NAVNDVYTPL
FRGHQFPAFV TAVTANPSII DTLNSFALNH KNLLGGDNSY LDSNAGLEMS RFVQHTALQA
KVRPLMKGLL AASSMTGPTA PLWVAVAGNA DYYDQANCAY YNVCDLAAKL TAAVLTTTTH
CDASHTVLSQ ALTASDTSAV CASILGQYSY FHTVVHDSGP IPGQYDQNFV LTVFASPTDY
QTYAGPIYGV DTDNGGITLT GDPTDPSNIV RSIMYQWDTD NGFVARVWNL NHEFTHALDA
EYDTKGDFTA EIVVPDIWWI EGVAEYVSYS YRDVTDTEAV SEAATHRYAL STLWQSSYDN
SDETRTYPWG YLAVRYMMER HPADIATLLA KFRVGDYQGA YAFYGTTIGT AYDADFNSWL
DQCAAGACQA GGGTTPPPQN CSDPDTRAMD QNCSRTGESA AAGAIDYFYI DIPAGTSSLT
ITTTGGSGTA YLLYNPSTWA TPTAYTQGSL NNGTTQSLTI TDPPSGYRYI SLYGQTAFSG
VTITTSY