Gene Caci_3300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3300 
Symbol 
ID8334653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3641182 
End bp3643170 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content69% 
IMG OID644956445 
Productputative secreted protein 
Protein accessionYP_003114048 
Protein GI256392484 
COG category[R] General function prediction only 
COG ID[COG1409] Predicted phosphohydrolases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCACG AGCATGAAGA CCCGCACTGC CAGTGCCCTC GCACCGCCGA CTTCGACTCC 
GCCGGTCCTG CCGCGACCGG TCGGCGCGCG TTCCTGCGCA ACGCCGGGTT GGTGGGTGCG
GGGGCGACCG CGCTGGCTTC CGGCACCACG GTGTTCGGGC CGGCGGCGTC TGCTGCCGCG
TCTGCTGCCG CCGCCGATGC CAAGCACGGC GGGGCGAGCA TCTACGACGT CACCGCCATC
GACACCGACG CCCCGGGCGC CCGCTGGGAT CCGGACCCGG ACAACCCGGT GTTCACGCTG
GTCGTGATGC CGGACACGCA GTACATGTTC GACGAGGACC GGATCCACCC GGCGCCGATG
GAGGCTTCGT TCCGCTACAT CCTCAGCGGG GACCAGGACG CGAACGTGGT GTTCATGGCG
CACCTCGGCG ACCTGACGCA GAACGGTGAG GCGCGGGAGT TCGCCGCGGC TGGCGCGGTC
TTCGAGATGC TGGACCGCAA GAAGGTCGCC TACAGCGTCG TCGCCGGCAA CCACGACGTG
TCCGGGGACG ACCAGCGCGG CGCCACGCCG TATTCGGGCA CGTTCACCAA GGCGCGCGCG
GCGAAGTCCC CGAGCTTCGG CGGCGCGAGC CCCGACGGGT ACAACACCTT CCACATCTTC
CGCGGCGGCG GTCGCGACTG GCTGCTGCTC GGTCTGGACT GGCGGCTCTC GCCGGCCGGG
TTCGCCTGGG CCAACCAGGT CATCGCCGAC CATCCGACGC TGCCGGTGAT CGTCACCACG
CACGAGGTCG CCTATGCCGA CGACTCCGGG ACCGCGTACC TGTCCGACTA CGGGCAGCAG
CTGTGGGACG GGCTGATCAA GAACCACGAC CAGGTGTTCC TGAGCCTGAA CGGGCATTAC
TGGCCGCCCG GGTCCACGAC GCTGAGCAAC GCCGCGGGCC ACGATGTGCA CGTCCACATC
ACCAACTATC AGGACCGTTA CTACGGCGGC GCGGCGATGA TCCGTACGTA CCGCTTCGAC
ATGCGGCGCA ACACCGTGGA CGTGTCGACG TTCTCGCCGT TCATCCGCGG CATGGCCACC
AGCCAGGTGA ACGAGCTGGC CGGGCAGGAG ATGGAGCTCA CGTCGGCGGT CGACCGCTTC
TCGATGGCGA TCGACTTCGA CCAGCGTTTC GGCGGCTTCG CGCCGGTCGC GGCACGTCCG
GCGCGTCCGG CGGCCAAGAT GCTGACCCGC GACACGCTGG CCTACTGGCG CTTCGACGGC
GGCGGCGCGG ACGGCTCGGC GCTCGGCGAC AGCCAGATCA TCCGCGACCT GTCGGGCAAG
GGCAACGACC TGGTGAAGGA GAACGTCCCC GGCACCACCG GCACGCCGGT GAGCTGGTCC
ACCACCGAGT TCCACCCCGA CCAGCCGGCT CACGGCTCGC TGCGCTTCAC CGGCCAGGGC
CACCCGACGC GCGGCGCCTG GCTGCAGACC GTCCCGAACG CGCCGCTGAA CGCCGAGACG
TTCCAGCACG GCTACACCTT CGAGATGTTC TTCAAGCTGC CCGCGGATTG GGACTCCTCG
CAGAGCGCGT GGAGCGGCCT GCTCAGCCGC TGGGGCATGA GCAGCGAGGC GGGCAAGTCC
GGCGGCAACA CCGACCCGCA GGAGCCGATC GCCACCCTGA GCCTGTCCGG CGGCTCCGAG
TTGCAGTGGA ACGTCTACCC GCTGAACCAG CCCGGCGCGT CGACCGCCTG GAGCCACCTG
CTGCCCCTCG GGCAGTGGTG GCACGTCGCG ATCGTCAACG ACGGCAAGGT GAACCGCATG
TACGTCAACG GCTGCGAAGA GGGCCGCAAC CCCTCGACCC CCGCCATCGG CCTGACCACC
CTGAACCACT CCTGGCTCCT CGGCGGCTAC GAATACGCCG GCGCTATCAA CCAGATCCAC
AACGGCTGGA TCGGCGATGT CCGCATCACC GCCCGGCCGC TCCGCATCGA CGAGTTCATG
AACGCCTGA
 
Protein sequence
MNHEHEDPHC QCPRTADFDS AGPAATGRRA FLRNAGLVGA GATALASGTT VFGPAASAAA 
SAAAADAKHG GASIYDVTAI DTDAPGARWD PDPDNPVFTL VVMPDTQYMF DEDRIHPAPM
EASFRYILSG DQDANVVFMA HLGDLTQNGE AREFAAAGAV FEMLDRKKVA YSVVAGNHDV
SGDDQRGATP YSGTFTKARA AKSPSFGGAS PDGYNTFHIF RGGGRDWLLL GLDWRLSPAG
FAWANQVIAD HPTLPVIVTT HEVAYADDSG TAYLSDYGQQ LWDGLIKNHD QVFLSLNGHY
WPPGSTTLSN AAGHDVHVHI TNYQDRYYGG AAMIRTYRFD MRRNTVDVST FSPFIRGMAT
SQVNELAGQE MELTSAVDRF SMAIDFDQRF GGFAPVAARP ARPAAKMLTR DTLAYWRFDG
GGADGSALGD SQIIRDLSGK GNDLVKENVP GTTGTPVSWS TTEFHPDQPA HGSLRFTGQG
HPTRGAWLQT VPNAPLNAET FQHGYTFEMF FKLPADWDSS QSAWSGLLSR WGMSSEAGKS
GGNTDPQEPI ATLSLSGGSE LQWNVYPLNQ PGASTAWSHL LPLGQWWHVA IVNDGKVNRM
YVNGCEEGRN PSTPAIGLTT LNHSWLLGGY EYAGAINQIH NGWIGDVRIT ARPLRIDEFM
NA