Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_1524 |
Symbol | |
ID | 8332863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 1727831 |
End bp | 1729633 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644954670 |
Product | DNA protecting protein DprA |
Protein accession | YP_003112286 |
Protein GI | 256390722 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0174562 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00583447 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCTCCC TGCTCGCCTT CCCCGACACC CCGAACCCCG ACCTGACGAA CCCTGACCTG ACCTCGAATC CTGATCTGCC GGCGAACCTG GAACCGACAG TGACCCCCGG CCTCACCACG ACCCCAGACC TGTCGGCCAC CGTCAGACTG CCCCGCCACC CCGAAGCCGA GCCGGAGCCT TCCTCGATCG CCGACGCCGC CCCACCACCG CAACCGCAAG TGCAAACGCA AGCTCCAGAA CTCTCGCTCC TCGCCCCCGA CTCGCTCTTC TACGCGAGTC CCAGCGCCCC ATCCCAGAAC GACTCAGCGC CCTCAGATAT CCCGAAGCCC TCGCTCCTCG CCCCGACTTC CCTCTTCTAC GCACCAGCCG ACCGCACCCC GCCGTCGTCC CCTGACACCC CCACCGGCGA GCCGCTCCTC TCGCCGACCT CGCTCTACTA CGCGTCCGCC GACCGCCCGC TGCCCCCGGC ACAGCCGAAC CCTGAGCCGG CCGCCACCGC ACCGCCTACC GCCGAACCGC CCACCGCCGA ACCCCCGACC CCACCCGCGT CACCCACCGG CGGACCGCCA CCTCACCCAC CCACATCTCG CCGTGTGCCC GAACCTGTCC TCGTCACGTC CGAACCCCAC GACCCCACCG CCGCCGACCG CCGCGCACGC CTCGTGCTCC ATCACGCCTC CGAACCCGGC GACGCGCTGC TCTGCCGCCT CGTCGCACGC TTCGGCGCCC CGACCGTCGC CACCGCGCTC CTGTCCGGAA GCGCCGCCGA CCTCCTGTCG GCCCACGACG AGGAGTCGGC CGCGGACCGC GCGGCCCTCA TCAGCCGCGC CCACACGCTC CAGACCCGCT GCTACCGCGC CGATCCCGAC GCTGATCTGG CGGTCAGCGA GAAGACCGGC GCCCGCTACG TCATCCCTGC CGACGACGCG TGGCCGACCG CGCTCGACGA CCTCGGCGAC GCCGCCCCGC TCGGCATGTG GTTTCTGGGC ACCGCCGATC TGGTCGCGGC ATCCAGACGC GGGGTCTCCA TCGTCGGCGC ACGGATCGCC ACCGGCTACG GCATGCACGT CGCGGGCGAA CTCGCCGCGG GCCTGGCCGA ACGCGGCTGG GCGATCATCT CCGGAGCCGC GCGGGGCATC GACGGCGCGG CGCACCGTGG AGCGCTGGCG GCCCGGGGCG TCACCGTCGC GGCGCTCGCC TGCGGCATCG ACCTCGTCTA CCCCGCAGGT CACGAGGCAC TGATCGGTGC CATCGCCTCC GAAGGCCTCG TGCTCTCCGA ACTTCCGCCG GGTACGGCCG TCAGTCGCTT CCGATTTCTG GATCGCAACC GTGTCATCGC CGCGCTCGGG CTCGGAACCG TCGTCGTCGA AGCTGCGGCC CGCAGCGGCT CGCTCGTCAC CGCGCGCCTC GCCGACAGCC TGGGGCGTCC CGTGCTCGCG GTCCCGGGAC CTGTCACGTC TGAGGCCTCC GAAGGCACCC ACCAGCTGAT CCGCGACGGC GCGCTCCTGG TCACGCGCGC CGCCGAAGTG GTCGAACACC TCGGCGACCT CGGCGCAGAC CTGGCTGAGC CCACCACCAC CGCCCGCACC AGCTCCCGCC GTCCCCGCGA CTCCCTCGAC CCGATCGCGG CGCGCGTCCT CGACGCCCTC CCGATCGCCG GCCGAGGCAC GCTCGACACG GTCGAAGCCG CCCTGGCCGC CGGTCTCGAA CCCCGCGCCG TGCACGCAGC CCTCGGCCGC CTCACCACAG CCGGCTGGGT GGATCGCGAC GAGCGGGGGT GGTGTGCTCG CCTGCTTCCT TGA
|
Protein sequence | MTSLLAFPDT PNPDLTNPDL TSNPDLPANL EPTVTPGLTT TPDLSATVRL PRHPEAEPEP SSIADAAPPP QPQVQTQAPE LSLLAPDSLF YASPSAPSQN DSAPSDIPKP SLLAPTSLFY APADRTPPSS PDTPTGEPLL SPTSLYYASA DRPLPPAQPN PEPAATAPPT AEPPTAEPPT PPASPTGGPP PHPPTSRRVP EPVLVTSEPH DPTAADRRAR LVLHHASEPG DALLCRLVAR FGAPTVATAL LSGSAADLLS AHDEESAADR AALISRAHTL QTRCYRADPD ADLAVSEKTG ARYVIPADDA WPTALDDLGD AAPLGMWFLG TADLVAASRR GVSIVGARIA TGYGMHVAGE LAAGLAERGW AIISGAARGI DGAAHRGALA ARGVTVAALA CGIDLVYPAG HEALIGAIAS EGLVLSELPP GTAVSRFRFL DRNRVIAALG LGTVVVEAAA RSGSLVTARL ADSLGRPVLA VPGPVTSEAS EGTHQLIRDG ALLVTRAAEV VEHLGDLGAD LAEPTTTART SSRRPRDSLD PIAARVLDAL PIAGRGTLDT VEAALAAGLE PRAVHAALGR LTTAGWVDRD ERGWCARLLP
|
| |