Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3834 |
Symbol | |
ID | 8335187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4341613 |
End bp | 4342812 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644956971 |
Product | protein of unknown function DUF58 |
Protein accession | YP_003114574 |
Protein GI | 256393010 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00386989 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0608669 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCCGAT CGGGTGTGGC CGTCGCCTTC GCCGCCGCAC TGCTCGGAGT GCTCGGGGCG GTGAGCCGGT ACCGGGAACT GCTGCTCCTG GCCATCGGCT GCGCGACAAC ACTGGCGATC GCGATCGCTT GGGCTGCCTC GAAGAGAACC AACCTGGTCG CGACCAGTGA GTACGCACCC GCACGGCCCG AGGACGGGCA GCTGGTCGAG GCAACCGTGC ACGTACGGAA CCACGGGCGG CGTACCAGCC GGCCGATGGT CGCCGTGGAG CAGGTGGGCG CCGACGCCTA CGGACTGGAA ATCCCGGAAC TGGCTGCCGC ACAGAAACAC GACGGGACCT ACACGTTCGT CGCACCGCGA CGCGGACAGC TGACGGTCAG CCGCGCGCCG GCAGCGAACA CCGATCCGAT CGGCTTGGTC CGCAGAACCG AACTGGACGG TCAGGACACG CAGATCCGTG TGTATCCGCG CTGGCACAGC GGAATCGCGC CGATCCTGGG CCCGGACGCG CGCGTCGGCC GCGGCACGGT CGGGGTACCC CGCGGCGAGT ACGACTTCCA CTCGCTGCGC GACTACGAGC CCGGTGATCC GCTACGGCTC ATCCACTGGC GCGCCACGGC GAAGCGCGGG GAGCCGCTGG TGCGGCGCCT GGAGGTGCCG GACGAGGCCG AACAGCTGAT CGTGCTGGAC AACAGCGCAC TCTCGTTGAA CGCAGAGGAT TTCGAGCACG CGGTGCGGGT CGCCGCCTCG CTGGCGGTCG CCGCGCGGCG AGCGGGCCTG GCCTTGGAAC TGCGCACCGT GTGCGGGCCC GCGGTCGCGC GGCTGCGGCG CACGGGCCGG TCCGCCAGCG CCACCGCCGC GATGGAGCTC CTGTGCGACG TCGAGCAGAT GCCGTTGAAA CAGGGCCCTG ATCTGGCTGC CGTGCTGGCC GGCTTGGGAC GCGGCCGGGC AGACGTCGCG CGACGCGGCG CGGCGTTCCG ATCGGAGAAC GCCGGCCCGG TCGTACTCGG CGTGGTGACC GGCTTTCTCA GTACCCGTAC AGCGACGGCA CTGAGCCGGG CCCGGCAGAG GTTTGAGGCC GCCTATGTCG TCCAGGTCGG CGAAAAGGTA CCTGTGACCC GTGTCAAGGA TGTCGAATGC GTTCGCATCA AGACCAGTGA GGACCTTGTG GGGCAATGGA AGCGCCTGGT ACGCGGCTGA
|
Protein sequence | MTRSGVAVAF AAALLGVLGA VSRYRELLLL AIGCATTLAI AIAWAASKRT NLVATSEYAP ARPEDGQLVE ATVHVRNHGR RTSRPMVAVE QVGADAYGLE IPELAAAQKH DGTYTFVAPR RGQLTVSRAP AANTDPIGLV RRTELDGQDT QIRVYPRWHS GIAPILGPDA RVGRGTVGVP RGEYDFHSLR DYEPGDPLRL IHWRATAKRG EPLVRRLEVP DEAEQLIVLD NSALSLNAED FEHAVRVAAS LAVAARRAGL ALELRTVCGP AVARLRRTGR SASATAAMEL LCDVEQMPLK QGPDLAAVLA GLGRGRADVA RRGAAFRSEN AGPVVLGVVT GFLSTRTATA LSRARQRFEA AYVVQVGEKV PVTRVKDVEC VRIKTSEDLV GQWKRLVRG
|
| |