Gene Information |
Locus tag | Caci_7969 |
Symbol | |
ID | 8339347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 9245399 |
End bp | 9247066 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644961054 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003118633 |
Protein GI | 256397069 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
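The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not count toward the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 9245399
end_bp = 9247066

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is the stop codon and is not translated.
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values agree with the listed gene length (1668 bp) and protein length (555 aa).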
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCATCC TGGTCGTCCT GGCGCTGATC CTCGTCGAGG CACTGTTCGT CGCCTCGGAG ATCGCGCTCG TTTCGTTGCG CGAGAGCCAG ATCGAGACCT TGGCCCAGCA GGGCCGCCGC GGCCAGGTGG TGGCCAGACT GGTGCGCGAC CAGAACCGCT GGCTGGCCAC CGTCCAGATC GGGGTCACGC TGACCGCGCT GCTGTCCTCG GCCTACGGCG CCATCACGCT GTCGGAGAGC GCCAAGCGAG GGCTGATCAG CGCCGGGCTC GGCACCGGGC TGGCCGGCTT CCTCGGGGTC GTCGGCGTCA CCCTGGTCAT CACCTTCGTG ACCCTGGTGA TCGGCGAGCT GGCGCCCAAG CGGCTGGCGC TGCAGCGGCC GGAGCCCACG GCGCTCGCGG TCGCGCCGTT CCTGAACCGG GTCGCCACGG TCATGCGGCC CTTCATCTTC CTGCTCTCGG TGTGCACCAA CGGCGTGGTG CGGCTGTTCG GCGGCGACCC GAACGTGGGG CGCGAGGCGG TGTCCTCCGA CGAGCTGCGG CTGATGGTGG CCGGCAACGA GACGCTGAAC AGCGACGAGC GCGAGCTGAT CGACGAGATC TTCGACGCCG GCGACCGGCA GCTGCGCGAG GTGCTGGTAC CGCGGACCGA GGTCACGTTC CTGGACGCCG ACCTGCCGAT CCGCCAGGCG GCGCGGATCG CCGCCTCCGA GCCGCACTCG CGCTATCCGG TGATCGAGGG CTCGGCGGAC AACGTCATCG GGTTCGTGCA CGTCCGGGAC TTCCTCAACC CGGAGCTGGC CGGCCGCTCG ATCCGGCTGG AGGAGATCGC CCGGCCGGTG AAGATGCTGC CGACCTCCAA GCAGGTGCTC TCGGCGCTGT CGGAGATGCG CGCGGAATCC ATGCACCTGG CGATCGTCGT CGACGAGTAC GGCGGGACCG CCGGCATCGT CACGCTGGAG GACCTCATCG AGGAGCTGAT CGGCGACATC CAGGACGAGT ACGACGTCGG CCAGGCCGGC ACCACCCGGC TGGTCGGCGG CGTGATGGAG GTCGACGGCC TGCTGAACCT CGACGACTTC ATGGACGAGA CCGGGATCGA GCTGCCCGAC GGGCCGTATG AGACCGCGGC GGGCTTCATC GTGGCCCAGC TCGGCCGGCT TCCCGAGCTC GGCGACGAGG TGGTGGTCGC TCTGGAGGCC CCGCCGACGC GCGGCGGCTC CGACGACGGC CACGAGGACG CCGCCCCGGA GATCCACCGC TACGAGCTGA AGGTGGCGGA GATGGACGGC CGCCGGATCG CGCGGATCCG CATCACACGG CTGCCGGAGG GCGGCGCGGA CGGCACAGAT GGCTCAGACG GCTCTGATAG TTCCGACACT TCCGAGGGCG CCGACAGCTC CGAGATCGCC GAGCCCTCCG AGATCTCCGA CGGCTCCGAC GGTTCCGCTG CGGACAGTCC GGTTCCCGAT GATTCCGGCG CTGACGGTTC CGATGCTGAC AGTTCTGCTG CCGAAGGTTC GGTGAACGGC TCAGTCATCC CCGCCGAACC GTCCCCGGAC ATCACCTCCG GTGATCATTC GGACACTGAT GGTGCCGACC CGGCCGACCA GGTCGGTGCG GGTGAGGAGA TCCCGGGAGA ACCCGATCAA AGCGAGCCGC GGATCTGA
|
Protein sequence | MSILVVLALI LVEALFVASE IALVSLRESQ IETLAQQGRR GQVVARLVRD QNRWLATVQI GVTLTALLSS AYGAITLSES AKRGLISAGL GTGLAGFLGV VGVTLVITFV TLVIGELAPK RLALQRPEPT ALAVAPFLNR VATVMRPFIF LLSVCTNGVV RLFGGDPNVG REAVSSDELR LMVAGNETLN SDERELIDEI FDAGDRQLRE VLVPRTEVTF LDADLPIRQA ARIAASEPHS RYPVIEGSAD NVIGFVHVRD FLNPELAGRS IRLEEIARPV KMLPTSKQVL SALSEMRAES MHLAIVVDEY GGTAGIVTLE DLIEELIGDI QDEYDVGQAG TTRLVGGVME VDGLLNLDDF MDETGIELPD GPYETAAGFI VAQLGRLPEL GDEVVVALEA PPTRGGSDDG HEDAAPEIHR YELKVAEMDG RRIARIRITR LPEGGADGTD GSDGSDSSDT SEGADSSEIA EPSEISDGSD GSAADSPVPD DSGADGSDAD SSAAEGSVNG SVIPAEPSPD ITSGDHSDTD GADPADQVGA GEEIPGEPDQ SEPRI
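The relationship between the two sequences can be illustrated with a minimal translation sketch. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and that an alternative start codon such as GTG is read as methionine when it is the first codon. The function name `translate` and the reduced `START_CODONS` set are hypothetical choices made for this example, not part of the record or of any library.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence listed above.
head = translate("GTGTCCATCC TGGTCGTCCT GGCGCTGATC")
tail = translate("AGCGAGCCGC GGATCTGA")
print(head, tail)
```

The first fragment yields the opening ten residues of the listed protein (MSILVVLALI), and the last fragment, which is in frame because 1650 bases precede it, yields the closing residues (SEPRI) before the TGA stop codon.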