Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5779 |
Symbol | |
ID | 8337140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 6677676 |
End bp | 6679082 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644958883 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003116478 |
Protein GI | 256394914 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGACCC TCCTGGGCCT GCTGGCCATC GCGGTGCTCA CCGCCGCCAC CGGCTATTTC GTGGCGCAGG AGTTCGCCTA CATCGCCGCG GACCGGGGAC GGTTGCGGCA GCTCGCCGAG GACGGCGACG CCGCCGCCGA GCGCGCCTTC GAGGTCACCG GTCGGCTGTC GTTCATGCTG TCCGGGGCGC AGCTCGGGAT CACCGTGACC GCGCTGCTGG TCGGCTACGT CGCCCAGCCG CTGCTCGGCT CCGGCCTGGC CGATCTGCTG GGGTTCACCG GCTGGTCGCA CGACGCGCGG CTGTCGCTGT CGGTCGTGGT GGCGCTGGCC GTGGCGACCG TGGTGCAGAT GGTGGTCGGC GAGTTGCTGC CGAAGAACCT GGCGATCGCC AAGCCGATCG AGGCGGCCAA GGCACTCGGC GGCTCCACCC TCCTGTATTT GAAGGTGGTC GGTCCGGTCA TCCGGCTGTT CGACGGCGCC GCCGTCCGGC TGGTCCGCGC CGTCGGCATC GAGCCGGTCG AGGAGCTGCC GCAGGGCGCC AGCGAGGAGG ACCTGCAGCA CATCATCTCC GAGTCGCACA CGCAGGGCTT GCTGGACACC GAGCTGTCCG AGCTGTTGGA CCGCGCGCTG GACTTCCGAG GGCTGACAGC CGGGCAGGCC ATGACGCCGC GGGTGAAGGT GCACACCGTG TCGGCCGAAG CGCCGGTCTC CCTGGTGGTG GAGATGCTGA TCACCGGCAA CGCCCGGTTC CCGGTGACCG GCCATGACAT AGACGATCTG ATCGGTGTCG CCGGACTGAC CGAGGTCCTG GCGGTGCCGG CGGCACTGCG GGCCACGACG CCGGTCCGGG ACGCGTGCGC GCCGGCGCTG CTGGTACCGG AGCACCTTCC GCTCCCCGAG TTGCTGGAGC GGCTGCGCTC CGAGCACCGG CAGCTGGCGT GCGTCATCGA CGAGTTCGGC GGCTTCGCCG GGGTGGTGAC GTTGGAAGAC GTCACCGAGG AGCTGGTCGG CGACATCTGG GACGAGGACG ACCTGGACGA CGAGGTGGTC CGGCGACAGC CAGACGGCGC CTGGAGCGTC CCGGCACGGA TGCGGATCGA CGAGGCCGCC GACGCCACCG GGATCCCGCT GCCGGAAGGC GAGCACTACA CGACGGTCTC CGGCCTGGTG CTGGACCGCC TCGGCCGCAC CGCGCGCATC GGCGACGAGG TGGAGCTGGC GGTCCGCGCG CCGTACACGC AGGACGGGCC CGGGATGCTG TCGGTGCTGA TCCACATCGC GGCGGTCAGC CGGCAGGTAC CGGCGACCGT GCTGATCACG ATGGACACCG AAGACCACGC CGAAGACTCC GAACACAGCG CAGACCCCGA ACACAGTGCC GCCCCCGACG CCCGGGAGGC TTCGTGA
|
Protein sequence | MLTLLGLLAI AVLTAATGYF VAQEFAYIAA DRGRLRQLAE DGDAAAERAF EVTGRLSFML SGAQLGITVT ALLVGYVAQP LLGSGLADLL GFTGWSHDAR LSLSVVVALA VATVVQMVVG ELLPKNLAIA KPIEAAKALG GSTLLYLKVV GPVIRLFDGA AVRLVRAVGI EPVEELPQGA SEEDLQHIIS ESHTQGLLDT ELSELLDRAL DFRGLTAGQA MTPRVKVHTV SAEAPVSLVV EMLITGNARF PVTGHDIDDL IGVAGLTEVL AVPAALRATT PVRDACAPAL LVPEHLPLPE LLERLRSEHR QLACVIDEFG GFAGVVTLED VTEELVGDIW DEDDLDDEVV RRQPDGAWSV PARMRIDEAA DATGIPLPEG EHYTTVSGLV LDRLGRTARI GDEVELAVRA PYTQDGPGML SVLIHIAAVS RQVPATVLIT MDTEDHAEDS EHSADPEHSA APDAREAS
|
| |