Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5044 |
Symbol | |
ID | 8336398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5788836 |
End bp | 5791145 |
Gene Length | 2310 bp |
Protein Length | 769 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644958143 |
Product | hypothetical protein |
Protein accession | YP_003115745 |
Protein GI | 256394181 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCCGAC GACATCAGCG CACGGCGACT CTGGTCGCCC TCGGCGCGGC CTTCTTCGCC AGCGCGATCG GCAGTGCGTC CGCCATGGCG GCGACGCCCC ATGCCGCCGC GCCGCAGGCC GCCACGCAGA GCGTCAGCTA TCTGGGCCAC CAGTTCACCG TCCCGGCAAG CTGGCCGGTC ATCGACCTGG CGAAGGCGCC GACCACCTGT GTCCGCTTCG ATGAGCACGC CGTCTACCTC GGCCAGCCGG GTGCGCAGCA GGACTGCCCC AGCAAGGTCT TCGGACGGAC CGAGACCCTG CTGATCCAGC CCGCCGCCGC CTCCACGGCA GCGGCCATGA CCACCGACAA CTCCGCCACC CGCGAGCTCG ACACGACCGG CGACGGCTTC AAGGTCAGCG CCACCTACAA CACCGACCGC GCGCTGGCCC AGTCCATCCT GACCAGCGCC GCGCTCCCGG CACCGTCCGC CACGGCGCAC ATACCGACGC CGGGCACGGT GACCGCGCCG ACGTCCACCG CGCCGACGAG CAAGGCAGGT CAGGCGAGCA CGTCCACGCA GTCCGCACAC TCACTGGCTA CCGCCGCCGT CGCGGCCAGC AGCACCAACT TCACCGGCCA AGGCTTCGAC GCCTGCGCCG CGCCGAGCTC GTCGGCGATG AGCGCGTGGA AGAGTTCCTC GCCCTACTCC GCCGTCGGCA TCTACATCGG CGGGGCGAAC CGGGGCTGCG CGCAGCCGAA CCTCACCTCC ACCTGGGTCT CCGACGAGGC GGCGGCCGGC TGGCGCTTCC TGCCGATCTA CGTCGGCCTG CAGGGCCCTG GCAACGGCTG CGGGTGCGCG GCCATCAACT CCGCGAGCGA GGGCACCGCC GCCGCGGACG ACGCCATCAA CGACGCCGTC TCCCTCGGCT TCCCGGCCGG CACCGAGATC ACCTACGACA TGGAGGCCTA CACCACCGGC GGCTCCTACT CCTCGCTGGT GGTCGGCTTC GAAGCCGCCT GGTCCGCCGA GCTGCACGCC CACGGCTACC TGTCCGGCGT CTACGGCAGC ATGGGGAGCA CGGTGTCGGA CCTGATCAAC AACTACAGCT CCACCACCAT GCCGGACGTC CTGGACTTCG CCAGCATCCC CGGCAGCGGC AGCAGCACCG TCTCCGACCC CGGCATCCCC AGCGCCGACT GGGCCAACCA CCAGCGCATC CACCAGTACA CCCAGGGCCA CGACGAGACC TGGGGCGGCG TGGACATCCC CATCGACGCC GACTACTTCG ACGTCCAGGT GTCCTCCAGC GCCCCACCGC CGAGCGCTCC GCACAGCAGC GCCTCGGGAC TGGCCGTCGC CTCCAACGGC GGGTTCAACA CCGCTTGGAA GGGGACTGAC GGCTACCAGT GGGTGGCCAA CGGCAGCGGC GCGGGCATCT CGGCCAAGGG CAACCCGTTC CTGCTCGGCG TCGCGGCGAA CACGACTCCG TCGATGGCGA CGCTGTCCGA CGGTTCATGG ATCTCGGCGT GGCAGGGCAG TGACGGCTAC CTGTGGCTGG CCACCGGCTC CGGAGCGAAC ATCTCGGCCA AGGGCAACCC GTTCCTGCTC GGCGTCGCCG CCGGCACCAG CCCGTCGATC GTCGCGCTGC CCAACGGCGG CTGGGAGATC GCGTGGAAGG GTCAGGACGG CTACCTGTGG CTGGCCACCG GCTCCGGCAT CAACATCTCC GCCAAGGGCA ACCCGTTCCT GCTCGGCGTG TCCGGCACCA CCAGCCCGTC TCTGGCGGCT CTGCCCAACG GCGGGTTCGA AGCGGCGTGG AAGGGCGGGG ACGGCTACCT GTGGCTCGCT TCCGGCTCCG GTATCACCAT CACGGCTAAG GGCAACCCGT TCCTGCTCGG CGTCGTCAAC AACCCGGCGC TGGTGACCAT GCCCGACGGC AGCTTCGAGG CGGCTTGGAA GGGCGGCGAC GGGTACCTGT GGCTCGCCTC CGGCTCCGGC GCCACGATCA CCGCCAAGGG CAACCCGTTC CTGCTCGGCG TCTCCGGCGA CACCAGCCCG TCGATCGCGG CCCTGCCCAG CGGCGGCTTC GAGACGGCGT GGAAGGGTAA CGACGGCTAC TTGTGGCTGG CCACCGGCAA CGGTGCGAAC ATCACGGCCA AGGGCAACCC GTTCCTGCTC GGCGTGGCGA ACAACCCCGA GCTCGTGACC AAGTCTGACG GCAGCTTCGA AGCGGCGTGG AAGGGCGGCG ACGGCTACCT GTGGCTCGCC TCCGGCTCCG GAATCAACAT CTCCGCCAAG GGCAACCCGT TCCTGCTCGG CGTCGCGTAA
|
Protein sequence | MIRRHQRTAT LVALGAAFFA SAIGSASAMA ATPHAAAPQA ATQSVSYLGH QFTVPASWPV IDLAKAPTTC VRFDEHAVYL GQPGAQQDCP SKVFGRTETL LIQPAAASTA AAMTTDNSAT RELDTTGDGF KVSATYNTDR ALAQSILTSA ALPAPSATAH IPTPGTVTAP TSTAPTSKAG QASTSTQSAH SLATAAVAAS STNFTGQGFD ACAAPSSSAM SAWKSSSPYS AVGIYIGGAN RGCAQPNLTS TWVSDEAAAG WRFLPIYVGL QGPGNGCGCA AINSASEGTA AADDAINDAV SLGFPAGTEI TYDMEAYTTG GSYSSLVVGF EAAWSAELHA HGYLSGVYGS MGSTVSDLIN NYSSTTMPDV LDFASIPGSG SSTVSDPGIP SADWANHQRI HQYTQGHDET WGGVDIPIDA DYFDVQVSSS APPPSAPHSS ASGLAVASNG GFNTAWKGTD GYQWVANGSG AGISAKGNPF LLGVAANTTP SMATLSDGSW ISAWQGSDGY LWLATGSGAN ISAKGNPFLL GVAAGTSPSI VALPNGGWEI AWKGQDGYLW LATGSGINIS AKGNPFLLGV SGTTSPSLAA LPNGGFEAAW KGGDGYLWLA SGSGITITAK GNPFLLGVVN NPALVTMPDG SFEAAWKGGD GYLWLASGSG ATITAKGNPF LLGVSGDTSP SIAALPSGGF ETAWKGNDGY LWLATGNGAN ITAKGNPFLL GVANNPELVT KSDGSFEAAW KGGDGYLWLA SGSGINISAK GNPFLLGVA
|
| |