Gene Information |
Locus tag | Caci_3691 |
Symbol | |
ID | 8335044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4135359 |
End bp | 4137014 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644956831 |
Product | hypothetical protein |
Protein accession | YP_003114434 |
Protein GI | 256392870 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03605] SagB-type dehydrogenase domain |
|
|
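The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table above.
start_bp = 4135359
end_bp = 4137014
gene_length_bp = 1656
protein_length_aa = 551


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1656
print(expected_protein_length(gene_length_bp))  # 551
```

Under these assumptions both results agree with the Gene Length (1656 bp) and Protein Length (551 aa) rows.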
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.392558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATTCG CCCACGAGTA CGCCACCGCC GTGGCCTGGC GCGGCAGGGT CCTGATGGAA CCCGCCGACT TCGTCCCGAA CTGGGCGGAC AAGCCACGGC GCGCCAAGTA CTACCCCGGC GCGCTCGGTT TTCCGCTGCC GGACACCGAG GACGAGGCCG CGGCCAGCGT CCAGAAGGGA CTGTTCGATC CGGCCGGTTC GCAGCCCTTC ACCCTGAGCC TGCTCGGCGG CATGCTGCGC GACTCCTACG GACTGATCGG CCGCCGGCTC GGCGTGCAGG CGAACACCGA CCTGGCCGCG CTGCCGTCGT ACAAGGACGC GAACTGGTCG CGGGGAACCG CCTCGGGCGG CGGCCTGTAC CCGATCGGCG TCTACTGGTT GTCCGGTCCC TCCGGGCCGC TTCTCCCCGG CGCCTACCAC TACTCGCCCG GCCATCACGC GATGCAGCGG CTCGTCGTGG GCGACCCGAC CGGCGAGGTG CGGGCCGCGG TTGGCGACGA GGCGCTCACC GCGGACACCG ACCAGTTCCT CGTCCTGGGC ATCAAGTTCT GGCAGAACGC CTTCAAATAC AACAGCTTCT CCTACCACGC GGTGACGATG GACGTCGGCA CGGTGCTGCA GACGTGGCGG ATGTGGGCGG GAGCCAGAGG CCTGCGGATC GACCCGCTCC TGTGGTTCGA CGAGCAGCGC CTGAGCCGCC TGCTCGGGGT GTCGACCGAG GACGAAGGGC TGTTCGCGGT GGTTCCCGTG CGGTGGGACG CGCCGTCGGC GCCCACCGCC GAGCCGGCGA CCGAGCGGCT GACTGAGCCG CCGAACGAGC GGCCGACCGA GCCGCCGATC CAGGTGCGGC GCACCGACCA GGAGCGTTCT CGCACCGTCC TGACCTTCGA CACCATCCGC CGGGTTCACG CCGCCACCAT CGAGCACGCG ACGCAGCGCC CCGACCGCCT GGCCCTGGAA GCGGCCAGGG CTCACGCGCC CGATGAGCGG CGCGAGGCTG CGACGCTGCC CGAGCCGCGT CCGCTCCAAG CCACCGTCCG CGCGGCTCTG CACGCCAGGC GCAGCAGCTT CGGACGGTTC TCCGCGCAGC GGACGATCGC TGCGGACCAG CTCTCCGCCG TGCTCGCAGC CGCTGCCGCC GGTGCCGCGC TGGAATGCGA CGTGACGAAG CCAGGAGGCG CCGAGCTGGT CAAGCTCTAC GCGTTCGTCT CCCACGTCGA CCAGATCGCC CCGGCGAGTT ACGAGTACGA CCCGCAGGAA GGTGCGCTGC GGATGGTCAA GCCGGGCGCG CCCGGCTCGT TCCTCCAGCG CAACTACTTC CTCGCCAACT ACAACCTGGA ACAGGCCGCG GCCGTCCTGG TCCCCTCGGT GCGCACGCAC GCCGTGCTCG ACGCGGTCGG CGACCGCGGC ATCCGGCTGG TGAACGCGCT GGTCGGGGCG GTGGCGCAGG CGGTGTACAC CGCGAGCGCG GCGGCCGGCA TCGCCTGCGG CGTCGCCCTC GGCTTCGACA CCATCTCCTA CATCGAAGAA CTCGATCTCC ACCAGGCCGG CGAGATCCCC TTGCTGACCA TGATGATCGG CGCCGAGCGG CCGCGGCCGG CGGACTTCCG CCACGATTTC GGCCCGCTCG GCCCTGTCCC GGGGAGCGTG CGGTGA
|
Protein sequence | MGFAHEYATA VAWRGRVLME PADFVPNWAD KPRRAKYYPG ALGFPLPDTE DEAAASVQKG LFDPAGSQPF TLSLLGGMLR DSYGLIGRRL GVQANTDLAA LPSYKDANWS RGTASGGGLY PIGVYWLSGP SGPLLPGAYH YSPGHHAMQR LVVGDPTGEV RAAVGDEALT ADTDQFLVLG IKFWQNAFKY NSFSYHAVTM DVGTVLQTWR MWAGARGLRI DPLLWFDEQR LSRLLGVSTE DEGLFAVVPV RWDAPSAPTA EPATERLTEP PNERPTEPPI QVRRTDQERS RTVLTFDTIR RVHAATIEHA TQRPDRLALE AARAHAPDER REAATLPEPR PLQATVRAAL HARRSSFGRF SAQRTIAADQ LSAVLAAAAA GAALECDVTK PGGAELVKLY AFVSHVDQIA PASYEYDPQE GALRMVKPGA PGSFLQRNYF LANYNLEQAA AVLVPSVRTH AVLDAVGDRG IRLVNALVGA VAQAVYTASA AAGIACGVAL GFDTISYIEE LDLHQAGEIP LLTMMIGAER PRPADFRHDF GPLGPVPGSV R
|
| |
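The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch only, using the Python standard library. It assumes the sequence is pasted as space-separated blocks of 10 characters, as displayed above, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in the set of permitted start codons, which does not matter here because this gene starts with ATG). The helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence shown above.
prefix = "ATGGGATTCG CCCACGAGTA CGCCACCGCC"
print(translate(prefix))  # MGFAHEYATA
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content rows to be checked in the same way.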