Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4620 |
Symbol | |
ID | 8335974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5257352 |
End bp | 5258899 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644957720 |
Product | hypothetical protein |
Protein accession | YP_003115322 |
Protein GI | 256393758 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.456962 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00062706 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGCGCG ACTGGACCAG CTTCGACTGG CAGACCGCCT CCGTCACCCA GGGCGACCTG ATCCCCGGAG ACCCCTACCA GGTGGCCGCG CTCGGCAAGC GCTTCCGCGA CACCGCCGAC GCCATCAACA AGCAGGCTGC GAACCTGCGC AACCTGGTCG ACGGCCAGGG CTGGGACGCC GACTCCGGCC GCGCCTTCGC CAAGAAGGTC GGCGATACCG CGGACCTGCT GCAGAAGGCC TTCGGCCGGT ACAACGAGGC CGCCGAAGCC CTGGGCAGCA CCGTGCGCCC GTCGGGACCG CACGACGAGG TCTCCTGGCG CTCCGGCACC GAATGGGCCT CCACCCTGGA GCTGGCGCAG GTGAAGGCTC AGGACGCGCT GAACCGCGGC CGTGCCGCCG ACGCCGACTC CCAGTCCGCG GCGCGCCAGC TCAGCACCGC GCAGGGCAAC GCGCTGCCCA AGCCGGTCGC CGGCGCCCCG GCGACGCCGA ACACGCACCC GGACCCCACC AACACCCCCG GTGCCAACGG CCTGACGCCC GCGGAGCAGA AGGCGCACAA CGACAAGGCC GCCGCCGACG CCGTGGTGAA CAAGGCCGGC GCCGACATCC AGCACGCCAT CGACATCCGC GACAGGGAAG GCAAAGCCGT CGCCGGCGCC ATCAACGACT TCATCAACGG CGACGGCCTG AAGAACCCCA CCCACCACTG GTGGGACGTG GACTGGAAGG ACCTGGTCGC CGACATCGGC CACATCGCCG GCGCGATCGC CGGCGTGTGC GGGATCCTGG CGCTGGCGCT GGCCTGGGTG CCGATCCTCG GCGAGGTGCT GGGCGCGATC GCCCTGATCG CCGGTGCCGT GGCGTTGATC AGCGACACCA TCTCCGCCCT GGACGGCAAA GGCAACTGGT TCGACGTCGC CATCGACGTC GTCGGCCTGC TGTCCTGCGG CGCCGGCCGG ATGCTGGGCA CCGCGGCGAA GCTGTCCAAG GGTGCTGAGG CGTTCAACGC CGCGCGCGCC GGCGGCAAGG GCATCAGCGA GGCGCTGGAG CTCTCGGACA TGTCCGCCAA GGACGTGCTG GCGCTGAAGA ACGGCAACAG CGTGTTCAAG GTCGCGCGCT CGGAGTTCGG CAAGGGTCTG ACCACCGGGC CGTTCAAGGA CGTCCTGGGC AAGGTCGGGG ACCTGAAGGC CGGGAACCTG AAGTTCGAGG CGCCGAGCCT GACCAGCTTC GGGCCGAACT TCGCCAAGGA AGCCGGCTGG AAGTTCCACC TCGCCGGCTG GTCCAACAGC ACCTTCCCGC TGGGTCTGGG GCTGGCGAAC CTGCAGATCC CGGAGAACAT GAAGTCCTGG GAGCCGGGGT TCATGAACGT CAACCTGTTC AAGAGCGACC AGATCCCCAG CTGGGTGCCC GGTGTCGGCG GCGACCACGC GGGCGTGGGC TGGCTGCACG CCGGCGACTG GAACGCCACC AACTCCGGCA TGGAGAACTA CGACGCCCAG CCGCTGCGTC CCATCGCCTC CGCTGTGGGC GCCGACCCGG GCAACTGA
|
Protein sequence | MSRDWTSFDW QTASVTQGDL IPGDPYQVAA LGKRFRDTAD AINKQAANLR NLVDGQGWDA DSGRAFAKKV GDTADLLQKA FGRYNEAAEA LGSTVRPSGP HDEVSWRSGT EWASTLELAQ VKAQDALNRG RAADADSQSA ARQLSTAQGN ALPKPVAGAP ATPNTHPDPT NTPGANGLTP AEQKAHNDKA AADAVVNKAG ADIQHAIDIR DREGKAVAGA INDFINGDGL KNPTHHWWDV DWKDLVADIG HIAGAIAGVC GILALALAWV PILGEVLGAI ALIAGAVALI SDTISALDGK GNWFDVAIDV VGLLSCGAGR MLGTAAKLSK GAEAFNAARA GGKGISEALE LSDMSAKDVL ALKNGNSVFK VARSEFGKGL TTGPFKDVLG KVGDLKAGNL KFEAPSLTSF GPNFAKEAGW KFHLAGWSNS TFPLGLGLAN LQIPENMKSW EPGFMNVNLF KSDQIPSWVP GVGGDHAGVG WLHAGDWNAT NSGMENYDAQ PLRPIASAVG ADPGN
|
| |