Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4428 |
Symbol | |
ID | 8335782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5033074 |
End bp | 5034696 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644957531 |
Product | protein of unknown function DUF187 |
Protein accession | YP_003115133 |
Protein GI | 256393569 |
COG category | [S] Function unknown |
COG ID | [COG1649] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.998711 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0596861 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAGGG TTCCCAGCCG CAGGAGTTTC CTGGCGACGT CGGGCGGACT CACGGCCGCG TTGGCCGGCG GCGCGATGCT GATCGGCAGT TCGCCGGCGG CAGCCGCCGC GCCGGCTGGG AACCCGTACC CCGCTCCGTG CGCCGGCGAT CCCGCGCACC CCAAGCGCCA GCTGCGGGGC GCGTGGATCG CGAGCGTGTC GAACATCAAC TGGCCCTCGG CTCCCGGATT GACCGCCGAG CAACAGCAAA CCGAGCTGAC CGGCCTGCTG GACGCCGTCG TCCGGATGCG CATGAACGCC GTGTATTTGC AGATCCGTCC GGCCGCGGAT GCCCTCTACG CCTCGCCCTA CGAACCCTGG TCGCAGTACC TGACCGGCAC GCAGGGACAG GATCCGGGCT ACGACCCGCT GGCCTTCGCC GTCGCCGAGG CGCACAAGCG GAACCTGGAA CTACACGCCT GGATGAACCC CTACCGCGTG TCCACGCAAC CGGATCCGAG CCGGTTGGTA CCTACGCACC CGGCGCGCGT GCACCCCGAC TGGTGCGTGG AGTACAGCGG CGAGCTGTAC TACAACCCGG GCGTCCCGGC GGTCCTGGAC TTCGACGTCC AGGTGATCAC CGACGTCGCG ACCCGCTACG ACATCGACGG CATCCACTTC GACGACTACT TCTACCCCTA CCCCGTGGGC ACGGCTGACT TCCCGGACGA CGCGGCCTAC GCCGCCTACG GCGCGGACTT CCCCGACAAG GCCTCCTGGC GCCGGGCGAA CGTCGACAAG CTTGTCAGCA CCCTGCAACG CGAGCTGCGC GCCGTGAAGC CCTGGATCAA GTGGGGGATC AGCCCCTTCG GCATCTGGCG CAACCAAGCC ACCGACCCCC TGGGCTCGGC GACCAACGGA CTCCAGTCCT ACGACGCCCT GTCCGCCGAC ACCCGCGGCT GGATCCAGAA GGGCTGGCTG GACTACGTGG CGCCGCAGCT GTACTGGAAC ATCGGCTTCC CGGTCGCCGC CTACGACGTC CTGGTCGACT GGTGGTCCAA GGCCGTGGAC GGCACCGGCA CGCAGCTGCT GATCGGCCAG ACCGTCTCCA AGATCGGCAC CCCGACCCCG CCGGCCTGGC TCGACCCGAA CGAGATGCCG AACCACCTGA TCCTCAACCG CCGGTATCCC GAGGTCGCCG GCGACATCTT CTTCAACATC ACCAAGCTTC TCACCGACCC GCTCGGCTTC CAGACCCGCC TGATCGACGA CCTGTACGAA TACCCGGCAC TGGTCCCCGA AATGTTCCGG CACTCCGGCC CAGCACCAGA GCGCACCGCA CTGACCGAGG CCCAGCCCAC CTCCACCGGC ACGCGACTGC GCTGGCTCCA CCTCGGCCGC CCGCACGGCG TCGAAGCCGC GTACTACGCG GTCTACCGCT TCGACGGCCG CCCACCACAG CCCGCGTGCG ACTTCACGGA TGCGAAGAAC CTGCTCGGCA CCGCCCGCGC AGTACCGGAT CTCTTCAACG GCTGGACCGA CACCAGCGCC ACCTCAGGAA AGCAGTACAC GTACTACGTC ACGGCCCTGG ACCGCTCGCA CCACGAGAGC GCGCCGAGCA ATCCGCAGGT GGTGGGCCGG TAA
|
Protein sequence | MTRVPSRRSF LATSGGLTAA LAGGAMLIGS SPAAAAAPAG NPYPAPCAGD PAHPKRQLRG AWIASVSNIN WPSAPGLTAE QQQTELTGLL DAVVRMRMNA VYLQIRPAAD ALYASPYEPW SQYLTGTQGQ DPGYDPLAFA VAEAHKRNLE LHAWMNPYRV STQPDPSRLV PTHPARVHPD WCVEYSGELY YNPGVPAVLD FDVQVITDVA TRYDIDGIHF DDYFYPYPVG TADFPDDAAY AAYGADFPDK ASWRRANVDK LVSTLQRELR AVKPWIKWGI SPFGIWRNQA TDPLGSATNG LQSYDALSAD TRGWIQKGWL DYVAPQLYWN IGFPVAAYDV LVDWWSKAVD GTGTQLLIGQ TVSKIGTPTP PAWLDPNEMP NHLILNRRYP EVAGDIFFNI TKLLTDPLGF QTRLIDDLYE YPALVPEMFR HSGPAPERTA LTEAQPTSTG TRLRWLHLGR PHGVEAAYYA VYRFDGRPPQ PACDFTDAKN LLGTARAVPD LFNGWTDTSA TSGKQYTYYV TALDRSHHES APSNPQVVGR
|
| |