Gene Information |
Locus tag | Caci_3524 |
Symbol | |
ID | 8334877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 3935596 |
End bp | 3937497 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644956668 |
Product | hypothetical protein |
Protein accession | YP_003114271 |
Protein GI | 256392707 |
COG category | [S] Function unknown |
COG ID | [COG4529] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
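The length fields in the Gene Information table agree with each other under two common conventions, both of which are assumptions here rather than something the record states: coordinates are 1-based and inclusive, and the CDS ends in a stop codon that is not counted in the protein length. A minimal Python sketch of that arithmetic follows; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Values taken from the Start bp, End bp and Gene Length rows above.
    print(gene_length(3935596, 3937497))  # 1902
    print(protein_length(1902))           # 633
```

Passing the full gene sequence from the Sequence section to gc_percent would give an unrounded figure to compare with the rounded GC content row.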
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.284733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.8933 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCGAA CCGCCCCATG CCCGAGGGTG CTCATCGTCG GGGCAGGGTT GGCCGGGACT GCTACGGCGA TCCGGTTGTT GTGCTTTGCT CGTCGGCCTT TGGAAGTCGT GTTGGTGGAG CGGCGGGCCG ATTATCGGTC GGCGGGGGTC GCTTATCATC GCGACGGCAA TCCCTGGGAT CATGTGTTCA ACATTCAGGC GGGGCGGATG TCGGCGTTCC GGGAGGATGT GCTCGACTTC GTGCGGTGGG CGAATCGGGA GGCTGATCGG GGGGATTGGC CGGCGCCTTG GGCCGATTGG GAGTTCACCG AGCATGGTCC GGCGCCTCGG CGGATCTTTC AGGACTACCT CGCCGAGCGG TTGGCGTACG CCAGGCAGGA GGCGAGTGAG GGCGTCGTGC TCGTCGAGGC CGACGGCGAG GTGGTCGACA TCGCGCGCTG TGGCGCGGGC TTGGACGTGA CTGTCCGGCC GCCGGTTGCG GATCCTGAGA ATCCCGGCGG GGAGGCCTTG CCGGGCACGC CGAGCGTCCT CTACGCCGAT CACGTCGTGC TCGCTACAGG GCTCGAGCTT CGGGACATGC CCTTCTCCAC CGATGTGCTC GGGCACGCCT CGTTCATCCG CAATCCCTAC TCCCGCACCG GGATCCGCAC CGTCGAGTCA TTGGCACCGG ATGCGACCGT CGCGATCGTC GGGTCCGTGC TCAGCGCGTA CGACTCGACG GCGTTCCTGC TGCGCCGCGG CCACAGCGGT CCGATCCACC TGATCTCCAG GACCGGCACG ATCTTCCGGA CCTATCCGGA GAACCACGAG CACGCCGTCG TCCAGCTCCC CTGCCCGACG TCGTTGTTGC AGCCGTATCA GAACCGTGAG GAACTCATCG AGCGGGTTCG CACGGAGTGG ACCGCGGCCT GCGCTCTGGT CACGAAGGAC CATCCCGACA TCTCCCCCGA GGTCGTCGCC GAGCGCGTGA CCAAGGCGTG GGAGCCGTAT CTGCCCGAAG CCATCGCCCT GATTCCCAAC CCCGAACTGC GCAACCTCCT GGATGAATTC AGTACCCTGA TAGCTGCCCT GCGGGTGAGC GCCGTGCACT ACACAATGTC CGTCATCGAG CCCGCGATGC GCCCGGCCGA CGGGACGGTG AAACTCGTCG TCGGCAAGGT CGAGAACGTC GCCCCGGCGG ACTCCGGGCG CCTCGTGGTG ACCGTCGCCG GTCCGGAGAC CAAGCAGGCC ATCGAAGCGG ACCTGGTCAT CTCCAACTTC GGGCGGGAGC CGGACTACGA CCGGGTCGAC CATCCCTTGT GGCGCAACCT GTTCCAGCGG GGTCTGGCCG TGCCGCACCG GCGCACCGGG CGCGGTGTGG ACGTCGACGG CGACGGGACG CTGCTGACGC CCGACGGCGA GCCGTCCGGA CCGCTGTCGG TGGTCGGTGT GCCGCGCGAG GGCGACGAGA TCGTCCGGAA CGGCCGCACC GGGGCGTTCG CCTTCAACCT GGCGGCCGTG AAGAACCAGT CCATCGTGGT CGCCGCGCAG GTCCTGGAGC AGATCGAGCT GCGCGAGGGC GACCTGGCGC GCCACCTGGA GGGCTACCGC AAACAACTCG GCACGCTCGA ACAAACAGCC GCGGCCGGAT TCGAGGAGGC TGTCGTACTG AAGGTGAGGA GTATGGCCAT GCGCGCACGA AGCGGGCGAA GCTCGCTCGA CGCCGAGACC GGCGACCGGA TCCGCTCCGT GTCAGCGCTC TGTGACACCC CTGCTTACCC GATCGACGCC TCCCATCGCG ACCGGCTGAT GGGGGTGATC 
GTCACCCGCG CCGCGGTGCG ACGTCTCACG GACGTCTCGG TGACGCCGCG GCAGCTGCGC CGACAACTGG GTTTGGCGAA CCCCGACGAC ACGGAGGACT GA
|
Protein sequence | MLRTAPCPRV LIVGAGLAGT ATAIRLLCFA RRPLEVVLVE RRADYRSAGV AYHRDGNPWD HVFNIQAGRM SAFREDVLDF VRWANREADR GDWPAPWADW EFTEHGPAPR RIFQDYLAER LAYARQEASE GVVLVEADGE VVDIARCGAG LDVTVRPPVA DPENPGGEAL PGTPSVLYAD HVVLATGLEL RDMPFSTDVL GHASFIRNPY SRTGIRTVES LAPDATVAIV GSVLSAYDST AFLLRRGHSG PIHLISRTGT IFRTYPENHE HAVVQLPCPT SLLQPYQNRE ELIERVRTEW TAACALVTKD HPDISPEVVA ERVTKAWEPY LPEAIALIPN PELRNLLDEF STLIAALRVS AVHYTMSVIE PAMRPADGTV KLVVGKVENV APADSGRLVV TVAGPETKQA IEADLVISNF GREPDYDRVD HPLWRNLFQR GLAVPHRRTG RGVDVDGDGT LLTPDGEPSG PLSVVGVPRE GDEIVRNGRT GAFAFNLAAV KNQSIVVAAQ VLEQIELREG DLARHLEGYR KQLGTLEQTA AAGFEEAVVL KVRSMAMRAR SGRSSLDAET GDRIRSVSAL CDTPAYPIDA SHRDRLMGVI VTRAAVRRLT DVSVTPRQLR RQLGLANPDD TED
|
| |
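The gene sequence above is printed in space-separated blocks of ten bases, so it has to be normalized before any check is run on it. The sketch below is a hedged illustration of a basic CDS sanity check: it strips the whitespace, then tests for a length that is a multiple of three, an ATG start, and a terminal stop codon. The helper names are hypothetical. Testing only for ATG is a simplification, since translation table 11 also permits alternative start codons, and the short input is a toy sequence, not this record's data.

```python
# Stop codons for the standard and bacterial (table 11) genetic codes.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def check_cds(raw: str) -> dict:
    """Report simple structural properties of a coding sequence."""
    seq = normalize(raw)
    return {
        "length_bp": len(seq),
        "multiple_of_three": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),  # simplification, see lead-in
        "ends_with_stop": seq[-3:] in STOP_CODONS,
    }


if __name__ == "__main__":
    # Toy input in the same block layout as the record.
    print(check_cds("ATGCTTCGAA CCTGA"))
```

The same function can be applied to the full Gene sequence field once it has been copied into a string.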