Gene Information |
Locus tag | Caci_3970 |
Symbol | |
ID | 8335323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4503084 |
End bp | 4504694 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644957085 |
Product | hypothetical protein |
Protein accession | YP_003114688 |
Protein GI | 256393124 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
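The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4503084
end_bp = 4504694

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should consist of whole codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the table: 1611 bp and 536 aa.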
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00762799 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.606587 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCGAGC AGTCGACGTT CACTGTTGCC CAGTTGATCG ATCAGGCGGA GCGGTTCGAC GTCCCGGAGG TACAGCGCCT TTTCACCGCT CGACCCGAGT GGGTCGTCCT GCTCATTGAC TCCCTTTACC GAGGGATCGG GGTTGGCGCG CCTCTCCTGT GGAGTCCGCG CGAGGGTGCC CCAGATTCCC GCTATCGGTG TGAATCGACA GCCGATTATT GGATTATCGA CGGGCAGGGG CGCCTTACTG GAACGCTGGC CGCCTTCGGG ATCCGGCCGC CGTGGATCGC CGGCGAGCAG TGGGAAGCCA TGGGTGGTCC GGAGCGCGAG GTGGCGGTGG CGTTCACCCC GCTCGGCCAA ACTAATTTCG TACAGTACAA GCCTGGTGAG CGTTGCCAGA TCCGGCTTCG CGATCTGCTT GATCCGGGGC CAGGGGGACT GTCAAAGCTG CTGCGTGAGA CGACTGGGAC TATGCCGGAC GCTGTGATGA TCGAGACGCT CGCCGTACTC GCTCAACGGT TGCGCGATGC GGCGTTCCAT GTGTACTGGC AGGACGGTGG TCTGCGCAAT GTCGTCGACG CGTTCATCCG GCACAATCAG AGGGGCTCGG GGCGATTCCT GTCCCGTGAA GAGTGTGACC TGGCAGTTCT GGCCCTATCG TGCCCAGGGC TGCAACGGGA CATCATCGAC CCGGCTGTGG CTGATGTCGC CGCTGCCGGC TTCCCCCTGA CTCTGGATCG GCGCCGCATC TTCGCAGTCA TGAAGGTCCT GACGCCGGTG AAACTACGCA TGTGCATGGC GGACAACCCC GATCGGCTGC GGGCTGTCGC ATATACCGCC GTAGCCGGGG CCCGAGCTGT AGCCGAGTAC CTATCGCGCT GCGGGATCGC GGGCGACGAA CTGTTCGCCC GCCGTCCACT GGCGTTGGTA CTCGCGACGT TGTTCGCCCG CTTCCCGCAG TCTGCCTCAC GCGACTTCGC CCGACGCTGG CTGGCCCAGG CGCTGGCCTC AGGACGATAC GACTTCGGAG GCAACCAGTT CGCCGACAGC GACGCCTCCG CGGTCGCCCG CTGTACGACT CTGGACGACG CCGAAACCGT CCTGGCGGCT CGGATCGCGC AATTCACTGA ACCGCAGCTT GACCCGGAGG ACCTGACCAC CAGCCACTCC GCCGCCGGCA AGGCCTGGAC ACTTTACGCT CTGGCCTGCC ACGCACAGAC CTGCGGCCCG GTCAGCGATC TGGCCGATCC CACGATCGGC GCCGGCGACC CGGCACTGCA GTTGCACCCC TTGTGGCCAC ACACAGCCAG CAGGACTCGC CGCACCTTGG CCGCCTACGC GATGATGACC GAGGCCAGCG CGGAGCGCAT CGCGGCGGTC GGAGGGTTCA CCGTAGATGC CTACCTGGAC CTTCGTTGCT CGGACCAATC ACTACACGCC CAACAAATCT GTCGCCCCAG CTCCGATACC GACGTCGAGG AAGTGGTTCG TCACCGGACG GTCGCCCTCG TCGACATGAT CGGCGGCTTT CTAGCGCGAC TTGAACCGCT GGCACCCCCA CCCTTGGTCG GCGCGGACGT TGCACTGCCA CGCGCGCTGG AAACCGCGTG A
|
Protein sequence | MAEQSTFTVA QLIDQAERFD VPEVQRLFTA RPEWVVLLID SLYRGIGVGA PLLWSPREGA PDSRYRCEST ADYWIIDGQG RLTGTLAAFG IRPPWIAGEQ WEAMGGPERE VAVAFTPLGQ TNFVQYKPGE RCQIRLRDLL DPGPGGLSKL LRETTGTMPD AVMIETLAVL AQRLRDAAFH VYWQDGGLRN VVDAFIRHNQ RGSGRFLSRE ECDLAVLALS CPGLQRDIID PAVADVAAAG FPLTLDRRRI FAVMKVLTPV KLRMCMADNP DRLRAVAYTA VAGARAVAEY LSRCGIAGDE LFARRPLALV LATLFARFPQ SASRDFARRW LAQALASGRY DFGGNQFADS DASAVARCTT LDDAETVLAA RIAQFTEPQL DPEDLTTSHS AAGKAWTLYA LACHAQTCGP VSDLADPTIG AGDPALQLHP LWPHTASRTR RTLAAYAMMT EASAERIAAV GGFTVDAYLD LRCSDQSLHA QQICRPSSDT DVEEVVRHRT VALVDMIGGF LARLEPLAPP PLVGADVALP RALETA
|
| |
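The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation. The sketch below shows one way to strip the whitespace, compute GC content, and check that a sequence looks like a complete open reading frame. The function names are hypothetical, and the start-codon set is an assumption: it holds only the three start codons most often seen with translation table 11, not the full list of initiators. The toy input is the first three codons of the gene sequence followed by its final stop codon, not the full gene.

```python
# Assumption: only the three most common table 11 start codons are checked.
COMMON_STARTS = {"ATG", "GTG", "TTG"}
STOPS = {"TAA", "TAG", "TGA"}


def clean(seq):
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def orf_summary(seq):
    """Basic sanity checks for a coding sequence."""
    s = clean(seq)
    return {
        "length_bp": len(s),
        "whole_codons": len(s) % 3 == 0,
        "start_ok": s[:3] in COMMON_STARTS,
        "stop_ok": s[-3:] in STOPS,
        "gc_percent": round(gc_percent(s), 1),
    }


# Toy input: codons GTG GCC GAG from the start of the gene, plus the TGA stop.
toy = "GTGGCCGAG TGA"
summary = orf_summary(toy)
print(summary)
```

Passing the full Gene sequence field to `orf_summary` should let the reported gene length and GC content be compared against the values in the Gene Information table. Note that a GTG start codon is still translated as methionine at the initiation position, which is why the protein sequence begins with M.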