Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4363 |
Symbol | |
ID | 8335717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4950660 |
End bp | 4953551 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644957466 |
Product | protein of unknown function DUF1680 |
Protein accession | YP_003115068 |
Protein GI | 256393504 |
COG category | [S] Function unknown |
COG ID | [COG3533] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0125989 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.151908 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTGC CCCATGCGCC GCGGCCTTCC CTGCCGCTTT CCCGCCGCGC CGTGCTGCGC ACCGGCGCCC TCGCGGCGGC CGCACTCACC ACCGGTCCGT ACCTGGCGTC CTCCGCCTCG GCGGCCACGG CTCGTCTCGC GCCGGCCGGC GGCGGCCTGT ACTCCCCGAA CGCCGCTCCC CTGGCGCCGA CCGCGCTGCT GCGCCTGCCG CCCGGCGCCG TGCGCGCCTC CGGCTGGCTC GCCGGACAGC TCCAGCTCCA GGTGGACGGT CTGTGCGGCA AGTACCAGGA CACCTCGCAC TTCCTGAACA AGTCGACCAC CGGCTGGCTC AACCCGTCGC AGACCGGCTG GGAGGAGGTG CCCTACTGGC TGCGCGGCTA CGGCGACCTC GGCTACGTCA CCGGCAACGC CGCAGTCCTG GCCGACACCG CTAACTGGAT CAACGGCATC CTGGCCACAC AAGCCGCCGA CGGCTTCTTC GGCCCGGCTT ACCTGCGCAC CAACCAGAAC GGCCAAGCCG ACTTCTGGCC GTACCTGCCC CTGCTCCAGG CGCTGCGCAG CTACCAGGAG TACACCGGCA GCCAGCAGGT CCTGAACGCG ATGACCGCGT TCCTCCGGTT CATGAACGCG CAGCCCGGCT CGGTGTTCTC CGCCTACTGG CTCTCCTTCC GCGTCGCCGA CGGCCTGGAC GTCGTCTACT GGCTCTACAA CCGCACCGGG GAGGCCTTCC TGCTCAACCT GGCCGACACG ATGCACGCCA ACAGCGCGAA CTGGCTGAAC AACCTGCCCA CGCCGCACAA CGTCAACCTG GCGCAGGGCT TCCGCGAACC GGCGGTATAC GCGCTGCGCT CGGGCCAGTC CGGCATGACG CAGAACGCGT ATCAGAACTA TGCGTCGATC ATGGGACGCT GGGGTCAGTT CCCCGGCGGC GGCTTCACCG GCGACGAGAA CGGCCGGATC GGCTACGCGG ACCCCCGCCA GGGCTTCGAG ACCTGCGGCG TGGTGGAGCT GATGGCCAGC CACGAGCTGC TGAACCGGCT CACCGGCGAC CCGGTCTGGG CCGACCGCTG CGAGCAGCTG GCGTTCAACA TGCTGCCGGC CACCCTGGAT CCGCAGGGCA AGGGCACGCA CTACATCACC TCGGCGAACA GCGTGGACCT GTCGAACACC GCGAAGACCC ACGGCCAGTT CAGCAACGCC TGGGCGATGC AGGCGTACAT GCCCGGCGTG GACCAGTACC GCTGCTGCCC GCACAACTAC GGCCAGGGCT GGCCGTACTT CACCGAGGAG CTGTGGGCCG CCACGCCGGA CAACGGTCTG TGCGCGGTGA TGTACGCCCC TTGCTCGGTC ACCGCAAACG TGTCCGGCGG CCACTCGGTC ACCATCACCG AATCCACCGG GTATCCGTTC ACGCAGTCCG TGACGCTGAC GCTGACCATG TCCGCCCCGG CAACGTTTCC GTTGTACCTG CGCGTCCCGG GCTGGTGCTC GGCTCCGGCG GTCGCGGTCA ACGGCGGGCA CGTGAGCGCA CCGGCAGGAC CCGCCTACAC CTCGATCTCG CGGACCTGGC ACACCGGGGA CACGGTGACG ATCCAGCTGC CTTCCACTCC CGTCGTCAGG ACGTGGAGCG CGATCGGCGG CGCGCTGTCG GTGTCGAACG GCGCGCTGGA CTACTCGCTG AAGATCGGCG AGAACTACGT CCAGTTCGCC GGGAACTCCG AGTTCCCCGA GTACGAGGTG CACGCCACGA CGCCCTGGAA CTACGGGCTC TCGCTGCCCG CGGCGAACCC GGCGGGCGCT CTGTCCTTCC ACGCCGCCGG CGGCGCTGTG CCAGCGAACC CGTTCACGCA GCAGAGCGTG CCGGTCAGCA TCACCGCACC GGCCGCGCAG ATCGCCAAGT GGACCACCGA CGATCAGAAC GTCGCCACGG AGCTGCCGAC CGGACCCTTC CAAACGTCCG GGACGACCAA CGTCACCCTG ATCCCGATGG GTGCGGCGCG GCTGCGGATC ACCGCGTTCC CCGCCGCCGG CTCCAGCGGC AACGCCTTCT CCCAGCCCGG CGGCTACTTC CGGCTGTTGA ACGCCAACAG CGGCAAGGTC ATGGGCGTGT CGAACATGTC CTGGGGCGAC TCGGCGAACG TCGTGCAGTT CGACGACAGC GGAACCGCCG ACCACGTCTG GCAGCTGCTG GACAACGGCG ACGGGAACGT CCGCATCCGT AACGCGAACA GCGGTCTGGT GCTCGGCGTG GACGGCATGT CGACGGCGAA CTCGGCGAAC GTCGTCCAGT TCGAGAACAC CAACACCCTG GACCATGTCT GGACCCTGAT CGACAACGGC GACGGCCGGA TGCGCATCCG CAACGTCAAC AGCGGACGGG TCGCCGGCGT CGCCAACATG TCGACCGCCG ACTCGGTGAA CGTGGTCCAG TACGACGACA ACGGCACGGC GGACCATCTC TGGACCCTGA TTCCCGACGG GCCGGTACGG ATCGTCAACA AGAACAGCGG TCTGGTCCTC GGCGTGGCGA ACATGTCCAC CGCGAACTCG GTCAACGTCG TGCAGTACGA CGACAACGCC ACCGCCGATC ACCGCTGGAC CTTCCTGAGC GATTCCGGCG GCTGGTGGCG GATCCAGAAC CAGAACTCCG GCAAGGTCAT GGGCGTGTCG AACATGGCGA CCACGGATTC GGCGAACGTC GTGCAGTACG ACGACAACGG CACCGCCGAC CACCTGTGGC GGCTGCGCCC CGGCGGCGGT CCGTGGTTCC GCATCCAGAA CAAGAACAGC GGCCTGGTGC TCGGCGTGGC GAACACGTCC ACGGCCGACT CGGCGAACGT CGTGCAGTTC GACGACAACG GGTCCGCAGA CCACCTGTGG CGGATTCTCT AG
|
Protein sequence | MSLPHAPRPS LPLSRRAVLR TGALAAAALT TGPYLASSAS AATARLAPAG GGLYSPNAAP LAPTALLRLP PGAVRASGWL AGQLQLQVDG LCGKYQDTSH FLNKSTTGWL NPSQTGWEEV PYWLRGYGDL GYVTGNAAVL ADTANWINGI LATQAADGFF GPAYLRTNQN GQADFWPYLP LLQALRSYQE YTGSQQVLNA MTAFLRFMNA QPGSVFSAYW LSFRVADGLD VVYWLYNRTG EAFLLNLADT MHANSANWLN NLPTPHNVNL AQGFREPAVY ALRSGQSGMT QNAYQNYASI MGRWGQFPGG GFTGDENGRI GYADPRQGFE TCGVVELMAS HELLNRLTGD PVWADRCEQL AFNMLPATLD PQGKGTHYIT SANSVDLSNT AKTHGQFSNA WAMQAYMPGV DQYRCCPHNY GQGWPYFTEE LWAATPDNGL CAVMYAPCSV TANVSGGHSV TITESTGYPF TQSVTLTLTM SAPATFPLYL RVPGWCSAPA VAVNGGHVSA PAGPAYTSIS RTWHTGDTVT IQLPSTPVVR TWSAIGGALS VSNGALDYSL KIGENYVQFA GNSEFPEYEV HATTPWNYGL SLPAANPAGA LSFHAAGGAV PANPFTQQSV PVSITAPAAQ IAKWTTDDQN VATELPTGPF QTSGTTNVTL IPMGAARLRI TAFPAAGSSG NAFSQPGGYF RLLNANSGKV MGVSNMSWGD SANVVQFDDS GTADHVWQLL DNGDGNVRIR NANSGLVLGV DGMSTANSAN VVQFENTNTL DHVWTLIDNG DGRMRIRNVN SGRVAGVANM STADSVNVVQ YDDNGTADHL WTLIPDGPVR IVNKNSGLVL GVANMSTANS VNVVQYDDNA TADHRWTFLS DSGGWWRIQN QNSGKVMGVS NMATTDSANV VQYDDNGTAD HLWRLRPGGG PWFRIQNKNS GLVLGVANTS TADSANVVQF DDNGSADHLW RIL
|
| |