Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_7330 |
Symbol | |
ID | 8338699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 8510912 |
End bp | 8513071 |
Gene Length | 2160 bp |
Protein Length | 719 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644960412 |
Product | type IV secretory pathway VirD4 components-like protein |
Protein accession | YP_003118000 |
Protein GI | 256396436 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000633696 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00000000626768 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACCGCGT GGGCGTGGTA CGAGGCCTTC CCGCCGCGCG GCATGACGCT GGAAGCCTCC ACGAACGTCA TGCGGGCGCT CGCCGGACGG CCGCGGTTCG GATTGCTCGG GCTGGTGCCG CTGGCTGTCT TCGAGCTCTG GCTGTATCCG GAACGCGTGC GCTGGTTGGT CGGCATGGAT GAGCAGATCA GTGACCGGCT ACCGGGGGAA CTCATGGCGC AGTGCCCGGA ACTGGTACTG GTACACACCG ACAATCCGAA GCGTCCGGCA GTCGTTGCCG GCCGTGAGGT TCGCTTCAAG AGCCTGGCGT ATCCCATTCG CGGTGACGTC GCCGAAGGAG TCACCGCGGC GCTGCTGCGT ATCCGTCAGG GACTGCGCCC TGGTGAAGCC GTCGCACTCC AGTTCGTCAT CGGGCCCGGG CAGTTCTTTG CGGTTTTGCC ACAGCGGCGG ACGCCGTTGG ATGTCCTGGG CTTTACCAGC CCACCGGAGC CAGACAGCAA CGACCGTCGA GCATTCAAGC GCAAGGTTTC CGAGCCCTTG TTCGCCATCC GCGGCCGAAC GGGGGCGGTA GCAAAGGACC CTCGTCGCGC CGCTGGGCTG ACCCGACCGG TGTTCTCGGC CCTGGCTTTG GCCAACGACC GGCACGGCCG CGTCCAGGTC TCGGCGCAGT CTTCGCGGGT AGCGCATCAG CTCATGGCGG TCATGGGTAG GGCTCGAACT TGGTCGAGCA TCGTGAATGC CGGCGAACTG GCGACCGTAA TCGGCTGGAA TATCGCTGAA ATGGACGTGC CGGGCAGCGG CAACGGCTTC GCGCCGCCAC CTCCCGAACT TCTGGACACC GGGGACAACA CCAGCCGTCC GGTGGGTCGG AGCCAGCACC CAGCTACCCC AAACCGGCCA GTGCATCTAC CGCTGCGCTC GTATGCGGCA CACTGCCACG TCATTGCTCC GACCAACGCG GGCAAGTCAA CGATGCTCGC CCACTGGGCA GTTGCCGAAG CGGAAGCTGG CCGCTCACTG GTGGTCATCG AGCCCAAAGG CGATCTGGTA GACGACATTC TCAGCCTGTT GCCGGAGGAG CGGCGAGACG ATGTGGTCAT CATCGACCCG GGAGCAGACG CTCACCTGCC GGTGATCTCC TTCAACCCCT TGCAGGGGGC GGTCAGTGAT GCCGAACGCC GCGCCGATTC ACTGCTCGGG CTGTTTAAGG AGCTGTTCGG GGCAAACATC GGGCCCCGGA GCAGTGACGT GCTGTTGCAC GCACTCATCG CACTGTCTAG ATCGGCAGAC GGAACCTTGA CCGACGTCAT GCCCTTCCTG AGCGACGAGC GCTTCCGGCG CTCAGTGCTT GCTCACGTCA GCGATCCGCT GACCTTGGCG CCGTGGGCGG CCTGGTTCGA CACTCTCAGC GTCGGTGAGC TCGGCCAAGT GGTCGCGCCG ATCGGGAACA AGCTACGGAT CATCAGCGCC CGGCCGAGTA TCCGTCGACT GCTCGGCCAG CCAGACCCCG CGTTTCGACT GGAGTCGATC TTCGAGTGCC CGACGATCGT GCTCGTCAAC CTCAACGCCG GAGCTATTGG AGCCGAGGGC TCGAAGATCA TCGGGACATT GCTGCTACAT CAACTGTGGG ACGCTATCCA GCGCCAGACC ACCAAGCCCC CGAAGCAGCG TCGGGCGGTG CCAATCTTCG TCGATGAGTT CCAGGGCTAC ACGTCAGGCC TCGACTTCGC TGATGTGCTG GCCAGAGCCC GCGGGGCAGG GGCACCATTC ACTGTGGCGC ATCAACACCT TGACCAGTTG AGCCCAACTC TCAAATCCGC GGTGCTGGCG AATGCACGCT CACGCGTTGT GTTCCGGCCG GCTGAGGGCG ACAGCGCCGC CCTGGCGACC GTCCTCGGCA AGCCGGTGAC TGCTGACGAC TTGGCGCGGC TCCCGGCGTT CCATGCGCTG GTGCGTGTCC CGCTCGGTGG CACTCCGTCA CCGGCGTTCG AGGTGGCCAC ACTGCCACTG CCGAAGTCCA CCACCGACCC GAGGCGACTC CGGCGCCAGT CTGCCGAGCG CTACGGCACC GATCCGAAAG CTATCGACGA CGCCATCTTG CACCGCTGGC GGGGCGATGA GCCGGATGAA CCAGTCGGAG CGCGGAGGAA GCAGCCATGA
|
Protein sequence | MTAWAWYEAF PPRGMTLEAS TNVMRALAGR PRFGLLGLVP LAVFELWLYP ERVRWLVGMD EQISDRLPGE LMAQCPELVL VHTDNPKRPA VVAGREVRFK SLAYPIRGDV AEGVTAALLR IRQGLRPGEA VALQFVIGPG QFFAVLPQRR TPLDVLGFTS PPEPDSNDRR AFKRKVSEPL FAIRGRTGAV AKDPRRAAGL TRPVFSALAL ANDRHGRVQV SAQSSRVAHQ LMAVMGRART WSSIVNAGEL ATVIGWNIAE MDVPGSGNGF APPPPELLDT GDNTSRPVGR SQHPATPNRP VHLPLRSYAA HCHVIAPTNA GKSTMLAHWA VAEAEAGRSL VVIEPKGDLV DDILSLLPEE RRDDVVIIDP GADAHLPVIS FNPLQGAVSD AERRADSLLG LFKELFGANI GPRSSDVLLH ALIALSRSAD GTLTDVMPFL SDERFRRSVL AHVSDPLTLA PWAAWFDTLS VGELGQVVAP IGNKLRIISA RPSIRRLLGQ PDPAFRLESI FECPTIVLVN LNAGAIGAEG SKIIGTLLLH QLWDAIQRQT TKPPKQRRAV PIFVDEFQGY TSGLDFADVL ARARGAGAPF TVAHQHLDQL SPTLKSAVLA NARSRVVFRP AEGDSAALAT VLGKPVTADD LARLPAFHAL VRVPLGGTPS PAFEVATLPL PKSTTDPRRL RRQSAERYGT DPKAIDDAIL HRWRGDEPDE PVGARRKQP
|
| |