Gene Information |
Locus tag | Caci_4979 |
Symbol | |
ID | 8336333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5692635 |
End bp | 5695817 |
Gene Length | 3183 bp |
Protein Length | 1060 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644958078 |
Product | coagulation factor 5/8 type domain protein |
Protein accession | YP_003115680 |
Protein GI | 256394116 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
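The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 5692635
END_BP = 5695817
GENE_LENGTH_BP = 3183
PROTEIN_LENGTH_AA = 1060


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS of this length, assuming one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and encodes no residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for the values listed: 5695817 - 5692635 + 1 = 3183 bp, and 3183 / 3 - 1 = 1060 aa.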
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00350881 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0761827 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAACG GATTGACCCG GCGGCAGTTC ATGAACGGGA TCGGGGGTGC GGCGCTGGCA ACGACGGTGT TCTCCTCCGG CGCCGCTTTC GCAGCGGACG GCGCTGGCAC TGGCACCGAG CGAGCGCCCG CGCCCGCGAC CCCGGCGCCC GACGACACGG TTGCCGCCGC GTACTACCAA TACCTGCTGC GGCACACGAA TTGGGCGCAG CAGAAGTGGG ACGCGACTGC CGGCCACTAT GCGGCGAGTG ACTACAACTT CGCCGTGGTG CTCGGCAACG CTGTCTTGCT GACGCACGGC ACCTATGACG CATCCGTCGC GGGCGTGGAT GCCGCGACCC TGAAGACCCA GACCCTCGCC ACGATCGACC ACTTCGCCGC CAGCAATGTG CTCAACGGCG GTACCGAGTG GGGCGAGACG ATGTTCTTCG ACAGCACCTT CGAGCTCTAC TTCATCCTCG CCGCCAAGTT GTTGTGGAAC GACCTCGACG CCGCGACGCA GACCCTGATC GACAAGATGA CCGCGGCGCA GGCGGCGTAC ACCACGGCCC TGGGCAGCGG GAACGATCCG CGCAGCGGCA GCTGGTCGCC CAACGGGCTG GCCGGTGGCT GGGAGGGCGA CACCAAAGTC GACGAGATGG CTGTGTACGC GCAGTGTCTC GGACCGGCTG TGGCCTGGCT TCCACAGCAC CCTGACAATC CCACTTGGAG CACCTGGCTG ACCACCTGGA TGCTCAACGA CACCGGGTTG CCGTCGGCCG ATCAGGCCAA CCCCACCGTC GTGGACGGTC GGCCGATCTC GGACTGGAAC ACCGCGCACA ACATCTTCGA CACCTTCTTC GTGGAGAACC ACGGCTCCTT CGAGCCGCAC TACCAACTCG AGACCTGGCG CATGTCCGCG CGAGTGGCCG CGCATTTCCT GGCCGCGGGC CGGCAGATCC CGGCCGCTGC CGGGGCGGCC AAGCCGAACG CCGCGCAGCT GTGGCGCACC ATCCGGCATG TGCAAAGCGA CAGCGGCGAG CCGTTCATGC CGATGAATCC CGACCGATAC CACCTCTTCG GTCGCGACGT GCTGCCGCTG GCGTTCCTGG CGCAGGTCAT GGGCGATCCG CTGGCGGCGC GTGCCGAAGC GAACATGGCC GCGCAGCTCG GGCCGTACCA GCTTTATCCG CCGGAGTATC AGCTCACGAA GTTCAGCGGG GAGGCGAAGT ACGAGCCGGA GGCGCGGGCC GAGCTGGCGA TCAGTTACCT GTTTCATGTG TGGCGTGCGC AGCAGGGGGC GCCGGTTCGG CCGGTCACGG GGGAGCAGTT CGTCGCTTCG GCGTCTGGCG CTACCGACTA TGGCGCCGTC CCCGGGTTGC TCGTGCACAA TACGGCGAAC GCGTTCGCTG CGACCGTTTC CAAGCCGGGG TATGTGAAGT TCTGCTACGC GCCCAACCAT GACGACTGGC TGTTCGACCT CTCCGGCGCG GCGCCCTCGC TGCTGCCTGC GACCGGGGCG ACGGTCACGA ACCGCTTCGC TGCCGCTTAC AGCACCTTGC GTGACGGGTA TGACGCGAGC GCTTCGCTGT TGACGTTGGC GAGTGGGTAC GCGGGATATG CGACGCTTCC CGATGGCGGC GTGGTGTACG CGACCAGCGG GAACGGTGCC GGGGAAGGTG TGCTGAATGT GTTCAACCTG GCGATGCCTG GCATCGCGGG GCTGGACGGG AGCCGGACTT ACGCCGGTGC CGGCGGCGCT TTCACCGTCT CGGCTGGCGA TGTGCAGACC GGCGGCACTA ACGATCTGTC CTTCAGCGCG 
GTGACGGCGC GGTACGTCCG GATGCTCGGT ATCAGGCCTG CTACACAGTA TGGCTACTCG ATCTACGAGC TTCAGGTCTA CGCGCCTGGC GGCACGACCA ACCTGGCGCA GGGTAAGGCC ACGACTGCTT CGTCCTTCAC GCCTGCCTAT CCGCCGCCCG CTGCCACCGA TGGCAATCCG GCCACGCGGT GGGCGGTCGC CGTCGCCAGC CGATCGGTGC CGGACAGCTG GCTGCAGGTT GACCTGGGAG CCGCCACGCA GATCGACCGC GTGTCCATCG CCTGGGAAGC TGCTTACGGC GCGGCCTTCG CTATCCAGAC CTCGGACGAC GGCAGCACCT GGAGCACCGC GGCGGCCTTG CCTGTGCAGC ATCACGTAGA CGGCGGTTGG GTGAACGTCG ACGGCCGTGC GGGATTCGTC GTGCGCGGCA GCCGCAATCC GCTGACCGTG ATCGGGAACA CGCTCACCCT GTCCGACGGG CCGGCGACCG GCGCCGCCGG GATGGTCGTG GAGGGCTATC CCGCGCAGAC GCCGCAGGGC ACCGCAGCCA TGGCGGCCTT GCCGACGCCG ACGCCGACGC CGACGCCGAA CAGCACTGTC GCAGGGCTCG CGGCGAGTAT CGCCGGGTCG CACCTGAGCC TGTTCAACCT CAGCGGACAG AACATCGGTA CCGAATCAGC GCCGGCCGAG CTGAGCGTTC CAGCTTCGGG GCGTGCGCGG CTGCTGTACC GCGGCCTACA GACCACCACC GCCACCGGGA CGGTCTACGA CGTCGTACTC GCCGCCGCGA CCGCGCGCGT CGAACCCCCG CGCTTCACGC TGACCGGGAC CATTCCCGCA GGCGTCGTAG CAGAGGTAGC GGACTCCCTG CACCTCACGC TCACAGCGCC CCCGACGAGC GGCTGCACCC TGACCGTCGT CTCCCGCTCC AACGCCACAA CCCGAACCAT CTCCCTGCGA GCCGGCCAGC AGAAACAGCT GACCTTCCCC GGAACCCTCA CCCCAACCCC CGACCTGGCC CTGGGCCGCA CCACCTACCC CACCTCACCC CTACCGGCCG GCATGTCCGA CCCCGCCTCC GCCGTAGACG GCAACCCCCA CACCGCCTGG ACCCCCGGCA CCCCCACCGC CCGCATGGTC ATCGACCTCG GCACCCCCAC GCCCCTGGCA ACCCTGACCA CCCACTGGAC CACCCCCCAC ATCCCCCCAT TCGCCATATC AGCATCCGCC GACGGCCAGC ACTACACCCC CCTCACCACC AGCAACCCCC ACCACCCCGA CCAACCCCTC CCCATCAACA CAACCACCCG CTACCTAGCC CTCACCCTCC AGAACTGGTC GCCCGCCCAC GCCAGCCTGA CCGCACTTTC GCTCACGTCC TGA
|
Protein sequence | MRNGLTRRQF MNGIGGAALA TTVFSSGAAF AADGAGTGTE RAPAPATPAP DDTVAAAYYQ YLLRHTNWAQ QKWDATAGHY AASDYNFAVV LGNAVLLTHG TYDASVAGVD AATLKTQTLA TIDHFAASNV LNGGTEWGET MFFDSTFELY FILAAKLLWN DLDAATQTLI DKMTAAQAAY TTALGSGNDP RSGSWSPNGL AGGWEGDTKV DEMAVYAQCL GPAVAWLPQH PDNPTWSTWL TTWMLNDTGL PSADQANPTV VDGRPISDWN TAHNIFDTFF VENHGSFEPH YQLETWRMSA RVAAHFLAAG RQIPAAAGAA KPNAAQLWRT IRHVQSDSGE PFMPMNPDRY HLFGRDVLPL AFLAQVMGDP LAARAEANMA AQLGPYQLYP PEYQLTKFSG EAKYEPEARA ELAISYLFHV WRAQQGAPVR PVTGEQFVAS ASGATDYGAV PGLLVHNTAN AFAATVSKPG YVKFCYAPNH DDWLFDLSGA APSLLPATGA TVTNRFAAAY STLRDGYDAS ASLLTLASGY AGYATLPDGG VVYATSGNGA GEGVLNVFNL AMPGIAGLDG SRTYAGAGGA FTVSAGDVQT GGTNDLSFSA VTARYVRMLG IRPATQYGYS IYELQVYAPG GTTNLAQGKA TTASSFTPAY PPPAATDGNP ATRWAVAVAS RSVPDSWLQV DLGAATQIDR VSIAWEAAYG AAFAIQTSDD GSTWSTAAAL PVQHHVDGGW VNVDGRAGFV VRGSRNPLTV IGNTLTLSDG PATGAAGMVV EGYPAQTPQG TAAMAALPTP TPTPTPNSTV AGLAASIAGS HLSLFNLSGQ NIGTESAPAE LSVPASGRAR LLYRGLQTTT ATGTVYDVVL AAATARVEPP RFTLTGTIPA GVVAEVADSL HLTLTAPPTS GCTLTVVSRS NATTRTISLR AGQQKQLTFP GTLTPTPDLA LGRTTYPTSP LPAGMSDPAS AVDGNPHTAW TPGTPTARMV IDLGTPTPLA TLTTHWTTPH IPPFAISASA DGQHYTPLTT SNPHHPDQPL PINTTTRYLA LTLQNWSPAH ASLTALSLTS
|
| |
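The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned and summarized follows. It assumes the grouping whitespace carries no meaning, and it uses the stop codons of translation table 11 (TAA, TAG, TGA). The helper names `clean_sequence`, `gc_fraction`, and `ends_with_stop` are hypothetical.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove grouping whitespace and upper-case the sequence."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Short illustrative input, not the full gene sequence.
    example = clean_sequence("ATGCGCAACG GATTGA")
    print(len(example), ends_with_stop(example))
    print(round(gc_fraction(example), 2))
```

Applied to the full Gene sequence field, the same functions could be used to compare the computed length and GC fraction against the Gene Length and GC content values reported in the table.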