Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_0493 |
Symbol | |
ID | 8331820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 569440 |
End bp | 572580 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644953657 |
Product | hypothetical protein |
Protein accession | YP_003111284 |
Protein GI | 256389720 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5635] Predicted NTPase (NACHT family) |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00353648 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00104497 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAAGGCT CCATCATCAC ATTCTATTCC TACAAGGGTG GAACCGGCCG CTCCATGGCC CTGGCCAACG CGGCGTGGAT CCTGGCAAGC AACGGCCTGA GGGTACTCGT GGTCGACTGG GACCTTGAAG CGCCGGGCCT TCACCGCTAC TTTCACCCCT TCCTCTCCGA CCGGGATCTC AGGACATCGA TGGGACTCAT CGATCTCGTC TGGGAGTTCG TCGCGGCCGC CTTGGACTCG GAAGCCCAGG ACGAGCCCGG CTGGCACGAG GACTTCGCTC GGATCTCGCC CTATGCAATG TCGATCGACT TCGACTTCCC CGGCCGAGGT ACCGTCGACC TGGTGCCAGC GGGAAAGCAG GACGCGTCGT ACTCGGCGCT GGTCAGCTCC TTCGACTGGG GGAACTTCTA TGAGCGTCTC GGCGGCGGCG GCTACCTTGA GGCTTTGAAG CGCGACATGC GTCTTCACTA CGACTACGTA CTCATCGACA GTCGCACCGG TCTCAGCGAC ACCGCCGGGA TCTGCACTGT CCACCTGCCA GACATCCTCG TCAATTGCTT CTCGCTGAGC ACCCAGGCCA TCGACGGCGC CGCCGCCGTG GCCGGGTCGG TGCACCGCCA GCGGGAGAAT CGGCAGGTCC GTATCTTTCC GGTACCCATG CGCGTCGAGG ATGGAGAGCA GGACAAGCTG GACGCCAGCC GGGACTTCGC CCGCGAACGA TTCGGCCGCT TCCTGTTCCA CGTCTCCGAT CCTGAACGCT ACTGGGGAGA GGTCGAGGTC CCGTACAAGA GCTTTTACGC ATATGAGGAG ATCCTGGCCA CCTTCGGTGA CCGGCCGCAG CAGGAGAACA GCGTACTGGC CGCGACCGAG CGGATTGTCG GGTACTTGTC CGACGGGCAG GTCACAGCTT TGAGTTCATC ACTCTCCGAG TCTGAGCGGC GGGTCTGGTT GACCCGCTTC CAACGCAACG GACCGAAGCT TTCGCGCACC GAACTCGACG TCCGGCTCCC CGGGCCGCCA CTGAAAATCT TCATCTCGTA CGCACACGAC TCGCACGAGC ACTTCGAAGA GGTCCGCGAG CTGTGGTTCC TCCTTCGCGC CAATGGCACC GACGCCCAGC TTGGCTCACC CGATAACGAG CGCAGGTGGA ACGGGCCGGT TCCGACCGCG ACGGAGATCC GCACAGCCGA CCTGATCATT GTCGCTGTGT CGCCACTTTA TCGACGATCC ACCGGCGATG ACGGTCAGAC ATCAGGGGAC GAGAGTGGTG CTGGAGTAGA AGCACGCCTG ATCCGAGACG AGGTCCAGGG CGGATACTCC GTGCAACGGA TCATCCCGGT CGTTCTCCCG TCGGCCGCCG TCACGGAGGT GCCAGCCTAT CTAGCGGACA GCCCGGTCAG GCCCGTCATC GTCGAGCAGC TCACAGCGGA CGGCATCCGA CCGCTCCTTC GCCAGGTGGC TCGCTTCAAG CGAGTCGATC ATAAGCGGGC AGACTTCAAC CCCTCCTGGC GTGAAAATCT TCCGAAGCCG GAACCCGATC TCGAACAAGC AGCACTTCAG CTCGCCCGAA GCGTAGCCAG CGGTTGGGAG GTGTCGGCGG CGGCGACACT CAGTCGCCCA CCGCTGGCGA CCCGATGGTC GGGAAGCCAG CGGCGCGGCT CGTTCGATGA TCTCGTGAAC ATTTTCCTGA CCTCTTCCAG AGGACGGCTG CTCGTCCTGG GCTCAGCCGG CTCGGGTAAG ACGACGACGG CAGTCCGTCT ACTCCTCGAC CTGCTAGCCA GAAGAACAGA CGAGCCGATT CCGATCCTGC TGGCGGTAGC CACATGGCGG CCGGACGTTG AACATATCAA CACCTGGATT ATCCGACGCT TGCAGGAGGA CTACACTGTC GGCGGCATCG CCGAGCGTCT CGTCACGACG GGGATGATCC TTCCGATTCT AGACGGGCTC GACGAGCTGC CGGTCACCCA GCGTGCGCTG GCACTCGATG CACTTTCTAC GCTGAGGATC AACCAGCCCT TCGTGGCAAC ATGCCGAACC GGCGATTATC GACAGGTGGC GGCCGAGTTG GGACAAATGC TGGCGAACAC TGTCGTCGTC GAACTCGAGC CAATCACCAG CAACGACATC CGCGCCTACC TTGCTGTAGA CCTGCCGGAT AGCGATCGTC GTTGGCAGCC GGTATTCGCG CACCTCCGAG AGTTTCCCGG TGGACTACTA GCACAGTCCA TGTCGAATCC ATTCATGTTG GACCTGATGC GGGACGCATA TCGGAGGCCA AACACCGATC CGGCCGAACT CCTAAATAAC CGGCTGTTTC CCACCACGCA AGCAATCGAG CATCGTCTCT TCGGCACGTT TATCGGCAGA GCCTATGCGC CGCGTCCAAG CACCAGACCA GAATCGACAT ACGACGCTTC ATCGGCCGAC CGCTGGCTTA GCTTCCTCGC GCGCTCGCTG CATGAGCAGC ACACGCGAGA TTTGTCCTGG TGGCATCTTC ACCGCGTGCT GCCTCCAGGG GGCCTGAGGA CTCTGCTCGC GCTCGCGGCC GGATTCTTCG GGGCCCTTAT GCTTGGCTTG GCTGGCCTCC GATTCGACGT GTTGCCGGAC ATCGCAGCAA TCCCCGGCTG GGTGAAAGGC GCGCTCGCCG GCACCGGACT CGGGCTATTC GTTGCCCTCG TGGCGAGGGA GCCTGTACAG CTCAGCAAAC CATCCCCCCG GCGCGCCGAA CCAAGTCCAA GGCGCCGTCA GCTGGGGCGT CTCGTCGGCA TTTCCGCTGA TCTCAACCAG GCGTCCAGCC CAGCTGCGGC ACTGCTCATT GAACGCAGGA CAGTATCGAC GGTCGGACTG CTAATGGTTC TCGTGATCGG CATCTCAGAA CTGGTATCCG GCATCCCAAG AGCCTTGGTC GAGCCGATTG TGGCAACGGC CGTCGGCCTG ATTATGGTCC GCTTCAGCAC GAGCTCGTGG GGGTGGTACT CGGTGACGCG CTGGTGGCTG GCGCTCCGAG GGGTACTGCC GTTGCGAGTA ATGCGGTTCC TCGGCGACGC GCATGCTCGT GGCGTTCTGC GCCAGCAAGG AGTCGCCTAC CAGTTCCGCG ACGTCGCACT CCTCGAATGG TTCGCCGGAG ATCATATATG A
|
Protein sequence | MKGSIITFYS YKGGTGRSMA LANAAWILAS NGLRVLVVDW DLEAPGLHRY FHPFLSDRDL RTSMGLIDLV WEFVAAALDS EAQDEPGWHE DFARISPYAM SIDFDFPGRG TVDLVPAGKQ DASYSALVSS FDWGNFYERL GGGGYLEALK RDMRLHYDYV LIDSRTGLSD TAGICTVHLP DILVNCFSLS TQAIDGAAAV AGSVHRQREN RQVRIFPVPM RVEDGEQDKL DASRDFARER FGRFLFHVSD PERYWGEVEV PYKSFYAYEE ILATFGDRPQ QENSVLAATE RIVGYLSDGQ VTALSSSLSE SERRVWLTRF QRNGPKLSRT ELDVRLPGPP LKIFISYAHD SHEHFEEVRE LWFLLRANGT DAQLGSPDNE RRWNGPVPTA TEIRTADLII VAVSPLYRRS TGDDGQTSGD ESGAGVEARL IRDEVQGGYS VQRIIPVVLP SAAVTEVPAY LADSPVRPVI VEQLTADGIR PLLRQVARFK RVDHKRADFN PSWRENLPKP EPDLEQAALQ LARSVASGWE VSAAATLSRP PLATRWSGSQ RRGSFDDLVN IFLTSSRGRL LVLGSAGSGK TTTAVRLLLD LLARRTDEPI PILLAVATWR PDVEHINTWI IRRLQEDYTV GGIAERLVTT GMILPILDGL DELPVTQRAL ALDALSTLRI NQPFVATCRT GDYRQVAAEL GQMLANTVVV ELEPITSNDI RAYLAVDLPD SDRRWQPVFA HLREFPGGLL AQSMSNPFML DLMRDAYRRP NTDPAELLNN RLFPTTQAIE HRLFGTFIGR AYAPRPSTRP ESTYDASSAD RWLSFLARSL HEQHTRDLSW WHLHRVLPPG GLRTLLALAA GFFGALMLGL AGLRFDVLPD IAAIPGWVKG ALAGTGLGLF VALVAREPVQ LSKPSPRRAE PSPRRRQLGR LVGISADLNQ ASSPAAALLI ERRTVSTVGL LMVLVIGISE LVSGIPRALV EPIVATAVGL IMVRFSTSSW GWYSVTRWWL ALRGVLPLRV MRFLGDAHAR GVLRQQGVAY QFRDVALLEW FAGDHI
|
| |