Gene Information |
Locus tag | Caci_4170 |
Symbol | |
ID | 8335524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4724009 |
End bp | 4727101 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644957273 |
Product | protein of unknown function DUF214 |
Protein accession | YP_003114875 |
Protein GI | 256393311 |
COG category | |
COG ID | |
TIGRFAM ID | |
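The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a stop codon that is not counted in the protein length; the function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a multiple of three; the
    terminal stop codon, if present, does not encode a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Caci_4170.
length = gene_length_bp(4724009, 4727101)
print(length)                     # 3093
print(protein_length_aa(length))  # 1030
```

Both results match the Gene Length (3093 bp) and Protein Length (1030 aa) fields listed in the record.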
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.134814 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.186117 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGTTC GTGCGGCACT ACGCGAGATG CGCCTCGACG CCGCCCTGCT CGGCTGCCTC GCCGCGATCA TCGCCATCAG CACCCTGATG GTCAGCGCCG GCCCGGCGTT GGTCGCGCGC CTGGACGACC GCTCCCTGCG TCAGAACGTC ACGACGGCGG CACAGCAGGG GCGCGGCGTC AGCGTGACGC TCAACACGAT CGGCCTGGAC ACCAGCGCCC TGGGCACCGG CGCCAACACC TTCTCCGACT TCACCAACCA GCTCCTGAAG CCGCTCCCAC CCTGGGCCCG CCCCATGCTC GGCACCCCAT CAGTCGACAT CGCCACCCCC TTCACCCCCG CCACCGCCCC CGGCATCACC ACCCTCGGCG CCATCCCACC CCAACTCCGC ATCGAGTACA CCAACCCCAT CCCCGGCGGC CTCCACTACA CAGCCGGCAC CCCACCACCC CCCGGCCCCC TCCCACCCGG CACCCCCATC CCCATAGCCA TCTCCACCGC CACCAGCACC GCCCTCCACC TCACCCTCGG CGAAACAATC ACCCTCGGCC AACCCGGTCC GGGATCCGCC GCCACCTCAG CAGCGAGCTC GGCGACCAGC CCCAACAACT CGGCGAGCGC CGCCACCCTC GCTGGTTCGA CAGCGGCCTC AGGCTGGTCT GCCGCTCCTG CCAGCCCGAC CCGCGCTACC TCCGCAAGCT CTGCTGTCGC CTCAGGCCGA CTTGCCGCCC GGACCAGTTC GACTCGCTCA GCCAGCTTGG CTGCTTCGGT CAGTTCGGCT GCCGGCTTGG CGAGTGCCGC CGATTCGGGT GCCACTTCTG GTAGCTCGGA GGGTGTCTCA GCCGCCGCTC CATCACCGCG TGCCGGTGGC TCGGCTGTGT TCGCCACTGC CGCTGGTCCG GCTGCTGCGA CCATTGGTGC TCGCGTTGTC GGCATTTTTG AGCCTGTTGA TACCTCGGCG GCGGCGACTA GCAACTTTTG GTACACGCAT CCGTGGTTGC GGGCGCCGGT GGCGGGGGTG GGGGCGGCGG TTGGGGGGTT TGCGCCGTTG GTTTTGCGGG GTGCTGTGTT GACGAGTGCG GCGGGGGTGG GGGCTGCCTT GCCTGTGGGT GCGGGTGCGG CGGCGACGAC GTTTGTGTTT CCGCTTGCGG CGGGGGCGTT GAGTGCTGAT GGTGCTGGGG CTTTGGCGGT GCAGTTGGGG CGTATGACTG ATCCGGCTTT TGATGATCCG TGTGGGCCGG CGCCGCGGGG GCTGCCGGAG TTGTGTGGGC CGTTCATTGT GACGCGGGGC GGGTTGGTCA TTGCTGCCGA CATCAAGGCG ACGTTGGATC AGTTTGTTGC GGCGCGGGCT GAGGTGTGGG TGGTGGACTC GTTCTCGCTC GCGAGTCTGG CGACCGTGGC GCTGATCACG CTGTTCAGTG CGGCGCGGTT GGCGGTTGGG CGGCGGGAGC GGGATCTTGC GCTGCATCGG GCGCGTGGGG CGACTGTCAG GGATCTGATG ACGGTGCGCG CGGTGCATGG TGCGGTGGTG ACCATTCCTG CGCTGGCCGT TGGGTGGGTG GTCGCGCGGA TGGTGCTCCA CGTGGGGCGG CATCCTGTGC AGAGCAAGGG TCCTAGCGCT GCGTGGCTGC TCGCGGCGAT CGGTATCGCG GGTCTCGTGC TGCTGCCGGC GCTCACTTGG GCTCGGAGTC GTCCGCGGGC TGCCGTTGTC GACCGGGCTG TGGTGCGACG GCGCCGGTTG GCGGTTGAGG CGGGTCTTGC GGTGCTCGTC GTTGCGGCGG TGGCTTCGCT GCATAGCCGC 
GGGATTGACG GGCTGCGCTC GGTGGGCGTG GACCCGCAGA TCAGCCTGGT GCCGGCGCTG CTTGGGGCGG TGGGTGCGGG GGTATTGCTG CGTGTGCATC CTGTGGTAAT TGGTTCCTGC CTGCGGTGGG CACGACGGCG TCGGTCGGCT GTGCCGGTTC TGGCGTTCGC GCAGGCGCGG CGAAGTGTTG GGCTCGGAGC GGTTGGGCTG CTCGTGCTGG TCCTCACCCT TGCCGGGCTG GTGTTCGGTG GGCTGGTTAC CAGGACAGTC ACCGGTGCTC ATGCTGATGT TGCGAAATCG GTCGGTGGTG ACGCGGTCAT CTCTGGCCGC GGTCTGACGC CGAGAGTGCG GAGCGACGTT GCCGGAGTGG CGGGGGTTCG GCAGGTTATA GCTGAACAGG CGCTGTGGGT GACGCCTGAT GTTCCAGCCG GTGCTGCGCC GATCAGGACT ATCGGCGTAG ATGTCAAAGC CCTGATGCAG GCTGATCCCG GCTCGAAGCT GGGGAGGCTG CTGGCGGGTC CCGGGCAGCC GGCGTACGCC TCGGCCGCAG CGATTCAGCA CCGTAGCGCC GCCATGATCC AGGCGGGGTC GACGACGTTT GACGTCAAAG CCGTCGACAC GCTCAGCGCC GACGACCTCC GCGTGATCGG CACCGAGCTC GACGGGCTGG CGCCGGACGC CCCGTACCTC GTCGTCCCGC TGACCGTCGC CGCCAAGCTG ACCGCCGACA CCGATCCCGA CACGCTGGTG ATCGACGGTC CCGGCGTCGC CGCATCCGAC CTGCGCGCCG CGCTCCCGGC GAACGTCGCC TACCAGATCC AAACCCGCAG CGACCTGGCA GGCAGCCTAG ATGCCAGCAC CCTGACAGAC TCGCTCAACC TCGTCGCGAA CTCCTGCGCC GCCCTCGCCT CGGCCTTCGC CCTGCTCGCG GTCGTGCTCG AGCTGCTCGC CGGTGCTCGG GCGCGCGGCG AGGCGGTGTC GTTCCTGCGG ACGATGGGGC TGCGCAGCCG CGCCGCTACC GGGATGCTCA TCGTGCAGCT GCTGCCGCCG GCGTGTCTGG CGGCGCTGGC GGGGGTGGGT CTCGGGGTGC TGATTCCGCC GGTGCTGGGG TCGGCGCTGC GGCTGCAGGC GGTCACCGGC GGTGCGGCGG AGCCGACGGT GCGGGTCGAC TTCGCGACCG CCGCCGCGCT CGGCGCCGCG ATGGTCGCGC TGGTGCTGCT CGCGGCGCTG ATCGACAGCC GGTTGGCGCG GCGGCGCAAG CTCGGATCCG TGCTGCGCTT CGACTCCCGA TAG
|
Protein sequence | MRVRAALREM RLDAALLGCL AAIIAISTLM VSAGPALVAR LDDRSLRQNV TTAAQQGRGV SVTLNTIGLD TSALGTGANT FSDFTNQLLK PLPPWARPML GTPSVDIATP FTPATAPGIT TLGAIPPQLR IEYTNPIPGG LHYTAGTPPP PGPLPPGTPI PIAISTATST ALHLTLGETI TLGQPGPGSA ATSAASSATS PNNSASAATL AGSTAASGWS AAPASPTRAT SASSAVASGR LAARTSSTRS ASLAASVSSA AGLASAADSG ATSGSSEGVS AAAPSPRAGG SAVFATAAGP AAATIGARVV GIFEPVDTSA AATSNFWYTH PWLRAPVAGV GAAVGGFAPL VLRGAVLTSA AGVGAALPVG AGAAATTFVF PLAAGALSAD GAGALAVQLG RMTDPAFDDP CGPAPRGLPE LCGPFIVTRG GLVIAADIKA TLDQFVAARA EVWVVDSFSL ASLATVALIT LFSAARLAVG RRERDLALHR ARGATVRDLM TVRAVHGAVV TIPALAVGWV VARMVLHVGR HPVQSKGPSA AWLLAAIGIA GLVLLPALTW ARSRPRAAVV DRAVVRRRRL AVEAGLAVLV VAAVASLHSR GIDGLRSVGV DPQISLVPAL LGAVGAGVLL RVHPVVIGSC LRWARRRRSA VPVLAFAQAR RSVGLGAVGL LVLVLTLAGL VFGGLVTRTV TGAHADVAKS VGGDAVISGR GLTPRVRSDV AGVAGVRQVI AEQALWVTPD VPAGAAPIRT IGVDVKALMQ ADPGSKLGRL LAGPGQPAYA SAAAIQHRSA AMIQAGSTTF DVKAVDTLSA DDLRVIGTEL DGLAPDAPYL VVPLTVAAKL TADTDPDTLV IDGPGVAASD LRAALPANVA YQIQTRSDLA GSLDASTLTD SLNLVANSCA ALASAFALLA VVLELLAGAR ARGEAVSFLR TMGLRSRAAT GMLIVQLLPP ACLAALAGVG LGVLIPPVLG SALRLQAVTG GAAEPTVRVD FATAAALGAA MVALVLLAAL IDSRLARRRK LGSVLRFDSR
|
| |
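The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence, not the full 3093 bp, so its result is not expected to equal the 70% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence in this record.
first_blocks = "ATGAGGGTTC GTGCGGCACT"
print(clean_sequence(first_blocks))  # ATGAGGGTTCGTGCGGCACT
print(gc_fraction(first_blocks))     # 0.6
```

Passing the full gene sequence to `gc_fraction` in the same way would give the value to compare against the GC content field.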