Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4811 |
Symbol | |
ID | 8336165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5481357 |
End bp | 5484353 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644957911 |
Product | protein of unknown function DUF214 |
Protein accession | YP_003115513 |
Protein GI | 256393949 |
COG category | [V] Defense mechanisms |
COG ID | [COG0577] ABC-type antimicrobial peptide transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.661642 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTCA TGATCTGGAA GGTCGCCGTG CGTATCGCAC GTCGCGAGGC GGTGCGTCAC AAGGGGCGGT CGGCACTGGT GGCGGCCATG TTGGCGTTGC CCGTGGCGGG GGCGAGTGCG GCTGACACGC TGTACCACAC CACCAAGTTG ACCACGGTGG AGCAGGTGCG GCGGGACATC GGGCGGTCCG ATGCGCTGGT GTCGTTTGTG GCGCCGGTGC CGATCGAGCA GATGCCGGAC GGGTCCGTGC CGGTGCCTCC GCAGGCGGCC GGGATGCTCG CGGGGGCGTT GCAGGGCGCG CAGATCGCTT CGCCGGTGGA CGATGGGGCG CTGCCGCCGG GGTCGCGGGT CATCCCGATG CCGGCGCCGC AGGAGTTGCG GATCAGTTCG CCGCAGGGTG CTTATTCGGT GCAGGGCGCT GAGCGGGATC TGGCCAGCCC GCTGCTCGAT GGGCTCTATG TGAAGAAGGC CGGGCGCGCG CCAAGCGCCA AGGGCGAGGT GACGCTGTCC ACCTCGATGC TCTCCGAGCT GAACAAGCAC GTCGGCGACC AGGTGAGTGC CGAGTTTCTC AGTGCCGGGG AGCAGCCCGG GTCTGACGGT CGGCTGCCGG AGCATCAGCT GACCATCGTC GGCGAGTACG ACGACCCCAC GCACCTGGAC GCCGACCGCC TCGTCGCCTA TCCCGGCACG CTGCCCAGCG CGCCGGGCGA TCCAGGTCCG TCGCAGTGGC TCGCGCAGGT GCCCGGCGGG CTCGACTGGG CGCAGGTGCA GAAGCTGAAT CAGAGCGGGT ACTCCGCCGA GTCGCTGCTG GTCGCCGAGA ACCCGCCGCC GGATTCCGCG GTCCCGCTGA TGCGGCACTA CACCGCGGCG TCGGTGGGCA GCAGCCGGCT GACGCAGGCG CTGGTCGCGG TGGGGGCGCT CGGCGTCGTC ATGGCGCTGC TGGAGGTCGT GCTGCTGGCC GGGCCGGCGT TCGCGGTGGG GGCGCGCAAG CGGCGGCGGG ACTTCGGGCT GATGGGTGCC GCCGGCGCCG ATCAGCGGAT GGTGCGGGCG GTGGTGCTGG CCGACGGGCT GGTGCTGGGT GCCGTCGGCG GGGTCGTCGG GGCGCTGGCC GGGATCGGGT TGGGGCGGCT GGCGCTGCCG GAGGCTGTGA GTCTCACACA CCGCGAGCCG GGGGCGTTCC GGATCGCGCC GGGCGATCTG GCCGGGGCGG CGCTGATCGG CGTGGCGACC GGGCTGATCG CGGCGCTGAT CCCGGCCGTC ATCACCGGGC GGCAGTCGGT GTTGCAGGCG CTGACGGGTG GCCGGCGCGG GGCGCGCGGC GTGCCGTGGA AGCTCGCGGT GTTCGGCGTG CTGATCCTCG GCGCCGGGAT GGCCGCGACC GGTTACGCCG TCTACCAGCC CGGGATCCAC ACCGTACCGC TGGTCGCGGG AATCGCCGTG TGCGAACTCG GCGTGGTGGC GTGCACGCCG CTGCTGGTCG CGCTGACCGG CAAACTGGGG CGGATGCTGC CGCTGACCGG GCGGATGGCG CTGCGCGACA GTGCCCGCAA CCGGGGTCGC ACCGCGCCGG CCGTCGCGGC GATCCTGGCC GCCGTCGCCG GCTCGACCGC CGTGGCGACG CTGCTGGCCA CCAACGACGC CCGCGACCGC GCGCAGTATC ACAGCACTCT GCGTCCCGGG CAGGTGGCGC TGATGTTCGG CAGGCAGGCG GGCCAGACGG GCCAGACGGG GCAGGGAGGC CTGAGCGACC AAGGCAGCCA GTTCGCGTTG TCCTCCAGCG ACCCGACCGA CCCGAACAGC CCGAACGGAC CCGCCGGAGC CGCCGGAGCC GCCGGAGCCG CCGGAGCCGC CGGCAAGCCG ATCGACACCG CCAAGGCGAT CGCCGCGATC AGCGCCACGC TGCCGGTCCG CTCCGCCGCG GTGATCCCGA CGCAGGACTA CACCGGCTTC CTGCCGCTGA ACATCGTCGT CAAGCGCACC GTGGCGAACG ACTGCCCCTT CTTCGGCAAC GGAGACGGCG CGATCACCTT CCCCTCGGGC CAGGACAGCG AGTCGATGAT CCAGGCCGAT CCGCGCTGCG TGGCGGCGTT CAGCGGCGGG CAGGTCGTGC CCGGGGACGC CGCGACGCTG CGCGCGCTCA CCGGAACCGT CGATCCGCAG GCTGAGAAGG TGCTCGCCGC GGGCGGCATG GTGGTGTTCA GCCCGTACGA CCTCACCGGC GACGGCGTCT CGACCATCGC CCTGCAGCGC AACTGCCCGC CGTCGGACGA CAGCGTGCCG GACGTCGTCC GCGACCAGTT CGCGCCCTTC TGCAGCGGAC CGGCGCCGAA GCCGCTGACC CTGCCGGCGG CGGTGGCCAA GACCAAGGAC GGCAGTGCGG TCAACGGTGT CAGAGCTCTG ATTCCGGCGT CGGCGGCGGC CGGCTACGGC ATGAAGTACA TGCCGTCGAT GATCCTGTTC GACACCACGC GCATGCCGAC CAAGGCCGAA GAAGAGCGCG CGAACGCCGC CGCCGAGGCT CTGGGCACCA CCGCGCTGCT GAAGGTCGAG CGCGGCTACC AGGGCGGCAA CGACACCACC ATGCTGGCGC TGGCCGCCGT CGCCGCGTTC GTCACGCTCG GCGCCGCGGC GATCTCCACC GGTCTGGCCA TCACCGACGG CCAAGCCGAC CTGGAGACGC TGGCCGCGGT CGGCGCGCGC CCGCGCGTGC GGCGCACCCT GGCCGGCTCG CAGGCCTCGA TCACCGCCGC GATGGGCGCG GTGTTGGGCT CGGCGACCGG TCTGGTCCCG GCGGTGGCGG TGGTCGAGGC ACGGTCGCAC AGCTTCGTGC AGTCGGCGCT GGAGGGCCGG GCGGACCGCG GCGTCCACGC CCAGAGTTAC CTCGCAGTCC CCTGGTGGTT CCTGCTCGGC ACGATCGTGA TCGTTCCGAT GCTAGCCGGG ATCGGTGCGG TGCTGGTGAC GCGCTCGAAG GTAGAGATTC GGCGACGACG GGGGTAG
|
Protein sequence | MDVMIWKVAV RIARREAVRH KGRSALVAAM LALPVAGASA ADTLYHTTKL TTVEQVRRDI GRSDALVSFV APVPIEQMPD GSVPVPPQAA GMLAGALQGA QIASPVDDGA LPPGSRVIPM PAPQELRISS PQGAYSVQGA ERDLASPLLD GLYVKKAGRA PSAKGEVTLS TSMLSELNKH VGDQVSAEFL SAGEQPGSDG RLPEHQLTIV GEYDDPTHLD ADRLVAYPGT LPSAPGDPGP SQWLAQVPGG LDWAQVQKLN QSGYSAESLL VAENPPPDSA VPLMRHYTAA SVGSSRLTQA LVAVGALGVV MALLEVVLLA GPAFAVGARK RRRDFGLMGA AGADQRMVRA VVLADGLVLG AVGGVVGALA GIGLGRLALP EAVSLTHREP GAFRIAPGDL AGAALIGVAT GLIAALIPAV ITGRQSVLQA LTGGRRGARG VPWKLAVFGV LILGAGMAAT GYAVYQPGIH TVPLVAGIAV CELGVVACTP LLVALTGKLG RMLPLTGRMA LRDSARNRGR TAPAVAAILA AVAGSTAVAT LLATNDARDR AQYHSTLRPG QVALMFGRQA GQTGQTGQGG LSDQGSQFAL SSSDPTDPNS PNGPAGAAGA AGAAGAAGKP IDTAKAIAAI SATLPVRSAA VIPTQDYTGF LPLNIVVKRT VANDCPFFGN GDGAITFPSG QDSESMIQAD PRCVAAFSGG QVVPGDAATL RALTGTVDPQ AEKVLAAGGM VVFSPYDLTG DGVSTIALQR NCPPSDDSVP DVVRDQFAPF CSGPAPKPLT LPAAVAKTKD GSAVNGVRAL IPASAAAGYG MKYMPSMILF DTTRMPTKAE EERANAAAEA LGTTALLKVE RGYQGGNDTT MLALAAVAAF VTLGAAAIST GLAITDGQAD LETLAAVGAR PRVRRTLAGS QASITAAMGA VLGSATGLVP AVAVVEARSH SFVQSALEGR ADRGVHAQSY LAVPWWFLLG TIVIVPMLAG IGAVLVTRSK VEIRRRRG
|
| |