Gene Information |
Locus tag | Caci_4158 |
Symbol | |
ID | 8335512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4702263 |
End bp | 4705067 |
Gene Length | 2805 bp |
Protein Length | 934 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644957261 |
Product | protein of unknown function DUF214 |
Protein accession | YP_003114863 |
Protein GI | 256393299 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3127] Predicted ABC-type transport system involved in lysophospholipase L1 biosynthesis, permease component |
TIGRFAM ID | |
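The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(4702263, 4705067)
print(length)                     # 2805
print(protein_length_aa(length))  # 934
```

Under these assumptions the computed values agree with the Gene Length (2805 bp) and Protein Length (934 aa) fields.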
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.434869 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0912085 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCAGCT GGGGACTCGC GCTGCGGGTG GCGCGCCGGG AGGCATGGCG CAACAAGAAG CGCTCGGCGC TGGTCGTGGC GATGCTCGCG CTGCCGGTGG CGGGTGCCTC GGCGGCCGAC ACGCTGTGGC GCAGCTCGCA GATCACGGCC GAGCAGAAGG CCGTGTGGCA GATGGGCAGC TACGACGCGC TCGTCACCGA CGTCAGCGTC CCGATGTACC AGACCCCTGA CATGCGGGGT GCGGAACCGG TGCGCGACGC GAACCAGAAC ACGCTGCCGG AGCTGCGCAC GTCGCCGGAG CTGCGCACGT CGCCGGCCAG CGTGGCGAAC CTGTCCGCGC TGCTGCCCGC CGGATCGCAC ATCGCGCAGC CGGAAACGCT CTACGGCTCG CAGGTGCAGA TCGTCAACGA CGCCGGCCGG GCCTGGGGGC AGCTGCAGGA CGCCGACATC GCCGACCCGA TGCTGGCCGG CACCGTCCAC CGCGTCGCCG GCAGCGCTCC GGCCTCCGCC GAGGAGGTCG CGCTGAGCAG CTCGCTGGCC GGCAAGCTCC ACAAGCACGC CGGCGACACC ATCGCCGTGC GCCCGGTCGG CACCGCCGAC GGCCAAGGCA GCGATGATCC ACTGCCGGCC AAGACCGTCC GCGTGAGCGG GATCTACACC AGCAAAGAAG ACCCGTACGA CGACATCGTC TTCGCCCACC CGGGCGTCTT CGCCACGCAG CAGTCGGCCG GTCAGACCGC CTTCGCGATC GCCGTCCCCG GCGGCGTCGA CTGGAGTCTG ACACAGCGCC TGAACGCACA AGGCTTCACC GTGCAATCCA AGCGCCTCCT GGCCGATCCG CCGCCGCGAT CGCAGGTCCC TTACTACACC GCGTACCCCA GCGGCGTCTT CTCCGACGCC GGCAACAACG GCGCGACGCT CGCCGTGGCG GCGATCGGGC TGTCCATCGT CCTGCTGGAG GTGGTGCTGC TCGCCGGGCC GGCGTTCGCG GTCAGCGCGC GCCGCCGCAG ACGGGACTTC GGTCTGCTCG GCGCGGCCGG GGCGGACGGC CGGCGGTTGC GCCGGATCGT GCTGGCCGAC GGCGTGGTGC TCGGGGCGGC CGGCGGGGTG ATCGGCGCGG TGGTCGGCAT CGCGAGCGCG GCGGCGGTGC TGCCGTGGTT CGGGCACTAC TCGCAGCAGA CCATGGGAGG GTTCCGGCTC AAGCCGCTGG AGCTGTTGGC TGCCGCGGCG GTCGGTGTCG GCACAGGGCT CGCGGCGGCG GTGGCTCCGG CGATCGCGAC CGCGCGCCAG GACGTCCTGG TCGCGCTGAC CGGACGGCGC GGGCAGTCCT CGACGCCATG GAAGCTGCCG CTGGTCGGGC TGATCGGCGT CGTGCTCGGC ACGGCCCTGA TCCTGTTCGG CGCCTGGCAT CACGGCAACG CGGTAATGGT CGCAGCGGGG GTGTTCCTGG CCGAGCTGGG TCTGGTGGCG TGCACGCCGG TCCTGGTGGC GTGGGCCGGC AAGCTGGCTC GCGCGCTGCC GCTGACCGGA CGGCTGGCAC TGCGCGACAG CGCCCGCAAC CGCGGCCGGA CCGCTCCGGC GGTGGCGGCG ATCATGGCGG CGGTGGCGGG GGCGACGACG GTCGCCATGG TCATCAGCTC CGACGACGCG CAGCAGCGGC ACTACTACAC CAGCCAGCTG CGCATGGGTC AGGCGACGGC CTCACTCGGT GACGGCAACG GTGTCACGCC GGCGGTGGCG AACCGCCTGG CCACGCAGAT CAACGCGGTA CTGCCGTCGA CGCAGTCCGC GGTGTTGCAG 
GGCGTGGGCT ACGGCGGCGG CGACACCTCG CCGCCCTCGC CGACGGTGGC CCGCCTGCCG GAGAACGCTT GCCCGCAGGC GCTGAACGGC GCGGCGATGG CCAAGGACGC ACGCTGCGGC CCCAGCCTCA GCGGCGTGCT CGAGTTCTTC ACCGGCAGCT CCACCGTGGT CGCCGGCGGT CCCGATCTGG TGCGGCTCCT GCTGGGCCAC ACGGACCAGG CTGCCGAGGA CACGCTGAAG GCCGGCGGCG CCGTGGTGTT CAACAAGTAC GACCTCGCCA CCGCCGGGGC GAAGCCGACG GTGGCGTTCA CCTTCAACTC CGGGTGCGAT CAGCAGCAGA CGCAGTGTCC TGTGAGTACC GCGACGGCGA CGCTGCCGGC GGCGTTCGTC AACAGCCCGC GCGCGGACGT CTCGGCCATC GTCTCCCCGG AGGCGTTCGC CGGCTTCGGG GTGAAGTACG CACCGATGGC CCTTCTGTTC GACGACTCGC GGATGCCGAC CAGCAAGGAG GAGCAGAAGG CCGACGATCT GGTGGGAGCA GCGGGAATCC AGCAGCGTTT CTACGTCGAG CGCGGCTATC AGAGTCAGAC CTGGGTCGGC GTGCTCGCCC TGGCGGCGGT CGCCGGGATC GTGATGCTCG GGGCGGCGGC GGTGGCCACC GGGTTGGCGA TCACCGACGC GCAAGCCGAC CTGGAGACGC TGGCGGCGGT CGGCGCACGT CCGCGGGTCC GGCGCCTGCT GGCCGGTTCG CAGGCCGCGG TCACCGCCGG GCTCGGCTCG GTGTTGGGAG TGGCGTTCGG GTTGCTGCCG GCGGTGGCCA TCATCGAGTC GCAGGCACAG CAGGTCGCCG CGAACCCAGA GAACCTGCTG AACGCGCAGC AGACACAGTT CGCGCCGCCC TGGCTGTACC TCGGCGTGGT GGTCGTGGCG CTGCCGCTGC TGGCCGCCGC CGGGGCGGCC GGGTTCACTC GCTCGCGGAT CGAGATGCGC AGGCGGCGCG GCTGA
|
Protein sequence | MSSWGLALRV ARREAWRNKK RSALVVAMLA LPVAGASAAD TLWRSSQITA EQKAVWQMGS YDALVTDVSV PMYQTPDMRG AEPVRDANQN TLPELRTSPE LRTSPASVAN LSALLPAGSH IAQPETLYGS QVQIVNDAGR AWGQLQDADI ADPMLAGTVH RVAGSAPASA EEVALSSSLA GKLHKHAGDT IAVRPVGTAD GQGSDDPLPA KTVRVSGIYT SKEDPYDDIV FAHPGVFATQ QSAGQTAFAI AVPGGVDWSL TQRLNAQGFT VQSKRLLADP PPRSQVPYYT AYPSGVFSDA GNNGATLAVA AIGLSIVLLE VVLLAGPAFA VSARRRRRDF GLLGAAGADG RRLRRIVLAD GVVLGAAGGV IGAVVGIASA AAVLPWFGHY SQQTMGGFRL KPLELLAAAA VGVGTGLAAA VAPAIATARQ DVLVALTGRR GQSSTPWKLP LVGLIGVVLG TALILFGAWH HGNAVMVAAG VFLAELGLVA CTPVLVAWAG KLARALPLTG RLALRDSARN RGRTAPAVAA IMAAVAGATT VAMVISSDDA QQRHYYTSQL RMGQATASLG DGNGVTPAVA NRLATQINAV LPSTQSAVLQ GVGYGGGDTS PPSPTVARLP ENACPQALNG AAMAKDARCG PSLSGVLEFF TGSSTVVAGG PDLVRLLLGH TDQAAEDTLK AGGAVVFNKY DLATAGAKPT VAFTFNSGCD QQQTQCPVST ATATLPAAFV NSPRADVSAI VSPEAFAGFG VKYAPMALLF DDSRMPTSKE EQKADDLVGA AGIQQRFYVE RGYQSQTWVG VLALAAVAGI VMLGAAAVAT GLAITDAQAD LETLAAVGAR PRVRRLLAGS QAAVTAGLGS VLGVAFGLLP AVAIIESQAQ QVAANPENLL NAQQTQFAPP WLYLGVVVVA LPLLAAAGAA GFTRSRIEMR RRRG
|
| |