Gene Caci_3519 details


Gene Information

Locus tag: Caci_3519
Symbol:
ID: 8334872
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3925688
End bp: 3930214
Gene Length: 4527 bp
Protein Length: 1508 aa
Translation table: 11
GC content: 69%
IMG OID: 644956663
Product: YD repeat protein
Protein accession: YP_003114266
Protein GI: 256392702
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
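
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive (which is consistent with the stated gene length) and that the last codon of the unspliced CDS is a stop codon. The variable names are made up for this example and are not database field identifiers.

```python
# Values copied from the gene record above.
start_bp = 3925688
end_bp = 3930214
gene_length_bp = 4527
protein_length_aa = 1508

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one terminal stop codon.
expected_protein_length = span // 3 - 1

print(span, span % 3, expected_protein_length)  # prints: 4527 0 1508
```

Under these assumptions the span matches the reported gene length, is a whole number of codons, and gives the reported protein length.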


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0534715
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.146276
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACGTC CAACGGGGTG GGATGTTCTG GGGTTGGATG GGGATCCGAC ACCTGGTGTG 
GTGGAATCGG TGCAGGCGTT GGCGAAGGAG TTCGGGGACT TCGCCCACGA TGTGGAAGCG
GCGTATCACA GCCTGAACTC GTTCGGTTCG GATGCGACGG CGCTGACGTG GGTGGGTCAG
ACTGCTGATG CGTTCAAGGG CAAATACGGT CCGCTGCCCG GTCGTCTACA GAAGCTGTAC
ACCTCGTACA GCGAAGCCTC TGATGCGTTG TCGGCGTATG CGCCGCTGCT TCAGGCCGCA
CAGACCAAAG CCGACGCCGC CTTGCGTCAA GCTCAAGACG CGCATGCTGA TCTGCAGCGT
GCCACCACCA ACGCCAACAC CGCAGCGACG GATCTGAAGA CCGCGCAGCA GAACCACGCC
ACAACCCCTA ACCCCCAAGC GGTCACCGAC GCGCAGACCG CGCATGACAC CGCACAGACC
AACCTGAACA ACGCCAAGGC GCATATGGCG TCGTTGACCA AGCAAGCCAA CGACGCCTAC
AACGACCGCA TCAACGCCGC CAAAACCTGT GGCAGTGCCC TGCATCACGC CCAATCCGAC
GGCATCCACA ACAAGTCCTG GTGGCAGCAC GTCGGCGAGG ACCTGTCCAA GTGGGGCGGC
GAGATCGGCA AAATCGCCGG CGAGCTCGCC CCGATCCTGG ACATCATCGC CTTGGCGACC
TCCTGGATCC CCGGTGTCGA CGTCGTCACG GCAGCCATCG CCGAAGCCGA CAACATCATC
GCCATCGCCG GCACGGCCGT CGGCGCTATC GGGGACGCCA TGCAAGGGCA CTGGGGCGAT
GCGCTGCTCG GCGCCGGCAT GGTCGCCCTG ACCTTCGTCG GCGGCCGAGC CCTGGGTTCG
GGAGCCGAAG ACCTCGAAGG CGAAGCCGGT GCCCTGGAAG GGGAAGCCGG CGCTCTCGAA
GGCGAGGGCG GCGCGCTCGG CGACGCGGCG GAGGGCGGCG GAGCCGATGT ACCGACTGGG
GAGGGCGAAT CGCCCGGCGG GGTCGGCGAC GACGCGCTCG AAGGCGACGG CGTCAGCAAC
GCCAACACCG GCGGCGACCC CGTGGACCTG GTCTCGGGCC AGATGGTCGC GCTGGAGACG
GATCTGGAAC TGCCCGGCGT GCTGCCGATC GTGCTGCGCC GCGCCTACGC CTCCGGCTAC
CGCACCGGAC GCCTGTTCGG TCCCGGCTGG TCCTCGACCC TCGACCAGCG GCTGTCGGTC
AACGCCGCGG GCATCCACCT CGCCGGCGAC GACGGTCAGA CGTTGCACTA CCCCCTGCCG
GCCGGTGAGG AGGAAGTGCT TCCGTCCCGA GGCGCGCGCT GGCCGCTCAT CTGGGACCGC
GCCGCGGACG AGATCCGTGT CACCGATCCT CGCAGCGGCC TGATACGGCA CTTCCCGGTC
GTGCACTTCG GGAGCCCCGA CGGCCAGATC CGCGACGTCG TCCGCATCAG CGATCGCAAC
GCCAACCGGA TCGACATCGT GCGCGACCCG GAAGGGACGC CGACCGGTGT CGAGCACAAC
GGCTACCGGC TCGCCATCGA CGCCGTGGCG ACCACGAACG GTCCGCGCGT CGCCGCGATC
CGCCTCCTGG ACGGCAGCGA GCAAGGCGTC GTCGTCAAGC GGTACGAGTA CGACGAGCAC
GGTCGGCTGA CCGGAGTCGT CAACTCCTCC GGGATGCCGT ACACGTATCA GTGGGATGAT
TTCCACCGGA TCACGGCGTG GGTCGATCGT TCCGGCTTCC GTTACTCCTA CGAGTACGAC
ACCAACGGCC GAGTGGTGCG CGGCGTCGGC GCAGACGGCT ATCTGTCGGG CGATTTCGCC
TACGATCCGG CTGCCCGAAC AACGGTCTTC ACGGACTCGC TCAACCGCGC CAAGACCTAT
CACTACGACC TGAATGGCCA CCTGGATCGG GCCGTCGAGG CGTCCGGCGC CGCTCTGGTC
TTCCTCGACG ACGCCTACGG CCGCCGGCTG CGCGGCACCG ACCGGCGCGG GATGACGACG
GTCTTCGACC GCGACGCGGC CGGGAACACG ATCCGCGTCC AGCGTCCGGA CGGCACGACG
CTCAGCGCGG AATTCAATGC CCTGAATTCC CCCACCTCGG TGGTCCAGGC CGACGGCACG
TGCTGGCGGC AGGAGCACGA CGCCGCCGGG AACGTCATCG TGCAGCTCGA TCCGCTCGAC
GCGCGCACCG AGTTCGGCTA CGACAGCCTG GGCGCGCTGG TGTCGATCAC CGATGACGCC
GGCGACCGCA CCGCCTACGA GGTCGACGCG GCGGGACAGA CTGTCTCGGT GACCGGGCCC
GACGGCTCGG TGCACCGGAT CGAGCGCGAC GCGTTCGGCC GGGTCGTCGC CGTGACGGCG
CCGGACGGCG GCGTGACGCG GTATGCATGG AGCGTCGAGG GCGGGCTGCG CTCCCGTGTC
AGGCCCGACG GTTCGCGGGA GGCGTGGGAG ACCGACGCCG ACGGGAACCT CGTGGCGTAT
ACCGACCAGC TCGGCGCCGT CACGCGCTAC GAAGTCGGTC CGTTCGGCAA GGTGGCGGCG
CGGATCGAGG CCGACGCCAC CCGGTTCGAG TACCGGTACG ACACCGAGAT GCAGGTGGTG
TCGGTCACCG GTCCCGGGCA GGTCCAGTGG TCCTACACCT ACGACCAGAG CGGGAATCTG
GTCCAGGAGA CGGATTACAA CGACCGCAGC CTCGCCTACT CCTATGACGC GGACGGCTGG
CTCGCCGAGT CCGTCGACGC GCTCGGTCAG CGGATGGTCT TCCAGCGCGA CCCGTTGGGC
CGGGTCGTCG GGCGGCAGAC CGTGGACGCC GAATACACCT ACGGCTACGA CGCGAACGGC
CGGATGCTGT CCGCCTCCGG CGCCGGTTCG CGGCTCGAAT TCGCCTACGA CGCCGTCGGC
AGGGTGGTGG CGGAGACGAT CGACGGCCGT ACGACACGCT CCGGATACGA CGCCCTGGGA
CGGCGCGTCG CCCGCGTCAC TCCGGCCGGC GCCGAGTCGT CGTGGCGGTA CGACGCCACG
GGTCGCGCCG TGGTCTTCGC CGCGGGTGAC GTCGAGCAGC ACATCGGGTA CGACCTCGGC
GGTCGGGAAG TCAGCCGGGC GCTGGGGATC GGGGGCGCGG TGTTGTCCCG CGCCTTCGAC
GCCGCCGGTC GGATCGCCAC GGAGCGGATC GCCACCGAAC GGATCGCCAC CGAACGGATC
GATCCTTCTC TGAGCCGCTC CTACGCCTAT CACGCCGACG GCGTCCCGGC CGCGGTGACC
GATTCCCTGC GCGGGACGCA GGAGTACATC GCCGACCCGC GAGGAAGGAT CTCCGAGGTC
CGGGGTCCGC ACGGCGGCGG CGAGTCGTAC TCCTACGACG GATTCGGCAA CATCGTGCAG
GCCGCCAGTG CGCTGTCGAC CGACAGCATC GGCGGTCGCG ACGTACGCGG CACGCGTGTG
CTTCGCGCGG GGCACGTCCA CTACGACTAC GACGCGGCCG GGCGGGTCGT CCGGATCCGC
AAGCAGACGC TGTCCGGCCA GGTCCGCATC CAGACCTTCG CCTGGGACGC CGACGGCCGG
CTCAAGCAAG CCGTCCTGCC CGACGGCTCG GCGTGGCACT ACCGCTACGA TCCGCTCGGC
CGGCGCCTCG CCAAGCAGCA CGTCCTGGCC GACGGCACCG AGGCCGAGCG TGTCGAGTTC
GCCTGGGACG GACCGCGCCT GATCGAGCAG CACAGCCTCG ACGCCGCGGG CCGGACGACC
ACGGTGACGT GGGACTACGA CCCGGACACC GGACTGCCGG TGGCGCAGCG GCGGCGGTCG
TGGGCGGCGG AGGCCGCGCA GGAGCGGGTC GATGAGATGT TCCACGCGAT CGTCACCAAC
TTGGTCGGCA TGCCCACCGA GCTCGTCACC CCCGACGGCC GGGTCGCGTG GTATCAGAAC
ACCGATTTGT ACGGGCAGTC CGTCGCCGTG GCGACCGGCG GCGACCCAGA CCTCGAATGC
CCGCTGCGCT TCGCCGGGCA GTACTTCGAC GCCGAGACCG GCCTGCACTA CAACGTGCAG
CGGTACTACG ACCCGGCGAT CGCGGCGTAT CTGACCCCTG ACCCGCTGGG CTTGGCGCCG
GCGCTCAACG ACCACGCCTA CGTCCCGAAC CCGCTGACCA TGGTCGACCC GCTCGGTCTG
GCCTCCGGCG CGCGCGTCCC GTCGCCGCCG TTCTCCGCCG GCGGGTACAG CACGCCCGGC
AACAACCAGG AACTCAACGT CGACGACATG CTCAAGGCCG GCGAGGACTG GCTCGGCCCG
GGCTACGGCG AGCCGAAGGC GGGCAGCGGA CGCTTCGTGT CGTCGGACGG CAGCCGCGTC
TTCCGCATGG GCGACAGCGA TATCCTCGGC AAACACGGCG GCGGCCCGCA CTGCAACTTC
GAATACATGG AACCGGACCC CAAGACCGGC AGGATGTCGG TCACCCAGAA CGACCACGTC
TACTTCACAA ACGGATGCAT CCAATGA
 
Protein sequence
MARPTGWDVL GLDGDPTPGV VESVQALAKE FGDFAHDVEA AYHSLNSFGS DATALTWVGQ 
TADAFKGKYG PLPGRLQKLY TSYSEASDAL SAYAPLLQAA QTKADAALRQ AQDAHADLQR
ATTNANTAAT DLKTAQQNHA TTPNPQAVTD AQTAHDTAQT NLNNAKAHMA SLTKQANDAY
NDRINAAKTC GSALHHAQSD GIHNKSWWQH VGEDLSKWGG EIGKIAGELA PILDIIALAT
SWIPGVDVVT AAIAEADNII AIAGTAVGAI GDAMQGHWGD ALLGAGMVAL TFVGGRALGS
GAEDLEGEAG ALEGEAGALE GEGGALGDAA EGGGADVPTG EGESPGGVGD DALEGDGVSN
ANTGGDPVDL VSGQMVALET DLELPGVLPI VLRRAYASGY RTGRLFGPGW SSTLDQRLSV
NAAGIHLAGD DGQTLHYPLP AGEEEVLPSR GARWPLIWDR AADEIRVTDP RSGLIRHFPV
VHFGSPDGQI RDVVRISDRN ANRIDIVRDP EGTPTGVEHN GYRLAIDAVA TTNGPRVAAI
RLLDGSEQGV VVKRYEYDEH GRLTGVVNSS GMPYTYQWDD FHRITAWVDR SGFRYSYEYD
TNGRVVRGVG ADGYLSGDFA YDPAARTTVF TDSLNRAKTY HYDLNGHLDR AVEASGAALV
FLDDAYGRRL RGTDRRGMTT VFDRDAAGNT IRVQRPDGTT LSAEFNALNS PTSVVQADGT
CWRQEHDAAG NVIVQLDPLD ARTEFGYDSL GALVSITDDA GDRTAYEVDA AGQTVSVTGP
DGSVHRIERD AFGRVVAVTA PDGGVTRYAW SVEGGLRSRV RPDGSREAWE TDADGNLVAY
TDQLGAVTRY EVGPFGKVAA RIEADATRFE YRYDTEMQVV SVTGPGQVQW SYTYDQSGNL
VQETDYNDRS LAYSYDADGW LAESVDALGQ RMVFQRDPLG RVVGRQTVDA EYTYGYDANG
RMLSASGAGS RLEFAYDAVG RVVAETIDGR TTRSGYDALG RRVARVTPAG AESSWRYDAT
GRAVVFAAGD VEQHIGYDLG GREVSRALGI GGAVLSRAFD AAGRIATERI ATERIATERI
DPSLSRSYAY HADGVPAAVT DSLRGTQEYI ADPRGRISEV RGPHGGGESY SYDGFGNIVQ
AASALSTDSI GGRDVRGTRV LRAGHVHYDY DAAGRVVRIR KQTLSGQVRI QTFAWDADGR
LKQAVLPDGS AWHYRYDPLG RRLAKQHVLA DGTEAERVEF AWDGPRLIEQ HSLDAAGRTT
TVTWDYDPDT GLPVAQRRRS WAAEAAQERV DEMFHAIVTN LVGMPTELVT PDGRVAWYQN
TDLYGQSVAV ATGGDPDLEC PLRFAGQYFD AETGLHYNVQ RYYDPAIAAY LTPDPLGLAP
ALNDHAYVPN PLTMVDPLGL ASGARVPSPP FSAGGYSTPG NNQELNVDDM LKAGEDWLGP
GYGEPKAGSG RFVSSDGSRV FRMGDSDILG KHGGGPHCNF EYMEPDPKTG RMSVTQNDHV
YFTNGCIQ
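
The relationship between the two sequences above can be spot-checked by translating part of the gene sequence. The following is a minimal sketch using only the Python standard library, not the method used to produce this record. The codon table is written out by hand in the usual NCBI codon order, and for sense codons translation table 11 assigns the same amino acids as the standard code. The helper name `translate` is hypothetical. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# For table 11 these assignments are the same as in the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence above; both start on a codon boundary.
first_line = "ATGGCACGTC CAACGGGGTG GGATGTTCTG GGGTTGGATG GGGATCCGAC ACCTGGTGTG"
last_line = "TACTTCACAA ACGGATGCAT CCAATGA"

print(translate(first_line))  # MARPTGWDVLGLDGDPTPGV
print(translate(last_line))   # YFTNGCIQ
```

The two results agree with the first 20 and the last 8 residues of the protein sequence listed above, and the final codon of the gene (TGA) is a stop codon.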