Gene Caci_0389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_0389 
Symbol 
ID8331716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp431849 
End bp435232 
Gene Length3384 bp 
Protein Length1127 aa 
Translation table11 
GC content68% 
IMG OID644953555 
ProductWGR domain-containing protein 
Protein accessionYP_003111182 
Protein GI256389618 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.489109 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.333615 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCG CCACCAACGC ACCCACCGCA GTGTCGGAAG ACACGTGCCA GATCCCCGGG 
GCGATGCGCC GCCTCGTGAT TCCCCGCCGG GGCGATGTTC CGAGCGAACC GGTCGCCGTC
GATCCGAAAG CCGCCGCGTA TTACGCCGCA TGGTTTGACG AGCAGAGCCC GATCGTCGCC
GGGATCGTCG ACCACGGACA CAGCGAAGCC GATCTCGTCG AGGAGTACCG AGCTTCCAAG
GGCGACCTCG TCATAGCCAC GCCGCTTGCC GCAGCGGTCG CGGCGGCGAT GACGATCTCG
CAGAGCACGC CGGCGTACGA GAACCCCGCC GGCATCGTCG ATGCCTGGAT CGCCACGCGC
GGTCTCGCAT TCGCCGCGGC CGCCGCTGTC GAGCTGTTCG GCGTGAAGAC CTCGGGCCGC
ACCTCCAAGG GAGAGCAAGC TTCAGCGTGG CTCTGCCGCA AGGACGGCGC CTATGAAGAC
ATACCTCATG GAATCGCCGT CCGCGTGCGC CAGCACCTGG CATCGGCGGG GGAGCAGGAC
TACGCGGAGG TCGCCGATGT CCTTGCGCGC TACCGGCTAT CGCTGGCGTC CCCGACGACC
TCCGGTCAAG TGCTGTCGCG CATGCTCACG TCCTTCCTGA CGCCGGAGAA CGTCGACTGG
GTCGAGAGCG ACTTCGCGTT CGTCTCGCGG GCGCATTACT CCGCGTGGCT CGCGCTCGCC
TCGATCACGA CCGAGGAGCA AGTAGCCGCC TTGCCGGCGC AGAAGATCTC GGCGTGGGGC
ATCGCCCGAG AGCTCAGTGC GCTGTGGACC GCGCTCGACA ACGTCGGCGT CCATCTCCTC
CCGGTCCTGA CCGGATGGCT CGCTCCCGAG TCGGACCCGG AAAGCGTTCA GCGCCTGCTG
TCCATCGTCG CGGGCATCGC GTCCGCAGAC GCCTTCCGCT ACCTCGTCGA CAACATCGAC
AAGAAGTACT TCACGGTTGC GACGCAGAAC GCTGCCACCA CCTTTCCCCG GCGCGCTCTC
CAGATCCTCG GCGACGCCGC GCTCCGCGAC ACTCCGCGCG GGCTGGCTGC CGCCAACGTG
CTGCGCTTGC ACATCGTCGC GCACGACGAA CTCGTGCAGG ACGAGCTGGC GAACCTCACT
CCGGCGGTAC GGGATCACGT CGAGAAGATC CGCGCCGCCA ACGTCCCCGT GCCCGACGCC
GAGACAGACA GCCTGCCGAC GCTGTTCGTC AGCCCGCCGT GGCTGACCGC CGTGAAGCCG
GCCAGGCCGG TCGTCATCAC CGGTCTGGAG GCGCCGACCG AGACGGTCTT GGCGTGGCGC
GACGGCGAAA AAGAGCAGTG GGCTTCCGCC GAGCTCATCA CGAATCCCCA GTGGAAACTC
GAAAGCTGGA AGGACATCGC CGACCGCATC AGCACCGGTG GCTCGGCCGG CTGGTACGCG
AACGGCCACT TCGCCGTCGA GGCTCCCGAG GATCTGGTCC GGTCGGTTCT CACCAGCTGG
GAGCCCGACT CCTGGCAGTC CGACACCTGG ATCCCAACGC TGGTAATGCG TTTCGGCGCC
GACGCGCTGC CGCCGATCCT GAGCTTGGCA CGCAGCGTAC CGGCGTCGGG TGCCGTTTTC
CTCGCCCCGT TCGAGGCTCC CGAGGTCGCC CTCCTCGCGG CCGACTGGCT CAGCCGGCTG
AAGACCGCGC GACCGTTCGC GCTGGCTTGG CTGACGCGGC ATCCCGGCTT CGCAGCCAGG
ACCCTGATAC CGGCGGCGCT CGGCAAGGCG GGCAAGGCGC GCAGTGCCGC CGAGCTGACC
ATCCGCACAC TGGCCGCACG CGGATTCACT GCCGAGATCA CCGAAGCCGC CGCAGGGTAC
GGCGACGCCG TGGCGCAGGC GGTCGCCGCC ATCGTCGCCG ACGACGGCAC GCTCACGCTG
CCCAAGACGA TGCCCGCCGT CCCGGACTGG GCCGAAACCC GGCTGCTGCC GCAGATCCTG
CTCAAGGGAC GGCAGACCGC GTTGCCCGAG CAGTCTGTGA AGCACCTGCT CCTGATGCTC
GCGGTGTCCA AAGGTACCGA GCCCTACGCC GGAATCCAGA TGGCGAAGGG CATCTGCGAC
CCGGCCTCGC TCGCCACGTT CTCGTGGGCG TTGTTCGAGA ACTGGCGCGG TGTCGACTAC
CCGGCGAAGG AGAGCTGGGC GTTCGACGCG CTGCGGTGGT TCGGCGACGA CGAGACGGTG
CGGCGGCTGT CCCCGATGAT CCGCCTGTGG CCCGGCGAGA ACGGGCATCA GCGCGCGGTC
GCCGGTCTGG ACGTACTGGC CGACATCGGC GGAACCGTGG CGCTGATGCA CCTCTACGGC
ATCTCGCAGA AGGTCAAGTT CAAGGGGTTG AAGGAGCAGG CGACGCAGCG CGTCACGGAG
ATCGCCGACG ACCTCGGGCT CACCGCCGAG CAGTTGGGGG ACCGTCTCGT CCCGGACCTC
GGGTTGGCGT CCTCCGGGAC GCTGTCCCTG GACTACGGTC CACGGTCGTT CACTGTCGGA
TTCGACGAAC AGCTGAAGCC GTATGTCGCC GATCAGAGCG GCAAGCGTCT CAAGGCTCTG
CCGAAGCCGG GAGCGAAGGA CGATCAGGAG CTCGCACCCG CTGCCCACCA GCAGTTCTCC
GCGCTGAAGA AGGACGTGCG GACGCTCGCC GCGAGCCAGA TCGCGCGCTT CGAGCTCGCC
ATGGTGACGC AGCGCCGCTG GACGTCCCAG GAGTTCGGCG AGTACTTCGT CGGGCATCCC
TTGCTGCGAC ACCTGGTCCG GCGCCTGGTC TGGGTGACCT TCGTCGATGA GAAGGTAGGC
AGCGCCTTCC GGGTCGCCGA AGACCTGAGT CTCGCCGACA TCGCAGATGA CGAATTCACC
CTCGCCGACG ACGCGGTGAT CGGCGTCGCG CATCCGCTGC ACATCGGCGG GGACGTCGCC
GCGTGGTCGG ACGTGTTCGC CGACTACGAG ATCCTTCAGC CGTTCGCGCA ATTGGGACGC
ACTGTGTTCG CATTCACCGA CGCGGAGAAG GCTTCGACGC GCCTGACCCG GTTCGGCGGC
ATCGAGACTC CCGTCGGCAA AGTCCTGGGA CTCGAACGGC GCGGCTGGCG CCGGGGTGCT
CCGCAGGACG CCGGCATCCA GGGCTGGATA TCACGCGCGC TCCTCGGCGG GGGTTCGGTC
ACCGCGACCC TCGACCCCGG CATCACCGTC GACTACGTCG CCGAGTGGGG CGAGACGCAG
CGTCTCCAGG AGGTCTTCAT CAGCCGCCAC GCCGACGGCG AGAGCTACTG GAACGCCTCG
GCGAAGTTCC GCGAGCTCGG CAGCCTAGAC GAGATCACGG CCTCCGAACT CCTCCGCGAC
CTGACCGAGG CGACGACCCA GTGA
 
Protein sequence
MSGATNAPTA VSEDTCQIPG AMRRLVIPRR GDVPSEPVAV DPKAAAYYAA WFDEQSPIVA 
GIVDHGHSEA DLVEEYRASK GDLVIATPLA AAVAAAMTIS QSTPAYENPA GIVDAWIATR
GLAFAAAAAV ELFGVKTSGR TSKGEQASAW LCRKDGAYED IPHGIAVRVR QHLASAGEQD
YAEVADVLAR YRLSLASPTT SGQVLSRMLT SFLTPENVDW VESDFAFVSR AHYSAWLALA
SITTEEQVAA LPAQKISAWG IARELSALWT ALDNVGVHLL PVLTGWLAPE SDPESVQRLL
SIVAGIASAD AFRYLVDNID KKYFTVATQN AATTFPRRAL QILGDAALRD TPRGLAAANV
LRLHIVAHDE LVQDELANLT PAVRDHVEKI RAANVPVPDA ETDSLPTLFV SPPWLTAVKP
ARPVVITGLE APTETVLAWR DGEKEQWASA ELITNPQWKL ESWKDIADRI STGGSAGWYA
NGHFAVEAPE DLVRSVLTSW EPDSWQSDTW IPTLVMRFGA DALPPILSLA RSVPASGAVF
LAPFEAPEVA LLAADWLSRL KTARPFALAW LTRHPGFAAR TLIPAALGKA GKARSAAELT
IRTLAARGFT AEITEAAAGY GDAVAQAVAA IVADDGTLTL PKTMPAVPDW AETRLLPQIL
LKGRQTALPE QSVKHLLLML AVSKGTEPYA GIQMAKGICD PASLATFSWA LFENWRGVDY
PAKESWAFDA LRWFGDDETV RRLSPMIRLW PGENGHQRAV AGLDVLADIG GTVALMHLYG
ISQKVKFKGL KEQATQRVTE IADDLGLTAE QLGDRLVPDL GLASSGTLSL DYGPRSFTVG
FDEQLKPYVA DQSGKRLKAL PKPGAKDDQE LAPAAHQQFS ALKKDVRTLA ASQIARFELA
MVTQRRWTSQ EFGEYFVGHP LLRHLVRRLV WVTFVDEKVG SAFRVAEDLS LADIADDEFT
LADDAVIGVA HPLHIGGDVA AWSDVFADYE ILQPFAQLGR TVFAFTDAEK ASTRLTRFGG
IETPVGKVLG LERRGWRRGA PQDAGIQGWI SRALLGGGSV TATLDPGITV DYVAEWGETQ
RLQEVFISRH ADGESYWNAS AKFRELGSLD EITASELLRD LTEATTQ